Last modified: 2014-11-20 15:19:15 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T52832, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 50832 - Results of Disambiguator extension are not complete
Results of Disambiguator extension are not complete
Status: ASSIGNED
Product: MediaWiki extensions
Classification: Unclassified
Disambiguator (Other open bugs)
unspecified
All All
: Low normal (vote)
: ---
Assigned To: Ryan Kaldari
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-07-05 20:04 UTC by taste1at
Modified: 2014-11-20 15:19 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description taste1at 2013-07-05 20:04:22 UTC
On the english wikipedia, the pages Special:DisambiguationPages and Special:DisambiguationPageLinks are not complete (they only give 1000 results!)

I want especially point to:

http://en.wikipedia.org/w/index.php?title=Special:PagesWithProp/disambiguation

gives exactly the same information as

http://en.wikipedia.org/w/index.php?title=Special:DisambiguationPages

But Special:DisambiguationPages does only give 1000 results and is only cached, while Special:PagesWithProp/disambiguation gives all results and is not cached. This is somehow pointless.
Comment 1 Ryan Kaldari 2013-07-08 20:52:37 UTC
This isn't actually specific to Disambiguator. It's defined by the wgQueryCacheLimit config var. For example, the Redirects list is also limited to 1000 results:
http://en.wikipedia.org/wiki/Special:ListRedirects

This config var is set to 1000 for enwiki, 2000 for dewiki, and 5000 for all other wikis. So any QueryPages that use caching (i.e. are marked as expensive on a wiki using MiserMode) are limited to X number of results as defined by wgQueryCacheLimit.

Unfortunately, this also limits the API results. I'll file a bug about increasing the limit for enwiki.
Comment 2 Ryan Kaldari 2013-07-08 21:01:33 UTC
The other option would be to mark Special:DisambiguationPages as not being expensive. Since it joins the page and page_props tables I imagine it is at least slightly expensive, but it may be worth doing some profiling on it.
Comment 3 Gerrit Notification Bot 2013-07-10 18:56:30 UTC
Change 73008 had a related patch set uploaded by Kaldari:
Optimizing Special:DisambiguationPages query to avoid filesort

https://gerrit.wikimedia.org/r/73008
Comment 4 Gerrit Notification Bot 2013-07-10 20:24:19 UTC
Change 73008 merged by Anomie:
Optimizing Special:DisambiguationPages query to avoid filesort

https://gerrit.wikimedia.org/r/73008
Comment 5 Andre Klapper 2014-11-20 15:19:15 UTC
Ryan Kaldari: This issue has been assigned to you a while ago. 
Could you please provide a status update and inform us whether you are still working (or still plan to work) on this issue? 
Only in case you do not plan to work on this issue anymore, should the assignee be set back to default? Thanks.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links