Last modified: 2014-05-06 14:35:13 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T60972, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 58972 - CirrusSearch: Monitor the different between the number of pages in the index and the number of pages in the Special:Statistics
CirrusSearch: Monitor the different between the number of pages in the index ...
Status: ASSIGNED
Product: MediaWiki extensions
Classification: Unclassified
CirrusSearch (Other open bugs)
unspecified
All All
: Lowest minor (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-12-26 16:12 UTC by Nik Everett
Modified: 2014-05-06 14:35 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Nik Everett 2013-12-26 16:12:19 UTC
Monitor the different between the number of pages in the index and the number of pages in the Special:Statistics.  This came up because foundationwiki had no pages in its content index for quite some time without us knowing it.  We should know about this.

I don't know what the ration between pages in Special:Statistics and the search index ought to be, nor if it would be better to check against some other count.  The problem is that those counts are expensive to execute in mysql.
Comment 1 Chad H. 2014-02-21 22:49:07 UTC
So we've got a script for this but it needs some tidying up. Would be nice to have this info somewhere so we can tell when the numbers diverge.
Comment 2 Chad H. 2014-03-05 21:08:27 UTC
I've rolled Cirrus out to more wikis so the script is more useful than before. I've also fixed some math problems so we can get better data.

Maybe alert when we're > 40%? That's almost always a problem on our end. Less than that and you could very easily hit a wiki with a high redirect:page ratio and alert pointlessly.
Comment 3 Chad H. 2014-03-12 20:26:23 UTC
Bleh, this script isn't all that useful. See for example:

demon@terbium:~$ mwscript extensions/CirrusSearch/maintenance/checkCounts.php zhwikibooks
SiteStats=12273
Elasticsearch=7258
Percentage=51%

But I just finished a force reindexing of this entire wiki to prove my point...this is hard to measure :(

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links