Last modified: 2014-03-11 13:32:38 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T48208, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 46208 - Dump stats: collect monthly totals more efficiently
Dump stats: collect monthly totals more efficiently
Status: NEW
Product: Analytics
Classification: Unclassified
Wikistats (Other open bugs)
unspecified
All All
: Low minor
: ---
Assigned To: Nobody - You can work on this!
:
Depends on: 46198
Blocks:
  Show dependency treegraph
 
Reported: 2013-03-16 12:53 UTC by Erik Zachte
Modified: 2014-03-11 13:32 UTC (History)
1 user (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Erik Zachte 2013-03-16 12:53:52 UTC
On largest dumps this routine can take hours, not trivial to rework though
Comment 2 Erik Zachte 2014-03-11 13:32:38 UTC
One of the design anomalies that dates back to an era when the English dump could be parsed in minutes rather than days ;-)

Current implementation is really inefficient (bad coded) as for some metrics in WikiCountsOutput.pm data are established month by month, and for every iteration a recursive lookup occurs. The inefficiency grows over month by month, but speed improvements in hardware make it less urgent.  

If Wikistats would change to incremental updates https://bugzilla.wikimedia.org/show_bug.cgi?id=46198 the issue is mute.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links