Last modified: 2014-10-21 16:34:21 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T68225, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 66225 - Special:RunJobs recursively calling Special:RunJobs again (?)
Special:RunJobs recursively calling Special:RunJobs again (?)
Status: NEW
Product: MediaWiki
Classification: Unclassified
JobQueue (Other open bugs)
1.24rc
All All
: Normal normal with 1 vote (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-06-05 22:48 UTC by Platonides
Modified: 2014-10-21 16:34 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Platonides 2014-06-05 22:48:07 UTC
We had a problem where the WMES wiki was painfully slow. The access log showing thousands of Special%3ARunJobs entries (NOT one per request).

 $ grep -c Special%3ARunJobs access.log.2014-06-03 
63116
 $ grep -c -v Special%3ARunJobs access.log.2014-06-03
472

(that would be 133 RunJobs per request‼)

Changing $wgJobRunRate to 0 from the default of 1 “fixed” the slowness.

From the external appearance, it looks like each Special:RunJobs then called another Special:RunJobs with an infinite recursion until the time limit.

There are 4 jobs with old dates in the queue, but even if they are not completable, they shouldn't have produced so many POSTs.

At the point $wgJobRunRate was disabled, the wiki was running 00f91912c64 (master), but it had been ‘broken’ for weeks.
Comment 1 Aaron Schulz 2014-06-05 22:58:16 UTC
Maybe the jobs where really slow and the run rate of 1 kept letting them build up. They could really some sort of bucketing scheme to set a configurable limit on the number of concurrent requests somehow...
Comment 2 Christian Kujau 2014-06-18 08:21:21 UTC
Same here with v1.23.0: for every request, the following gets logged into access.log:

::ffff:11.22.33.44 - - [18/Jun/2014:09:58:14 +0200] "POST /mediawiki/index.php?title=Special%3ARunJobs&tasks=jobs&maxjobs=1&sigexpiry=1403078299&signature=6da0cc79966609c2b51937b9175d7728b290ff0a HTTP/1.1" 411 357 "-" "-"

192.168.0.10 www.example.org - [18/Jun/2014:09:58:23 +0200] "GET /wiki/Main_Page HTTP/1.1" 200 18203 "-" "curl/7.37.0"


Setting wgJobRunRate to "0" seems to "fix" this. I don't understand what "jobs" are now not being run, but if it has any negative impact I shall change it to "0.01" or something to lessen the POST requests.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links