Last modified: 2014-10-21 16:34:21 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T68225, the corresponding Phabricator task for complete and up-to-date bug report information.

Bug 66225 - Special:RunJobs recursively calling Special:RunJobs again (?)


Summary:	Special:RunJobs recursively calling Special:RunJobs again (?)

Status:	NEW

Product:	MediaWiki
Classification:	Unclassified
Component:	JobQueue (Other open bugs)
Version:	1.24rc
Hardware:	All All

Importance:	Normal normal with 1 vote (vote)
Target Milestone:	---
Assigned To:	Nobody - You can work on this!

URL:
Whiteboard:
Keywords:

Depends on:
Blocks:
	Show dependency tree / graph

Reported:	2014-06-05 22:48 UTC by Platonides
Modified:	2014-10-21 16:34 UTC (History)
CC List:	3 users (show)

See Also:
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---

Attachments
Add an attachment (proposed patch, testcase, etc.)

Description Platonides 2014-06-05 22:48:07 UTC

We had a problem where the WMES wiki was painfully slow. The access log showing thousands of Special%3ARunJobs entries (NOT one per request).

 $ grep -c Special%3ARunJobs access.log.2014-06-03 
63116
 $ grep -c -v Special%3ARunJobs access.log.2014-06-03
472

(that would be 133 RunJobs per request‼)

Changing $wgJobRunRate to 0 from the default of 1 “fixed” the slowness.

From the external appearance, it looks like each Special:RunJobs then called another Special:RunJobs with an infinite recursion until the time limit.

There are 4 jobs with old dates in the queue, but even if they are not completable, they shouldn't have produced so many POSTs.

At the point $wgJobRunRate was disabled, the wiki was running 00f91912c64 (master), but it had been ‘broken’ for weeks.

Comment 1 Aaron Schulz 2014-06-05 22:58:16 UTC

Maybe the jobs where really slow and the run rate of 1 kept letting them build up. They could really some sort of bucketing scheme to set a configurable limit on the number of concurrent requests somehow...

Comment 2 Christian Kujau 2014-06-18 08:21:21 UTC

Same here with v1.23.0: for every request, the following gets logged into access.log:

::ffff:11.22.33.44 - - [18/Jun/2014:09:58:14 +0200] "POST /mediawiki/index.php?title=Special%3ARunJobs&tasks=jobs&maxjobs=1&sigexpiry=1403078299&signature=6da0cc79966609c2b51937b9175d7728b290ff0a HTTP/1.1" 411 357 "-" "-"

192.168.0.10 www.example.org - [18/Jun/2014:09:58:23 +0200] "GET /wiki/Main_Page HTTP/1.1" 200 18203 "-" "curl/7.37.0"


Setting wgJobRunRate to "0" seems to "fix" this. I don't understand what "jobs" are now not being run, but if it has any negative impact I shall change it to "0.01" or something to lessen the POST requests.

Wikimedia Bugzilla is closed!

Search

Personal tools

Navigation

Links