Last modified: 2013-04-26 12:22:23 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T32452, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 30452 - gracefully handle a poolcounterd outage
Status: NEW
Product: MediaWiki
Classification: Unclassified
Component: Parser
Version: unspecified
Hardware: All All
Importance: Normal normal
Target Milestone: ---
Assigned To: Nobody - You can work on this!
Depends on:
Blocks:
Reported: 2011-08-18 21:15 UTC by Asher Feldman
Modified: 2013-04-26 12:22 UTC (History)

Description Asher Feldman 2011-08-18 21:15:11 UTC
PoolCounter as currently deployed is a SPOF in our infrastructure. If it's enabled in MediaWiki and the poolcounterd server is completely down, an error page will be displayed for any article in need of parsing.

There is a separate RT ticket to make poolcounterd redundant in our infrastructure but we'd still like to make sure total failure is handled gracefully.
Comment 1 Tim Starling 2011-08-19 14:08:41 UTC
I thought I fixed this in r84322, which was deployed in March.
Comment 2 sheac 2012-04-21 19:14:48 UTC
(In reply to comment #0)
> PoolCounter as currently deployed is a SPOF in our infrastructure.  If it's
> enabled in MediaWiki and the poolcounterd server is completely down, an error
> page will be displayed for any article in need of parsing.
> 
> There is a separate RT ticket to make poolcounterd redundant in our
> infrastructure but we'd still like to make sure total failure is handled
> gracefully.

Are the conditions available for you to reproduce this bug (e.g. poolcounter server down), or can we trust Tim that it's been fixed in https://www.mediawiki.org/wiki/Special:Code/MediaWiki/84322 ?
Comment 3 Platonides 2012-04-21 19:30:39 UTC
A connection error would return a Status of type fatal, thus with r84322 the apache instance would do the work itself.
The poolcounter failing won't result in downtime for the wiki *if* Michael Jackson doesn't die. In which case we would be subject to the same overload as without the poolcounter (and the solution is just to restart it).

Assuming that the server would cope with all those connections in an overload (fd max, tcp buffers...), this is fixed.
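
The fallback described above can be illustrated with a minimal sketch. This is not the code from r84322 and not MediaWiki's actual API: the `Status` class, `acquire_from_pool`, `render_article`, and the `connect`/`parse` callables are all hypothetical names, and Python is used only for illustration. It assumes a connection failure surfaces as an `OSError` and is turned into a fatal status, in which case the web server does the parsing itself without coordination.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class Status:
    """Hypothetical result of asking the pool counter for a lock."""
    kind: str  # "locked" or "fatal" in this sketch

    def is_fatal(self) -> bool:
        return self.kind == "fatal"


def acquire_from_pool(connect: Callable[[], None]) -> Status:
    """Try to reach the pool counter daemon; a connection error yields a fatal Status."""
    try:
        connect()
    except OSError:
        # Covers refused connections, timeouts, unreachable hosts, etc.
        return Status("fatal")
    return Status("locked")


def render_article(connect: Callable[[], None],
                   parse: Callable[[], str]) -> Tuple[str, bool]:
    """Return (parsed output, whether the work was coordinated by the pool counter)."""
    status = acquire_from_pool(connect)
    if status.is_fatal():
        # Daemon unreachable: do the work locally instead of showing an error page.
        return parse(), False
    # Normal path: lock acquired, do the work (lock release omitted in this sketch).
    return parse(), True


def daemon_down() -> None:
    """Simulates a poolcounterd outage."""
    raise ConnectionRefusedError("poolcounterd is down")


def daemon_up() -> None:
    """Simulates a reachable daemon."""
    return None


outage_result = render_article(daemon_down, lambda: "<p>parsed</p>")
normal_result = render_article(daemon_up, lambda: "<p>parsed</p>")
```

In the outage case the page is still parsed, only without the protection against many servers parsing the same article at once, which matches the overload caveat in the comment above.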
Comment 4 sheac 2012-05-04 16:36:04 UTC
With the exception of the recent conversation I generated, this bug has not been touched in at least six months. With this in mind, I've been asked by the bugmeister to bump this bug's priority down from "High". Concerns should be addressed to mah@everybody.org.
Comment 5 Andre Klapper 2013-04-26 12:22:23 UTC
Does not look like "high" priority to me, hence setting to normal.

More general info: https://wikitech.wikimedia.org/wiki/PoolCounter
