Last modified: 2013-08-21 22:25:26 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T54527, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 52527 - tools-webserver-01 seems to be out of memory from time to time
tools-webserver-01 seems to be out of memory from time to time
Status: RESOLVED FIXED
Product: Wikimedia Labs
Classification: Unclassified
tools (Other open bugs)
unspecified
All All
: Unprioritized normal
: ---
Assigned To: Marc A. Pelletier
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-08-04 17:07 UTC by Tim Landscheidt
Modified: 2013-08-21 22:25 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Tim Landscheidt 2013-08-04 17:07:23 UTC
I don't know if this is related to the NFS stalls, but tools-webserver-01 runs out of memory from time to time; exim paniclog:

| 2013-08-03 03:13:23 daemon: fork of queue-runner process failed: Cannot allocate memory

Daily anacron, Sun, 04 Aug 2013 06:30:12 +0000:

| /etc/cron.daily/apt:
| FATAL -> Failed to fork.

Weekly anacron, Sun, 04 Aug 2013 06:47:25 +0000:

| /etc/cron.weekly/apt-xapian-index:
| FATAL -> Failed to fork.
| run-parts: /etc/cron.weekly/apt-xapian-index exited with return code 100

Ganglia graphs (http://ganglia.wmflabs.org/latest/graph_all_periods.php?h=tools-webserver-01&m=load_one&r=hour&s=by%20name&hc=4&mc=2&st=1375632437&g=mem_report&z=large&c=tools) look rather peaceful, with most of the memory only being used for buffers/cache.

But the webserver should never impede the system jobs from running, so this needs to be looked into.  Setting up tools-webserver-03 is certainly an option, but may only defer the problem.
Comment 1 Tim Landscheidt 2013-08-17 18:22:01 UTC
exim paniclog again (deleted afterwards by me):

| 2013-08-13 12:53:23 daemon: fork of queue-runner process failed: Cannot allocate memory
| 2013-08-16 12:52:33 daemon: fork of queue-runner process failed: Cannot allocate memory
Comment 2 Marc A. Pelletier 2013-08-21 22:25:26 UTC
Added a new webserver to the rotation, this should ease the pressure.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links