Last modified: 2013-12-01 15:38:46 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T59794, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 57794 - Tool labs web servers give 503's
Tool labs web servers give 503's
Status: RESOLVED FIXED
Product: Wikimedia Labs
Classification: Unclassified
tools (Other open bugs)
unspecified
All All
: High major
: ---
Assigned To: Marc A. Pelletier
http://tools.wmflabs.org/
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-12-01 10:32 UTC by Maarten Dammers
Modified: 2013-12-01 15:38 UTC (History)
13 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Maarten Dammers 2013-12-01 10:32:44 UTC
No tools seem to be working anymore. All tools give a 503:

Service Temporarily Unavailable

The server is temporarily unable to service your request due to maintenance downtime or capacity problems. Please try again later.

If I look in the access log I see that one of my tools last served a request at [30/Nov/2013:23:42:24 +0000]
Comment 1 metatron 2013-12-01 10:47:53 UTC
Do you have any url?

For me it just works fine:
http://tools.wmflabs.org/

(php)
https://tools.wmflabs.org/wikiviewstats/
Comment 2 Pietrodn 2013-12-01 10:50:03 UTC
(In reply to comment #1)
> Do you have any url?
> 
> For me it just works fine:
> http://tools.wmflabs.org/
> 
> (php)
> https://tools.wmflabs.org/wikiviewstats/

I notice that some tools work and some don't. Maybe it's related with PHP?
My tool for example is written in PHP and doesn't work:
https://tools.wmflabs.org/intersect-contribs/
Comment 3 metatron 2013-12-01 10:56:57 UTC
Mine is php, too - and it works. But there seems to be an issue though.

Grid-Status doesn't show up:
http://tools.wmflabs.org/?status

btw: Are you using new web already and is your http service in list when »qstat«?

As writing these lines: qstat fails w/ error. Maybe the Grid is partially broken.

local-wikiviewstats@tools-login:~$ qstat
error: commlib error: got select error (No route to host)
error: unable to send message to qmaster using port 6444 on host "tools-master.pmtpa.wmflabs": got send error
Comment 4 Pietrodn 2013-12-01 11:02:33 UTC
(In reply to comment #3)
> Mine is php, too - and it works. But there seems to be an issue though.

Oops, that's true :)

> Grid-Status doesn't show up:
> http://tools.wmflabs.org/?status
> 
> btw: Are you using new web already and is your http service in list when
> »qstat«?

My tools aren't using the new web service; "webservice start" just hangs.

> 
> As writing these lines: qstat fails w/ error. Maybe the Grid is partially
> broken.
> 
> local-wikiviewstats@tools-login:~$ qstat
> error: commlib error: got select error (No route to host)
> error: unable to send message to qmaster using port 6444 on host
> "tools-master.pmtpa.wmflabs": got send error

qstat fails with the same error for me.
Comment 5 Nemo 2013-12-01 11:14:27 UTC
From 208.80.153.237 icmp_seq=14 Destination Host Unreachable
^C
--- ee-dashboard.wmflabs.org ping statistics ---
14 packets transmitted, 0 received, +6 errors, 100% packet loss, time 13001ms
Comment 6 Liangent 2013-12-01 12:53:39 UTC
(In reply to comment #5)
> From 208.80.153.237 icmp_seq=14 Destination Host Unreachable
> ^C
> --- ee-dashboard.wmflabs.org ping statistics ---
> 14 packets transmitted, 0 received, +6 errors, 100% packet loss, time 13001ms

My tool's web interface seems working: https://tools.wmflabs.org/liangent-php/index.php/enwiki?title=Special:BlankPage
Comment 7 Yuvi Panda 2013-12-01 12:59:12 UTC
Looks like a number of hosts are down: http://ganglia.wmflabs.org/latest/?r=hour&cs=&ce=&s=by+name&c=tools&tab=m&vn=

I restarted the grid master and the webservers, let's see what happens
Comment 8 Yuvi Panda 2013-12-01 13:00:25 UTC
(In reply to comment #6)
> My tool's web interface seems working:
> https://tools.wmflabs.org/liangent-php/index.php/enwiki?title=Special:
> BlankPage

Webgrid is working, so newweb tools will continue to work - just can't submit any new ones. One of the webservers (and the proxy) is also operational, so there's a 1/3 chance of your tool working even if you are using apache.
Comment 9 Yuvi Panda 2013-12-01 13:03:23 UTC
So the instances I tried to reboot seem to be stuck in a 'rebooting' state, and I'm not able to check what the console says either (wikitech just says 'failed to get console output'). Looks like resolving this will need someone with higher powers than what I have.
Comment 10 Cyberpower678 2013-12-01 13:25:39 UTC
Looks like tools using the new webserver are immune.  Xtools, all written in PHP btw, are running just fine.
Comment 11 Cyberpower678 2013-12-01 13:36:25 UTC
(In reply to comment #10)
> Looks like tools using the new webserver are immune.  Xtools, all written in
> PHP btw, are running just fine.

Or not.  It just seems to hang now.
Comment 12 Marc A. Pelletier 2013-12-01 14:07:27 UTC
One of the hardware servers providing virtual servers had crashed during the night, disrupting service from those virtual environments running on it (and only those).  It's in the process of returning to service now.
Comment 13 Cyberpower678 2013-12-01 15:13:53 UTC
Something broke.  Every internal link on xtools is being redirected else where and doesn't work.

For example when I hover over a link on the edit counter, it points to http://tools.wmflabs.org/xtools/ec/, when I click on it, it instead goes to tools-webgrid-01:4040/xtools/ec/
Comment 14 Andrew Bogott 2013-12-01 15:36:38 UTC
which virt box crashed?  And, any idea why?
Comment 15 Nemo 2013-12-01 15:38:46 UTC
Did you close this again on purpose or was it a mid-air collision?

(In reply to comment #14)
> which virt box crashed?  And, any idea why?

virt10: https://ganglia.wikimedia.org/latest/?c=Virtualization%20cluster%20pmtpa&h=virt10.pmtpa.wmnet&m=cpu_report&r=day&s=by%20name&hc=4&mc=2

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links