Last modified: 2014-10-24 14:48:32 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T54867, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 52867 - monitor that application servers are responding
monitor that application servers are responding
Status: NEW
Product: Wikimedia Labs
Classification: Unclassified
deployment-prep (beta) (Other open bugs)
unspecified
All All
: High enhancement
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks: 51497
  Show dependency treegraph
 
Reported: 2013-08-14 22:04 UTC by Antoine "hashar" Musso (WMF)
Modified: 2014-10-24 14:48 UTC (History)
8 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Antoine "hashar" Musso (WMF) 2013-08-14 22:04:33 UTC
We had one of the application server that was not responding anymore despite the Apache process being up (bug 52776). We would need to monitor that the application server are actually serving something.
Comment 1 Antoine "hashar" Musso (WMF) 2013-08-14 22:05:17 UTC
That needs monitoring Apache daemon is running AND that it is serving content.
Comment 2 Sam Reed (reedy) 2013-08-15 19:09:51 UTC
incinga has "Apache HTTP" monitoring in WMF production, but only apparently for hosts in PMTPA, not in EQIAD (another issue)
Comment 3 Sam Reed (reedy) 2013-08-15 19:14:04 UTC
(In reply to comment #2)
> incinga has "Apache HTTP" monitoring in WMF production, but only apparently
> for
> hosts in PMTPA, not in EQIAD (another issue)

Ignore the EQIAD part - Apache isn't monitored on job runners
Comment 4 Antoine "hashar" Musso (WMF) 2013-08-15 20:26:25 UTC
And this bug is about the beta cluster :)
Comment 5 Sam Reed (reedy) 2013-08-15 20:27:34 UTC
(In reply to comment #4)
> And this bug is about the beta cluster :)

I was more meaning there should be incinga config you can steal/hack/copy and paste or whatever ;)
Comment 6 Antoine "hashar" Musso (WMF) 2014-10-24 14:48:32 UTC
Resetting severity. If it was really critical it would have been fixed long ago.

Yuvi Panda is working on integrating Shinken for labs, a drop in replacement for Nagios/Icinga.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links