Last modified: 2014-09-23 22:56:48 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links might be broken. See T53434, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 51434 - Setup an icinga instance to monitor tools on tool-labs
Status: NEW
Product: Wikimedia Labs
Classification: Unclassified
Component: tools
Version: unspecified
Hardware: All All
Importance: High normal
Target Milestone: ---
Assigned To: Yuvi Panda
Depends on:
Blocks:
Reported: 2013-07-16 11:30 UTC by Yuvi Panda
Modified: 2014-09-23 22:56 UTC (History)

Description Yuvi Panda 2013-07-16 11:30:31 UTC
Automated monitoring + alerts for tool users would be awesome, and will probably increase the reliability, etc. of toollabs a fair bit. No need to play the 'is this tool up or not?!' guessing game.
Comment 1 Yuvi Panda 2013-07-16 11:31:08 UTC
This should be a setup separate from what we have for production and also from the one for critical infrastructure on toollabs (such as the MySQL or Apache daemons).
Comment 2 Sumana Harihareswara 2013-09-28 00:15:40 UTC
I am willing to be told I'm wrong - but I think this is a pretty important step in improving our own reliability, and in providing high-reassurance support to our users.
Comment 3 Tim Landscheidt 2014-01-19 21:11:19 UTC
(In reply to comment #2)
> I am willing to be told I'm wrong - but I think this is a pretty important
> step in improving our own reliability, and in providing high-reassurance
> support to our users.

IIRC the scope of this bug is Icinga for users' tools; for Tools' reliability in general we have icinga.wmflabs.org with (currently) various shortcomings (if it is running at all) that should be addressed in a different bug. For the latter, I remember hashar being interested in using it more for beta as well.
Comment 4 scott.leea 2014-07-09 16:21:05 UTC
Is this something I can work on?
Comment 5 Marc A. Pelletier 2014-07-09 17:03:17 UTC
Not just yet; we're currently at the stage where we are setting equipment aside for the task and doing our first round of specifications. I expect we'll spend some time at the Hackathon in London working on this; if you're around then you'd be welcome to join us.

Otherwise, once we return, we'll probably have something worth hacking on.
Comment 6 Marc A. Pelletier 2014-08-27 17:18:53 UTC
Handing off to Yuvi, who is the gatekeeper of labmon1001
Comment 7 Sumana Harihareswara 2014-09-23 22:56:48 UTC
Good luck, Yuvi!
