Last modified: 2013-11-13 09:44:53 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T38994, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 36994 - [OPS] Add disk I/O to ganglia reports
[OPS] Add disk I/O to ganglia reports
Status: RESOLVED FIXED
Product: Wikimedia Labs
Classification: Unclassified
Infrastructure (Other open bugs)
unspecified
All All
: Normal enhancement
: ---
Assigned To: Antoine "hashar" Musso (WMF)
: ops
Depends on:
Blocks: 55406 41967 54787
  Show dependency treegraph
 
Reported: 2012-05-21 09:45 UTC by Antoine "hashar" Musso (WMF)
Modified: 2013-11-13 09:44 UTC (History)
10 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Antoine "hashar" Musso (WMF) 2012-05-21 09:45:16 UTC
Ganglia on wmflabs is missing disk I/O reporting. The reason we want them, is to be able to tell which instance is doing heavy I/O activities which might be kill GlusterFS (see bug 36993).


There is a Gmetric plugin which we might want to use. Based on /proc/diskstats 

https://github.com/ganglia/gmetric/blob/master/disk/diskio.pl/ganglia_disk_stats.pl

We used to have a homegrown `ganglia-metrics` debian package in /trunk/ganglia_metrics, it is probably obsolete nowadays.  Anyway, there was a python script there:

http://svn.wikimedia.org/viewvc/mediawiki/trunk/ganglia_metrics/DiskStats.py?view=markup&pathrev=69278


OR, maybe Ganglia already provides the metrics and it is all about enabling them?
Comment 1 Andre Klapper 2013-03-14 19:04:58 UTC
Ryan: Do you plan to work on this (as you're set as assignee)?
Comment 2 Ryan Lane 2013-03-14 21:52:35 UTC
I'm the default assignee. I added this for anyone to work on.
Comment 3 Gerrit Notification Bot 2013-09-23 13:40:54 UTC
Change 85669 had a related patch set uploaded by Hashar:
ganglia wrapper for py plugins (and add diskstat plugin)

https://gerrit.wikimedia.org/r/85669
Comment 4 Antoine "hashar" Musso (WMF) 2013-09-23 13:51:49 UTC
I wrote a puppet patch which is now pending review/merge by ops.
Comment 5 Gerrit Notification Bot 2013-10-23 08:26:40 UTC
Change 91351 had a related patch set uploaded by Hashar:
ganglia: diskstat.py plugin

https://gerrit.wikimedia.org/r/91351
Comment 6 Gerrit Notification Bot 2013-10-23 08:26:45 UTC
Change 91352 had a related patch set uploaded by Hashar:
contint: monitor CI server diskstats in Ganglia

https://gerrit.wikimedia.org/r/91352
Comment 7 Gerrit Notification Bot 2013-10-23 08:37:34 UTC
Change 91351 merged by Ori.livneh:
ganglia: diskstat.py plugin

https://gerrit.wikimedia.org/r/91351
Comment 8 Gerrit Notification Bot 2013-10-23 08:59:56 UTC
Change 91352 had a related patch set uploaded by Ori.livneh:
contint: monitor CI server diskstats in Ganglia

https://gerrit.wikimedia.org/r/91352
Comment 9 Gerrit Notification Bot 2013-10-23 09:00:55 UTC
Change 91352 merged by Ori.livneh:
contint: monitor CI server diskstats in Ganglia

https://gerrit.wikimedia.org/r/91352
Comment 10 Antoine "hashar" Musso (WMF) 2013-11-13 09:44:53 UTC
We got disk stats on the production continuous integration server (gallium and lanthanum). That was the purpose of this bug and it got solved by the changes above.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links