Last modified: 2013-11-13 09:44:53 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links might be broken. See T38994, the corresponding Phabricator task, for complete and up-to-date bug report information.

Bug 36994 - [OPS] Add disk I/O to ganglia reports


Summary:	[OPS] Add disk I/O to ganglia reports

Status:	RESOLVED FIXED

Product:	Wikimedia Labs
Classification:	Unclassified
Component:	Infrastructure
Version:	unspecified
Hardware:	All All

Importance:	Normal enhancement
Target Milestone:	---
Assigned To:	Antoine "hashar" Musso (WMF)

URL:
Whiteboard:
Keywords:	ops

Depends on:
Blocks:	55406 41967 54787

Reported:	2012-05-21 09:45 UTC by Antoine "hashar" Musso (WMF)
Modified:	2013-11-13 09:44 UTC
CC List:	10 users

See Also:
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---


Description Antoine "hashar" Musso (WMF) 2012-05-21 09:45:16 UTC

Ganglia on wmflabs is missing disk I/O reporting. We want it so that we can tell which instance is doing heavy I/O that might kill GlusterFS (see bug 36993).


There is a Gmetric plugin, based on /proc/diskstats, which we might want to use:

https://github.com/ganglia/gmetric/blob/master/disk/diskio.pl/ganglia_disk_stats.pl
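For illustration, here is a minimal Python sketch of what a /proc/diskstats-based collector has to do: parse the cumulative per-device counters, then turn two samples into per-second rates. This is not the code of the plugin linked above. The function names and the metric keys (`reads`, `read_bytes`, and so on) are made up for this example, and it assumes the 14-column layout of /proc/diskstats where sector counts are in 512-byte units.

```python
SECTOR_BYTES = 512  # /proc/diskstats reports sectors in 512-byte units


def parse_diskstats(text):
    """Return {device: counters} from the contents of /proc/diskstats.

    The metric keys are hypothetical names chosen for this sketch.
    """
    stats = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 14:
            continue  # skip blank lines and short legacy partition rows
        # fields: major, minor, name, then the cumulative counters
        stats[fields[2]] = {
            "reads": int(fields[3]),                      # reads completed
            "read_bytes": int(fields[5]) * SECTOR_BYTES,  # sectors read
            "writes": int(fields[7]),                     # writes completed
            "write_bytes": int(fields[9]) * SECTOR_BYTES, # sectors written
            "io_ms": int(fields[12]),                     # ms spent doing I/O
        }
    return stats


def rates(before, after, interval_s):
    """Per-second rates for every device present in both samples."""
    return {
        dev: {
            key: (after[dev][key] - before[dev][key]) / interval_s
            for key in after[dev]
        }
        for dev in after
        if dev in before
    }


def read_diskstats(path="/proc/diskstats"):
    """Read and parse the live counters (Linux only)."""
    with open(path) as fh:
        return parse_diskstats(fh.read())
```

A collector would call `read_diskstats()` twice, some seconds apart, and hand the output of `rates()` to Ganglia. Since the counters are cumulative since boot, only the difference between two samples shows which instance is currently doing heavy I/O.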

We used to have a homegrown `ganglia-metrics` Debian package in /trunk/ganglia_metrics; it is probably obsolete nowadays. Anyway, there was a Python script there:

http://svn.wikimedia.org/viewvc/mediawiki/trunk/ganglia_metrics/DiskStats.py?view=markup&pathrev=69278


OR, maybe Ganglia already provides the metrics and it is all about enabling them?

Comment 1 Andre Klapper 2013-03-14 19:04:58 UTC

Ryan: Do you plan to work on this (as you're set as assignee)?

Comment 2 Ryan Lane 2013-03-14 21:52:35 UTC

I'm the default assignee. I added this for anyone to work on.

Comment 3 Gerrit Notification Bot 2013-09-23 13:40:54 UTC

Change 85669 had a related patch set uploaded by Hashar:
ganglia wrapper for py plugins (and add diskstat plugin)

https://gerrit.wikimedia.org/r/85669

Comment 4 Antoine "hashar" Musso (WMF) 2013-09-23 13:51:49 UTC

I wrote a puppet patch which is now pending review/merge by ops.

Comment 5 Gerrit Notification Bot 2013-10-23 08:26:40 UTC

Change 91351 had a related patch set uploaded by Hashar:
ganglia: diskstat.py plugin

https://gerrit.wikimedia.org/r/91351

Comment 6 Gerrit Notification Bot 2013-10-23 08:26:45 UTC

Change 91352 had a related patch set uploaded by Hashar:
contint: monitor CI server diskstats in Ganglia

https://gerrit.wikimedia.org/r/91352

Comment 7 Gerrit Notification Bot 2013-10-23 08:37:34 UTC

Change 91351 merged by Ori.livneh:
ganglia: diskstat.py plugin

https://gerrit.wikimedia.org/r/91351

Comment 8 Gerrit Notification Bot 2013-10-23 08:59:56 UTC

Change 91352 had a related patch set uploaded by Ori.livneh:
contint: monitor CI server diskstats in Ganglia

https://gerrit.wikimedia.org/r/91352

Comment 9 Gerrit Notification Bot 2013-10-23 09:00:55 UTC

Change 91352 merged by Ori.livneh:
contint: monitor CI server diskstats in Ganglia

https://gerrit.wikimedia.org/r/91352

Comment 10 Antoine "hashar" Musso (WMF) 2013-11-13 09:44:53 UTC

We got disk stats on the production continuous integration servers (gallium and lanthanum). That was the purpose of this bug, and it was solved by the changes above.
