Last modified: 2013-07-12 07:43:10 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T53068, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 51068 - ganglia.wmflabs.org is missing most projects
ganglia.wmflabs.org is missing most projects
Status: VERIFIED FIXED
Product: Wikimedia Labs
Classification: Unclassified
Infrastructure (Other open bugs)
unspecified
All All
: Unprioritized normal
: ---
Assigned To: Ryan Lane
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-07-09 20:43 UTC by Antoine "hashar" Musso (WMF)
Modified: 2013-07-12 07:43 UTC (History)
5 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Antoine "hashar" Musso (WMF) 2013-07-09 20:43:26 UTC
http://ganglia.wmflabs.org/ used to list all the projects. It is now down to 17 and miss my favorite project 'deployment-prep'.

It would be nice to have it fixed up to list all the labs project.
Comment 1 Ryan Lane 2013-07-09 20:51:05 UTC
There's a crontab, run as root:

/usr/local/sbin/generate-ganglia-conf.py

When run manually it regenerates all ganglia configuration files. Restarting /etc/init.d/ganglia-monitor-aggrs re-adds the sources. The cron seems to not be running. More worrying, though, is that ganglia seems to just break itself every once in a while. I'm assuming that puppet is doing so.
Comment 2 Ariel T. Glenn 2013-07-10 06:40:50 UTC
Output from the cron job trying to run on aggregator1:
-----
Rather than invoking init scripts through /etc/init.d, use the service(8)
utility, e.g. service ganglia-monitor restart

Since the script you are attempting to invoke has been converted to an
Upstart job, you may also use the restart(8) utility, e.g. restart ganglia-monitor
/etc/init.d/ganglia-monitor: 73: start: not found
Rather than invoking init scripts through /etc/init.d, use the service(8)
utility, e.g. service ganglia-monitor restart

Since the script you are attempting to invoke has been converted to an
Upstart job, you may also use the restart(8) utility, e.g. restart ganglia-monitor
/etc/init.d/ganglia-monitor: 73: start: not found
-----

In the meantime new members of the project can't get onto the instance because it can't create their home directories (likely it's using the gluster home dir and that's read-only).
Comment 3 Ryan Lane 2013-07-10 17:17:41 UTC
Home directories were set to use nfs, for some reason, and there was an issue with the automount config for lucid. It was fixed by https://gerrit.wikimedia.org/r/#/c/72936/. The homedirs are now accessible.
Comment 4 Ariel T. Glenn 2013-07-10 18:31:50 UTC
Homedirs were set to nfs by me in an attempt to get them to write my homedir to something other than a read-only gluster fs (I guess it's ro so people move off ot it).  We couldn't figure out a good way to test that gerrit patch, thanks for pushing that through.
Comment 5 Chris Johnson 2013-07-11 20:42:45 UTC
We found that Default PATH for cron environment on Debian/Ubuntu does not contain /sbin as a valid path. Therefore we added /sbin as part of the PATH.  Fixed with  https://gerrit.wikimedia.org/r/#/c/73278/
Comment 6 Antoine "hashar" Musso (WMF) 2013-07-12 07:43:10 UTC
Nice! Thank you everyone :-]

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links