Last modified: 2014-03-31 22:19:08 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T64862, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 62862 - scap not reporting rsync failures
scap not reporting rsync failures
Status: RESOLVED FIXED
Product: Wikimedia
Classification: Unclassified
Deployment systems (Other open bugs)
wmf-deployment
All All
: High critical (vote)
: ---
Assigned To: Bryan Davis
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-03-20 03:26 UTC by Bryan Davis
Modified: 2014-03-31 22:19 UTC (History)
4 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Bryan Davis 2014-03-20 03:26:24 UTC
During the 1.23wmf18 deploy and again today scap failed to update servers in row D of eqiad. Neither time did the scap UI report the failures. Running sync-common manually on one of the failing hosts reveals that it is correctly reporting the error:

bd808@mw1202:~$ sync-common mw1010.eqiad.wmnet mw1070.eqiad.wmnet
00:20:39 DEBUG    - Copying to mw1202.eqiad.wmnet from mw1010.eqiad.wmnet
00:20:39 DEBUG    - Started rsync common
@ERROR: access denied to common from mw1202.eqiad.wmnet (10.64.48.34)
rsync error: error starting client-server protocol (code 5) at main.c(1534) [Rec
eiver=3.0.9]
00:20:39 INFO     - Finished rsync common (duration: 00m 00s)
00:20:39 DEBUG    - Unhandled error:
Traceback (most recent call last):
  File "/srv/scap/scap/cli.py", line 201, in run
    exit_status = app.main(extra_args)
  File "/srv/scap/scap/main.py", line 70, in main
    tasks.sync_common(self.config, self.arguments.servers)
  File "/srv/scap/scap/tasks.py", line 167, in sync_common
    subprocess.check_call(rsync)
  File "/usr/lib/python2.7/subprocess.py", line 511, in check_call
    raise CalledProcessError(retcode, cmd)
CalledProcessError: Command '('sudo', '-u', 'mwdeploy', '/usr/bin/rsync', '-a',
'--delete-delay', '--delay-updates', '--compress', '--delete', '--exclude=**/.sv
n/lock', '--exclude=**/.git/objects', '--exclude=**/.git/**/objects', '--exclude
=**/cache/l10n/*.cdb', '--no-perms', 'mw1010.eqiad.wmnet::common', '/usr/local/a
pache/common-local')' returned non-zero exit status 5
00:20:39 ERROR    - sync-common failed: <CalledProcessError> Command '('sudo', '
-u', 'mwdeploy', '/usr/bin/rsync', '-a', '--delete-delay', '--delay-updates', '-
-compress', '--delete', '--exclude=**/.svn/lock', '--exclude=**/.git/objects', '
--exclude=**/.git/**/objects', '--exclude=**/cache/l10n/*.cdb', '--no-perms', 'm
w1010.eqiad.wmnet::common', '/usr/local/apache/common-local')' returned non-zero
 exit status 5
Comment 1 Gerrit Notification Bot 2014-03-28 03:12:56 UTC
Change 121571 had a related patch set uploaded by BryanDavis:
Return exit_status from Application._before_exit

https://gerrit.wikimedia.org/r/121571
Comment 2 Gerrit Notification Bot 2014-03-28 18:43:42 UTC
Change 121571 merged by jenkins-bot:
Return exit_status from Application._before_exit

https://gerrit.wikimedia.org/r/121571

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links