Last modified: 2014-08-27 17:26:50 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T52053, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 50053 - jstart doesn't signal out-of-memory kills to the user
jstart doesn't signal out-of-memory kills to the user
Status: RESOLVED WONTFIX
Product: Wikimedia Labs
Classification: Unclassified
tools (Other open bugs)
unspecified
All All
: Normal normal
: ---
Assigned To: Marc A. Pelletier
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-06-23 14:39 UTC by Tim Landscheidt
Modified: 2014-08-27 17:26 UTC (History)
1 user (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Tim Landscheidt 2013-06-23 14:39:45 UTC
When the client script is killed with -9, the wrapper script terminates as well.

While this is probably a prudent choice, the user isn't informed about this at all.

As a minimal courtesy, we should add "-m ae" to the qsub call, so that the user gets at least a mail that he probably doesn't understand :-).  Of course, even better would be to use a SGE hook that fires after a job terminates.
Comment 1 Marc A. Pelletier 2014-08-26 17:56:40 UTC
Users may request -m from jsub/jstart which are passed to qsub and behave as expected.  I'd rather not increase the default amount of cron/gridengin spam.
Comment 2 Tim Landscheidt 2014-08-27 02:01:23 UTC
My suggestion would send one (1) message if a job terminates that the user expects to be running continuously.
Comment 3 Marc A. Pelletier 2014-08-27 15:20:03 UTC
Wouldn't the default "Hey, I had to restart your job" from bigbrother fill that function?
Comment 4 Tim Landscheidt 2014-08-27 17:26:50 UTC
That requires the job being managed by bigbrother.  I just wanted to point out that IMHO one message per interaction is not spam, especially if it conveys important information for the recipient.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links