Last modified: 2014-10-21 13:14:53 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T74296, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 72296 - Raw webrequest partitions for 2014-10-20T13/1H not marked successful
Status: RESOLVED WONTFIX
Product: Analytics
Classification: Unclassified
Component: Refinery
Version: unspecified
Hardware: All All
Importance: Unprioritized normal
Target Milestone: ---
Assigned To: Nobody - You can work on this!
Depends on:
Blocks: 72298
 
Reported: 2014-10-21 09:48 UTC by christian
Modified: 2014-10-21 13:14 UTC (History)
CC List: 7 users



Description christian 2014-10-21 09:48:45 UTC
None of the webrequest partitions [1] for 2014-10-20T13/1H have
been marked successful.

What happened?


[1]
_________________________________________________________________
qchris@stat1002 // jobs: 0 // time: 09:43:10 // exit code: 0
cwd: ~/refinery/hive/webrequest
~/cluster-scripts/dump_webrequest_status.sh 
  +------------------+--------+--------+--------+--------+
  | Date             |  bits  | mobile |  text  | upload |
  +------------------+--------+--------+--------+--------+
[...]
  | 2014-10-20T11/1H |    .   |    .   |    .   |    .   |    
  | 2014-10-20T12/1H |    .   |    .   |    .   |    .   |    
  | 2014-10-20T13/1H |    X   |    X   |    X   |    X   |    
  | 2014-10-20T14/1H |    .   |    .   |    .   |    .   |    
  | 2014-10-20T15/1H |    .   |    .   |    .   |    .   |    
[...]
  +------------------+--------+--------+--------+--------+


Statuses:

  . --> Partition is ok
  M --> Partition manually marked ok
  X --> Partition is not ok (duplicates, missing, or nulls)
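
For reference, a minimal sketch of how a status table in the format shown above could be scanned for partitions that are not ok. This is not part of dump_webrequest_status.sh, whose implementation is not shown here; the function name failed_partitions and the SAMPLE text are hypothetical, and the sketch only assumes the column layout and the status legend above.

```python
# Hypothetical helper, not taken from dump_webrequest_status.sh.
# It assumes the table layout and the status legend shown in this report.

SAMPLE = """
  +------------------+--------+--------+--------+--------+
  | Date             |  bits  | mobile |  text  | upload |
  +------------------+--------+--------+--------+--------+
[...]
  | 2014-10-20T12/1H |    .   |    .   |    .   |    .   |
  | 2014-10-20T13/1H |    X   |    X   |    X   |    X   |
  | 2014-10-20T14/1H |    .   |    .   |    .   |    .   |
[...]
  +------------------+--------+--------+--------+--------+
"""


def failed_partitions(table_text):
    """Map each partition to the list of sources whose status is X."""
    header = None
    failed = {}
    for line in table_text.splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue  # skip borders, "[...]" elisions, and blank lines
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        if cells[0] == "Date":
            header = cells[1:]  # source names: bits, mobile, text, upload
            continue
        if header is None:
            continue  # data row before any header; ignore it
        bad = [src for src, status in zip(header, cells[1:]) if status == "X"]
        if bad:
            failed[cells[0]] = bad
    return failed


if __name__ == "__main__":
    for partition, sources in failed_partitions(SAMPLE).items():
        print(partition, "not ok for:", ", ".join(sources))
```

Rows marked "M" (manually marked ok) are treated as ok in this sketch, following the legend.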

Comment 1 christian 2014-10-21 10:08:41 UTC
The affected period is 2014-10-20T13:07:11--2014-10-20T13:25:38.
Only ulsfo caches were affected, but all of them were.

The affected period shows roughly 2M duplicates, which are worth
* 79 seconds of ulsfo data, or
* 15 seconds of total data.

The affected period shows roughly 27M missing lines, which are worth
* 16 minutes of ulsfo data, or
*  3 minutes of total data.
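
As a rough cross-check of the figures above, the following sketch derives the line rates they imply. The inputs are the rounded numbers from this comment, so the results are only approximate; the helper name implied_rate is hypothetical, and the start of the affected period is assumed to fall on 2014-10-20.

```python
from datetime import datetime

# Rounded figures quoted in this comment; treat all results as approximate.
DUPLICATE_LINES = 2_000_000
MISSING_LINES = 27_000_000


def implied_rate(lines, seconds):
    """Lines per second implied by 'N lines are worth S seconds of data'."""
    return lines / seconds


rates = {
    "ulsfo, from duplicates": implied_rate(DUPLICATE_LINES, 79),
    "total, from duplicates": implied_rate(DUPLICATE_LINES, 15),
    "ulsfo, from missing": implied_rate(MISSING_LINES, 16 * 60),
    "total, from missing": implied_rate(MISSING_LINES, 3 * 60),
}

# Share of total traffic attributed to ulsfo by each pair of figures.
ulsfo_share_from_duplicates = 15 / 79
ulsfo_share_from_missing = 3 / 16

# Length of the affected period (assumption: both timestamps are on 2014-10-20).
start = datetime.fromisoformat("2014-10-20T13:07:11")
end = datetime.fromisoformat("2014-10-20T13:25:38")
affected_seconds = (end - start).total_seconds()

if __name__ == "__main__":
    for label, rate in rates.items():
        print(f"{label}: about {rate:,.0f} lines/s")
    print(f"ulsfo share: {ulsfo_share_from_duplicates:.2f} "
          f"vs {ulsfo_share_from_missing:.2f}")
    print(f"affected period: {affected_seconds:.0f} s")
```

Both pairs of figures put ulsfo at just under a fifth of total traffic, so the duplicate and missing-line estimates are consistent with each other.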

Ops reported [1] that network issues between ulsfo and eqiad started
at 13:07. This aligns with and explains the issues that we're seeing.


[1] https://lists.wikimedia.org/mailman/private/ops/2014-October/042274.html
