Last modified: 2014-03-21 17:46:49 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T64922, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 62922 - Doubled zero tags in varnish logs
Doubled zero tags in varnish logs
Status: RESOLVED FIXED
Product: Analytics
Classification: Unclassified
General/Unknown (Other open bugs)
unspecified
All All
: Unprioritized normal
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-03-21 11:31 UTC by christian
Modified: 2014-03-21 17:46 UTC (History)
4 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description christian 2014-03-21 11:31:55 UTC
Beginning with the 2014-03-21 files, zero tags may come doubled, like
  zero=250-99;zero=250-99
instead of
  zero=250-99
. (I could not find tags with differing MCC MNCs) At least
  /a/squid/archive/zero/zero.tsv.log-20140321.gz
  /a/squid/archive/sampl mobile/mobile-sampled-100.tsv.log-20140321.gz
  /a/squid/archive/sampled/sampled-1000.tsv.log-20140321.gz
  /a/log/webrequest/zero/zero.tsv.log-20140321.gz
  /a/log/webrequest/mobile/mobile-sampled-100.tsv.log-20140321.gz
  Raw data in Hadoop
  Hive's webrequest table
are affected.

Since the first occurrence was on 2014-03-21T00:15:41, it might be that
  https://gerrit.wikimedia.org/r/#/c/119795/
is relevant (which mangles zero tags and got merged around that time).
Comment 1 Yuri Astrakhan 2014-03-21 16:33:21 UTC
Patch in gerrit: https://gerrit.wikimedia.org/r/#/c/120010/
Comment 2 Yuri Astrakhan 2014-03-21 16:52:14 UTC
Patch was merged, please close the bug if duplicates disappear. Is there an easy way to clean up the logs / hadoop?
Comment 3 christian 2014-03-21 17:46:49 UTC
I checked on live udp2log stream and no more double zero tags after the
above fix have been merged.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links