Last modified: 2014-08-03 23:59:26 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T68833, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 66833 - Stripping of trailing characters from zero tags in varnish logs not effective
Status: RESOLVED FIXED
Product: Analytics
Classification: Unclassified
Component: General/Unknown
Version: unspecified
Hardware: All All
Importance: Unprioritized normal
Target Milestone: ---
Assigned To: Nobody - You can work on this!
Depends on:
Blocks:
Reported: 2014-06-19 11:18 UTC by christian
Modified: 2014-08-03 23:59 UTC

Description christian 2014-06-19 11:18:30 UTC

    
Comment 1 christian 2014-06-19 11:34:16 UTC
Although the zero VCL comes with code to strip a trailing character from
a zero tag [1], we're nonetheless seeing tags such as

  zero=404-01b

in the zero TSVs and the mobile-sampled-100 TSVs [2] [3], where we'd expect

  zero=404-01

instead. While those requests are very few and harmless, it seems the
code that strips the trailing character is not working as expected.
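
For illustration only, here is a minimal Python sketch of the intended stripping, applied to an X-Analytics-style string. It is not the VCL code referenced in [1]. The tag shape (digits, a dash, digits, then at most one stray letter) is an assumption based solely on the zero=404-01b example above, and the name strip_zero_suffix is hypothetical.

```python
import re

# Assumption: a zero tag is "<digits>-<digits>", optionally followed by one
# stray trailing letter (as in "zero=404-01b"). The real tag grammar may differ.
_ZERO_TAG = re.compile(r"(zero=\d+-\d+)[A-Za-z]\b")


def strip_zero_suffix(x_analytics: str) -> str:
    """Drop a single trailing letter from the zero tag, if one is present."""
    return _ZERO_TAG.sub(r"\1", x_analytics)


if __name__ == "__main__":
    for value in ("zero=404-01b", "zero=404-01", "zero=404-01b;https=1"):
        print(value, "->", strip_zero_suffix(value))
```

A check like this could be run over the TSVs to count rows whose tag still carries a suffix.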



[1] https://git.wikimedia.org/blob/operations%2Fpuppet.git/a23daab91b67cb7a41b71aa4f77828359dc53170/templates%2Fvarnish%2Fzero.inc.vcl.erb#L312

[2] The volume of requests is extremely low (like 20-100 requests/day),
first occurrence is on 2014-05-22T19:20:30 in the zero tsvs.

The requests go to

  http://{en,pl}.{m,zero}.wikipedia.org/wiki/Main_Page

and come from a 10.68.0.0/16 IP, so they are probably intentional.

[3] Up to 2014-06-19, such requests are not present in the sampled-1000
stream. But that may be down to the sampling factor and the low volume of
requests, so I expect such a request to show up in the sampled-1000 logs
at some point.
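
To make the sampling argument in [3] concrete, here is a rough back-of-the-envelope sketch in Python. It assumes each request is kept independently with probability 1/1000, which is a simplification (the real sampler may pick every 1000th line instead), and it uses the 20-100 requests/day range and the 2014-05-22 to 2014-06-19 window from the notes above. The function names are hypothetical.

```python
from datetime import date


def expected_sampled(requests_per_day: float, days: int, factor: int = 1000) -> float:
    """Expected number of affected requests that land in a 1:factor sample."""
    return requests_per_day * days / factor


def prob_at_least_one(requests_per_day: int, days: int, factor: int = 1000) -> float:
    """Chance that at least one affected request is sampled.

    Assumption: every request is kept independently with probability 1/factor.
    """
    total = requests_per_day * days
    return 1.0 - (1.0 - 1.0 / factor) ** total


if __name__ == "__main__":
    # Window between the first occurrence and the date of the note.
    days = (date(2014, 6, 19) - date(2014, 5, 22)).days
    for rate in (20, 100):
        print(rate, expected_sampled(rate, days), prob_at_least_one(rate, days))
```

Under these assumptions, a miss so far is plausible at the low end of the range and increasingly unlikely at the high end.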
Comment 2 christian 2014-06-19 11:40:35 UTC
Sent a heads up to Adam and Yuri, since the bug is filed in bugzilla's
“Analytics” product.
Comment 3 Yuri Astrakhan 2014-08-03 23:07:35 UTC
Just a thought - could this be the result of a test run? Some testing harness faking the headers from non-carrier IPs?
Comment 4 christian 2014-08-03 23:59:26 UTC
(In reply to Yuri Astrakhan from comment #3)
> Just a thought - could this be the result of a test run?

Who knows what people test :-)
At its worst, it hit us with ~30K requests/day.
One “test” every 3 seconds on average is certainly not nice testing :-)
But nothing stops carriers (or Wikipedia Zero users) from doing that.

> Some testing
> harness faking the headers from non-carrier IPs?

We log the X-Analytics of the response (not the request).
Given the agreements around when you set X-Analytics on the response,
"faked headers from non-carrier IPs" should not be an issue.

But since the bug got attention again, I looked at the current logs and
found the issue gone. Bisecting the dates, it seems those requests died
off on 2014-06-24 around 15:29:30.
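
The date bisection can be sketched roughly as below. This is only an illustration, not the procedure actually used: has_bad_tag is a hypothetical predicate standing in for "grep this day's logs for a suffixed zero tag", and the search assumes the requests occur on every day up to some cutoff and on none after it.

```python
from datetime import date, timedelta
from typing import Callable


def first_clean_day(start: date, end: date,
                    has_bad_tag: Callable[[date], bool]) -> date:
    """Binary-search for the first day without affected requests.

    Preconditions (assumed): has_bad_tag(start) is True, has_bad_tag(end)
    is False, and the answer flips exactly once in between.
    """
    lo, hi = start, end  # invariant: lo is affected, hi is clean
    while (hi - lo).days > 1:
        mid = lo + timedelta(days=(hi - lo).days // 2)
        if has_bad_tag(mid):
            lo = mid
        else:
            hi = mid
    return hi


if __name__ == "__main__":
    # Stand-in predicate: pretend the last affected day was 2014-06-24.
    fake = lambda d: d <= date(2014, 6, 24)
    print(first_clean_day(date(2014, 6, 19), date(2014, 8, 3), fake))
```

The same idea narrows a timestamp within the last affected day by bisecting over hours or minutes instead of days.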

Change Ibe45b47218c633973c8d3bcdb209346944955876 landed ~1 hour before and
fixed wrong imports. Not sure whether the wrong imports backfired in some way,
so that fixing them also fixed the requests?

Anyways, the requests are gone, and the relevant code in puppet has been
refactored since. Hence, closing the bug.
