Last modified: 2013-05-11 10:09:39 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T33855, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 31855 - Capitalized Domain Names are not recognized from some Systems/Providers
Capitalized Domain Names are not recognized from some Systems/Providers
Status: REOPENED
Product: Wikimedia
Classification: Unclassified
General/Unknown (Other open bugs)
unspecified
All All
: Normal normal (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2011-10-21 09:59 UTC by bugzilla_wikimedia
Modified: 2013-05-11 10:09 UTC (History)
5 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments
Screenshot on my Nexus1 showing bogus page loading http://DE.M.WIKIPEDIA.ORG/ (123.31 KB, image/png)
2011-10-24 17:51 UTC, Brion Vibber
Details
Screenshot on my Nexus1 showing ok page loading http://de.m.wikipedia.org/ (153.36 KB, image/png)
2011-10-24 17:52 UTC, Brion Vibber
Details
Screenshot on my Nexus1 showing intermittent "user-agent" error loading http://de.m.wikipedia.org/ (28.93 KB, image/png)
2011-10-24 17:54 UTC, Brion Vibber
Details

Description bugzilla_wikimedia 2011-10-21 09:59:59 UTC
this is relevant for some QR-Code Applications where capital letters give an advantage of possible link length like this one: 
http://de.wikipedia.org/wiki/QR_Code#Design-QR-Code

the code links to 
HTTP://DE.M.WIKIPEDIA.ORG


this is sometimes answered with:
* "Scripts should use an informative User-Agent string with contact information, or they may be IP-blocked without notice. "

* With a Wikimedia file not found 

* With the expected webpage like decapitalized link

it seems to depend on either the user-system (e.g. Android) or the user-network.

Could it have to do with the server(s) configuration (rewrite-rules?)
Comment 1 Mark A. Hershberger 2011-10-24 16:12:23 UTC
(In reply to comment #0)
> * "Scripts should use an informative User-Agent string with contact
> information, or they may be IP-blocked without notice. "

This is done because there is no UA provided.  See Bug 28884 and http://meta.wikimedia.org/wiki/User-Agent_policy

> * With a Wikimedia file not found

What is the URL that gives this result?

> * With the expected webpage like decapitalized link

When do you see this?

It sounds like you're getting different results depending on the platform you're using (e.g. Android, etc).  In that case, this is probably a problem with your application.
Comment 2 bugzilla_wikimedia 2011-10-24 17:43:32 UTC
could it be that some systems decapitalize the URL before sending it and it therefore seems as if it was working?

HTTP://DE.M.WIKIPEDIA.ORG

does not work with standard Android browser...  iPhone, Win32 Firefox and Explorer all work fine
Comment 3 Brion Vibber 2011-10-24 17:46:31 UTC
This definitely doesn't look INVALID to me -- reopening.

I see a cached result "Scripts should use an informative User-Agent string with contact information, or they may be IP-blocked without notice." when following the link above -- even if I manually rekey it to http://de.m.wikipedia.org/

This indicates that there's a cache-poisoning problem; something or several things along the pipeline are failing to properly validate things?
Comment 4 Brion Vibber 2011-10-24 17:51:45 UTC
Created attachment 9276 [details]
Screenshot on my Nexus1 showing bogus page loading http://DE.M.WIKIPEDIA.ORG/
Comment 5 Brion Vibber 2011-10-24 17:52:12 UTC
Created attachment 9277 [details]
Screenshot on my Nexus1 showing ok page loading http://de.m.wikipedia.org/
Comment 6 Brion Vibber 2011-10-24 17:54:59 UTC
Created attachment 9278 [details]
Screenshot on my Nexus1 showing intermittent "user-agent" error loading http://de.m.wikipedia.org/

While the "no such wiki" page shows up regularly when loading the uppercase form in the Android browser, when loading the regular lowercase form both there and on the desktop I intermittently see this error:

"Scripts should use an informative User-Agent string with contact information, or they may be IP-blocked without notice."

Seems to show up after getting a capital bogus hit, but not always, so not sure if it's from the same thing doing partial cache-poisoning or if it's another problem.
Comment 7 Patrick Reilly 2011-10-24 18:19:03 UTC
Response:
Scripts should use an informative User-Agent string with contact information, or they may be IP-blocked without notice.

Request URL: http://de.m.wikipedia.org/
Request Method: GET
Status Code: 403 Forbidden

Request Headers:
GET / HTTP/1.1
Host: de.m.wikipedia.org
Connection: keep-alive
Cache-Control: max-age=0
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_7_2) AppleWebKit/535.1 (KHTML, like Gecko) Chrome/14.0.835.202 Safari/535.1
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Encoding: gzip,deflate,sdch
Accept-Language: en-US,en;q=0.8
Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.3

Response Headers:
HTTP/1.1 403 Forbidden
Server: Apache
X-Powered-By: PHP/5.3.2-2wm1
X-Content-Type-Options: nosniff
Content-Encoding: gzip
Vary: Accept-Encoding
X-Vary-Options: Accept-Encoding;list-contains=gzip
Content-Type: text/html
X-Varnish: 1032695801 1032692292
Via: 1.1 varnish
X-Cache: hit (1)
Cache-Control: private, s-maxage=0, max-age=0, must-revalidate
X-Device: android
Content-Length: 117
Accept-Ranges: bytes
Date: Mon, 24 Oct 2011 18:14:18 GMT
X-Varnish: 747206232 747190291
Age: 42
Via: 1.1 varnish
Connection: keep-alive
X-Cache-frontend: hit (21)

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links