Last modified: 2013-02-27 12:31:02 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T43002, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 41002 - Fails to save when some Unicode characters are used, as MD5 hash changes
Fails to save when some Unicode characters are used, as MD5 hash changes
Status: UNCONFIRMED
Product: MediaWiki
Classification: Unclassified
API (Other open bugs)
unspecified
All All
: Normal normal (vote)
: ---
Assigned To: Nobody - You can work on this!
: i18n
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2012-10-13 13:21 UTC by Marios Magioladitis
Modified: 2013-02-27 12:31 UTC (History)
5 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Marios Magioladitis 2012-10-13 13:21:56 UTC
It seems that API fails to save pages that contain some special (or deprecated characters).

For instance

https://en.wikipedia.org/w/index.php?title=Wikipedia:AutoWikiBrowser/Sandbox&oldid=516371076

I fixed manually the following cases:

https://en.wikipedia.org/w/index.php?title=Dahae&diff=517542432&oldid=512734888

https://en.wikipedia.org/w/index.php?title=Osun_State&diff=517542603&oldid=516042594

When AWB meets these pages we get the following message: "Bot API returned the following error: 'The supplied MD5 hash was incorrect'".

According to Rjwilmsi: "I suspect this is an API issue, a rare one."
Comment 1 Bryan Tong Minh 2012-10-13 14:39:41 UTC
I think this has to do with Unicode normalization. WebRequest pre-normalizes any text params before passing them to the API, which will of course have as effect that the md5 sum is no longer valid.
Comment 2 Jarry1250 2012-10-13 16:07:11 UTC
So presumably AWB needs to do the normalisation itself (so it knows the post-normalisation MD5) or the API needs to accept pre-normalisation hashes (which, given the distribution of functions between WebRequest and the API seems tricky).
Comment 3 Marios Magioladitis 2013-02-27 12:31:02 UTC
http://autowikibrowser.svn.sourceforge.net/viewvc/autowikibrowser?view=revision&revision=8958
Skip page if we encounter badmd5 error from API on save, prevents cycle of attempted saves

http://autowikibrowser.svn.sourceforge.net/viewvc/autowikibrowser?view=revision&revision=8959
Unicodify to not change HTML characters 〈 and 〉 (deprecated bracket characters), if converted API returns badmd5 error

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links