Last modified: 2013-05-05 11:38:53 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T31799, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 29799 - Some pages on enwikipedia didn't have their collation updated to the 1.17 form.
Some pages on enwikipedia didn't have their collation updated to the 1.17 form.
Status: RESOLVED FIXED
Product: MediaWiki
Classification: Unclassified
Categories (Other open bugs)
unspecified
All All
: Normal normal (vote)
: ---
Assigned To: Nobody - You can work on this!
http://en.wikipedia.org/wiki/Wikipedi...
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2011-07-10 17:23 UTC by Svick
Modified: 2013-05-05 11:38 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Svick 2011-07-10 17:23:18 UTC
Some pages like [[en:Sticks and Bones (film)]] and [[en:Bill Gates]] are incorrectly sorted in their categories. See for example http://en.wikipedia.org/w/index.php?title=Category:American_films&pagefrom=Szz which incorrectly shows Sticks and Bones (film). This seems to caused by incorrect cl_sortkey in the categorylinks table: the contents of the column are not all uppercase for those articles. Also, cl_collation is an empty string for those pages.

My assumption is that during the change to the current case-insensitive sorting, some pages weren't updated for some reason.

Doing purge or null edit of the affected pages doesn't seem to fix the issue.
Comment 1 Bawolff (Brian Wolff) 2011-07-11 16:49:34 UTC
This should have been fixed when we ran the maintainence script way back. I guess the script (updateCollation.php) needs to be run again? (It should automatically only do things needing updating, so it should be fast if its only a small minority that are old).

As an aside, the only way to fix this in the editing interface is to remove cat, save, re-add cat. Purging won't do anything.


otoh, the cl_timestamp is 2011-07-07 06:18:18 which is quite recent, so maybe there's a larger problem...
Comment 2 Bartosz Dziewoński 2013-05-05 09:10:27 UTC
The issue doesn't seem to be visible at the linked category anymore.
Comment 3 Svick 2013-05-05 11:38:53 UTC
You're right. If I search for categorylinks where cl_collation != 'uppercase' on the toolserver, the only ones are for deleted pages. I think having categorylinks for deleted pages is an issue, but separate from this one.

mysql> select count(*) from categorylinks where cl_collation != 'uppercase';
+----------+
| count(*) |
+----------+
|      279 |
+----------+
1 row in set (0.00 sec)

mysql> select count(*) from categorylinks join page on cl_from = page_id where cl_collation != 'uppercase';
+----------+
| count(*) |
+----------+
|        0 |
+----------+
1 row in set (0.05 sec)

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links