Last modified: 2014-02-27 13:59:39 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T63994, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 61994 - Babel AutoCreate should check for duplicate categories
Babel AutoCreate should check for duplicate categories
Status: NEW
Product: MediaWiki extensions
Classification: Unclassified
Babel (Other open bugs)
unspecified
All All
: Normal normal (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-02-27 11:44 UTC by Mr. Stradivarius
Modified: 2014-02-27 13:59 UTC (History)
6 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Mr. Stradivarius 2014-02-27 11:44:20 UTC
On enwiki, [[User:Babel AutoCreate]] has been creating several duplicate categories. The categories differ only in capitalisation, for example:

https://en.wikipedia.org/wiki/Category:User_En

which is a duplicate of

https://en.wikipedia.org/wiki/Category:User_en

This seems to depend on what capitalisation users use in their #babel invocations. (More on this specific point at bug 61993.)

I have blocked the Babel AutoCreate account on enwiki because of this issue, but if there is a way round it I would be happy to unblock.

As well as the fix I suggested in bug 61993, I think Babel should check for possible duplicate categories at different capitalisations, and avoid automatically creating categories for which it finds a match.

Let's say a user uses a Babel invocation of {{#babel: Xyz}}. Before creating the category "User Xyz", Babel should check for the existing categories "User xyz" and "User XYZ". Checking things like "xYz" probably wouldn't be necessary. However, it would be worth checking regional variations like "xyz-ab" versus "xyz-AB". This has been an issue with enwiki's [[Category:User en-gb]] and [[Category:User en-GB]].

If duplication checking is implemented, I would suggest getting Babel AutoCreate to log possible duplicates to its user page or a user subpage so that they can be checked by a human and created manually if necessary.

This bug might be fixed by fixing bug 61993, but the code would be more robust if there was an explicit check for duplicate categories as well. An explicit check would avoid the same problem happening in the future if category code capitalisation was made configurable, for example.
Comment 1 Technical 13 2014-02-27 13:11:58 UTC
I think it should still create the alternate case duplicate category, but I think it should know that it is an alternate duplicate case category and create it as a category redirect to the proper case.  Doing this <s>will</s>should allow the other existing bot to re-categorize those mis-cased instances to the proper category.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links