Last modified: 2013-01-30 12:38:16 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links might be broken. See T35643, the corresponding Phabricator task, for complete and up-to-date bug report information.

Bug 33643 - Turkish needs lc / uc methods


Summary:	Turkish needs lc / uc methods

Status:	NEW

Product:	MediaWiki
Classification:	Unclassified
Component:	Internationalization
Version:	1.20.x
Hardware:	All All

Importance:	Low enhancement with 1 vote
Target Milestone:	---
Assigned To:	Nobody - You can work on this!

URL:
Whiteboard:
Keywords:

Duplicates:	32707 33299 40012
Depends on:
Blocks:

Reported:	2012-01-10 20:23 UTC by Antoine "hashar" Musso (WMF)
Modified:	2013-01-30 12:38 UTC
CC List:	5 users

See Also:
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---


Description Antoine "hashar" Musso (WMF) 2012-01-10 20:23:02 UTC

Split from bug 31490.

Our Turkish language class lacks a proper implementation of lc() and uc(). Turkish has both a dotted i (i / İ) and a dotless i (ı / I), which means that I and i are actually different letters in that language!
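
To illustrate what "proper" casing means here, the following is a minimal sketch in Python, not the MediaWiki implementation. The function names turkish_lc and turkish_uc are hypothetical; the only assumption is the standard Turkish pairing i / İ and ı / I.

```python
def turkish_lc(text: str) -> str:
    """Lower-case text using the Turkish i rules (illustrative sketch)."""
    # Handle the two Turkish capitals first, then fall back to the
    # generic lower-casing for every other letter.
    return text.replace("İ", "i").replace("I", "ı").lower()


def turkish_uc(text: str) -> str:
    """Upper-case text using the Turkish i rules (illustrative sketch)."""
    # Dotted i becomes dotted capital İ, dotless ı becomes plain I.
    return text.replace("i", "İ").replace("ı", "I").upper()


if __name__ == "__main__":
    print(turkish_uc("istanbul"))
    print(turkish_lc("ISPARTA"))
```

The explicit replacements run before the generic call because the default Unicode case mappings treat i and I as a pair, which is exactly what Turkish does not do.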

An implementation was deployed on the WMF wikis for 1.18, but it was reverted by r99289 & r99290 because it broke magic words and related parser functions (e.g. {{#lcfirst}}) on the Turkish wikis.

The MediaWiki code that handles magic words normalizes them to lower case using the content language (look for lc() calls in the MagicWord class). Hence a magic word such as lcfirst is treated just like a Turkish word and ends up upper-cased with a dotted i.

Two possibilities:
- Magic words could optionally be made an array referencing the language. We could then use that language, and thus the proper lc / uc implementations.
- For the Turkish language, forge magic word aliases with a dotted or dotless i, e.g. 'ucfirst' (with dot) could have an alias UCFIRST (without dot). Both would then be valid.

Optionally, parser functions could accept a parameter to change the language being used. This would let Turkish projects use the English lc / uc functions, for example to upper-case iPhone to IPhone (capital I without a dot).
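
A language parameter on a parser function might look like the sketch below. The signature is an assumption made for illustration: the lang argument and its "tr" / "en" values are hypothetical and do not describe an existing MediaWiki interface.

```python
def turkish_uc(text: str) -> str:
    # Turkish rules: i -> İ and ı -> I, generic upper-casing otherwise.
    return text.replace("i", "İ").replace("ı", "I").upper()


def ucfirst(text: str, lang: str = "tr") -> str:
    """Upper-case the first character using the rules of the given language."""
    if not text:
        return text
    first, rest = text[0], text[1:]
    # Only Turkish gets special handling in this sketch; any other
    # language code falls back to the generic upper-casing.
    upper = turkish_uc(first) if lang == "tr" else first.upper()
    return upper + rest


if __name__ == "__main__":
    print(ucfirst("iPhone"))             # content language rules
    print(ucfirst("iPhone", lang="en"))  # explicit override
```

Defaulting to the content language keeps existing wikitext unchanged, while the override covers foreign words such as brand names.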

Comment 1 Antoine "hashar" Musso (WMF) 2012-01-10 20:25:04 UTC

*** Bug 33299 has been marked as a duplicate of this bug. ***

Comment 2 Antoine "hashar" Musso (WMF) 2012-01-10 20:25:44 UTC

*** Bug 32707 has been marked as a duplicate of this bug. ***

Comment 3 Alex Monk 2012-09-05 17:35:03 UTC

*** Bug 40012 has been marked as a duplicate of this bug. ***

Comment 4 Antoine "hashar" Musso (WMF) 2012-09-07 13:34:56 UTC

This is still an ongoing issue, though I am not working on it myself.

Comment 5 Vito Genovese 2012-09-07 13:40:49 UTC

The issue with the system messages on the TR projects is quite annoying (see Bug 40012), so it'd be fantastic if it could be worked out.
