Last modified: 2014-07-05 09:52:41 UTC
In the results of a "sitematrix" API request at both Meta and English Wikipedia: <http://meta.wikimedia.org/w/api.php?action=sitematrix> <http://en.wikipedia.org/w/api.php?action=sitematrix> (and probably others, but that's all I checked), the "localname" field for the language with code "zh-min-nan" is identical to the "name" field (i.e., it's in Min Nan instead of English). I have no idea where this needs to be fixed or what it affects other than the "sitematrix" results. (I assumed since Meta and English Wikipedia results were the same that this should be submitted here as opposed to on the individual wikis [e.g., Village Pump].)
It looks like these get loaded via the cldr extension's LanguageNames::getNames() ... the default on that function is to return native-language names for anything that's not available in the language you ask for. Possibly should use FALLBACK_NORMAL instead of FALLBACK_NATIVE if it's explicitly wanting to be different from the native name.
CLDR doesn't support zh-min-nan for now. LocalNamesEn.php in cldr extension can be used to provide English names if necessary.
After I posted my comment I noticed that there are actually a great many languages for which the "localname" is the same as the "name". A full list appears below, in case someone needs it (and doesn't feel like generating it themselves). Some of these are actually "legitimate" duplications, in that the name in English really is the same as in the native language (e.g., af = Afrikaans). Most of the rest seem to be missing a proper name in English (e.g., bxr = "Буряад"), wherever that should be specified. But some actually seem to be missing a proper _native_ language name (although I haven't done any research to reveal exactly which ones those would be). Presumably, based on the above comments, anyone who can actually fix the problem(s) would be able tell what information is missing where. In the list below I've given my suggestions [in brackets] for what the English versions should be when they differ from the native names, based on my own (somewhat old) records. I haven't "verified" the native names, although Wikipedia articles could obviously be used to do this. (As you may know, the naming of languages can be a contentious topic...) af = Afrikaans ak = Akan als = Alemannisch ["Alemannic"] bcl = Bikol Central ["Central Bicolano"] be-x-old = Беларуская (тарашкевіца) ["Belarusian (Tarashkevitsa)"] bi = Bislama bjn = Bahasa Banjar ["Banjar"] bxr = Буряад ["Buryat (Russia)"] cbk-zam = Chavacano de Zamboanga ["Zamboanga Chavacano"] ceb = Cebuano cho = Choctaw diq = Zazaki eo = Esperanto fiu-vro = Võro gag = Gagauz hif = Fiji Hindi ho = Hiri Motu ia = Interlingua ie = Interlingue ig = Igbo io = Ido jbo = Lojban kg = Kongo kl = Kalaallisut ["Greenlandic"] kr = Kanuri lad = Ladino lbe = Лакку ["Lak"] lij = Líguru ["Ligurian"] lmo = Lumbaart ["Lombard"] map-bms = Basa Banyumasan ["Banyumasan"] mg = Malagasy nds-nl = Nedersaksisch ["Dutch Low Saxon"] nov = Novial nrm = Nouormand ["Norman"] oc = Occitan pag = Pangasinan pcd = Picard pdc = Deitsch ["Pennsylvania German"] pfl = Pälzisch ["Palatinate German"] pih = Norfuk / Pitkern ["Norfolk"] rmy = Romani roa-tara = Tarandíne ["Tarantino"] rw = Kinyarwanda sco = Scots stq = Seeltersk ["Saterland Frisian"] tl = Tagalog tpi = Tok Pisin tw = Twi vec = Vèneto ["Venetian"] vls = West-Vlams ["West Flemish"] vo = Volapük wo = Wolof zh-min-nan = Bân-lâm-gú ["Min Nan"]
BTW, in the light of what I just posted, maybe someone who understands this problem better than I could suggest a more appropriate title for this bug?
It's not really a bug that language name coverage is not perfect. I think people are not used to that not everything is in English first. Anyway Robin has proposed a fix for this bug on wikitech.
(In reply to Niklas Laxström from comment #5) > Anyway Robin > has proposed a fix for this bug on wikitech. http://markmail.org/thread/oiltb5urgwcxqdl5 / https://translatewiki.net/wiki/User:SPQRobin/languages I suppose. Most of the language names above have been added to CLDR and can be translated there, see https://translatewiki.net/wiki/Thread:Support/Language_names_not_in_Unicode_CLDR zh-min-nan is not one of those 100... but you can ask it (and any other) to be added with a request similar to http://unicode.org/cldr/trac/ticket/6763, i.e. «Add language $CODE to the "Locale Display Names - Languages"». http://unicode.org/cldr/trac/newticket