Last modified: 2014-11-12 20:34:21 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T72561, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 70561 - Accents are not ignored by autocompletion in fr.wiktionary
Accents are not ignored by autocompletion in fr.wiktionary
Status: RESOLVED FIXED
Product: MediaWiki extensions
Classification: Unclassified
CirrusSearch (Other open bugs)
unspecified
All All
: Unprioritized normal (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-09-08 15:36 UTC by automatik68
Modified: 2014-11-12 20:34 UTC (History)
6 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Comment 1 Nik Everett 2014-09-08 15:38:04 UTC
Its a reasonably easy thing to turn on but right now its only on in English.  We can turn it on if its the right thing.
Comment 2 Nik Everett 2014-09-08 15:45:34 UTC
Another thing - the behavior if I turn on accent ignoring is wider then just autocomplete - its in search as well.  When that is done perfect accent matching pulls the result higher in search but accent mismatching results still show up.
Comment 4 automatik68 2014-09-08 15:53:14 UTC
ever->already*
Comment 5 Nik Everett 2014-09-08 15:55:08 UTC
Hmmm......  I'll investigate that - since no one has complained about that behavior I imagine its correct or at least ok.  Either way - if prefix search should have it I'll file this bug and see about getting it in there.  Won't be super soon - but I'll get to it.
Comment 6 automatik68 2014-09-08 16:24:40 UTC
I'm not sure what you mean by "no one complained" but I opened this bug after this point: https://fr.wiktionary.org/wiki/Wiktionnaire:Wikid%C3%A9mie/septembre_2014#A_propos_du_moteur_de_recherche
Comment 7 Nik Everett 2014-09-08 17:02:37 UTC
Sorry, I mean search flattening the accents didn't receive any complaints when we turned CirrusSearch on for frwiktionary a few months ago.  At least I don't think anyone did.

Anyway - I'll have a look at turning on accent squashing for frwiktionary soon.
Comment 8 Gerrit Notification Bot 2014-09-17 17:30:14 UTC
Change 160990 had a related patch set uploaded by Manybubbles:
Add asciifolding to some French analyzers

https://gerrit.wikimedia.org/r/160990
Comment 9 Nik Everett 2014-09-17 17:34:26 UTC
I've added a proposal to flatten all accented characters into non-accented ones for prefix search and exact title matches.  It'll require rebuilding the index but that is no big deal.

Note:  I found out where the other normalization comes from.  The French stemmer we use for inexact matches performs the following mappings:
'à', 'á', 'â' -> 'a'
'ô' -> 'o'
'è', 'é', 'ê' -> 'e'
'ù', 'û' -> 'u'
'î' -> 'i'
'ç' -> 'c'

I could, if you believe it is more correct, only perform those mappings for the prefix and exact title matching.
Comment 10 automatik68 2014-09-18 09:19:57 UTC
I can't be sure it's more correct, could you tell me what is done for fr.wikipedia please? Accents are flatened for this site too.
Comment 11 Gerrit Notification Bot 2014-09-18 23:11:15 UTC
Change 160990 merged by jenkins-bot:
Add asciifolding to some French analyzers

https://gerrit.wikimedia.org/r/160990
Comment 12 Andre Klapper 2014-11-12 15:13:22 UTC
All patches mentioned in this report were merged or abandoned - is there more work left to do here (if yes: please reset the bug report status to NEW or ASSIGNED), or can you close this ticket as RESOLVED FIXED?

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links