Last modified: 2014-03-07 11:44:09 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T34029, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 32029 - Some vowel combination in Sinhala Wijesekara need to be corrected
Some vowel combination in Sinhala Wijesekara need to be corrected
Status: NEW
Product: MediaWiki extensions
Classification: Unclassified
UniversalLanguageSelector (Other open bugs)
unspecified
All All
: Low normal (vote)
: ---
Assigned To: Nobody - You can work on this!
: i18n
Depends on:
Blocks: 53014
  Show dependency treegraph
 
Reported: 2011-10-29 08:07 UTC by Junaid
Modified: 2014-03-07 11:44 UTC (History)
10 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Junaid 2011-10-29 08:07:11 UTC
Some combination of letteres with vowel are incorrect in  Sinhala Wijesekara scheme.

(These are mailed by Sinhala wikipedian)
Currently:
'sf' produce 'ෙජ', it should be 'ජෙ'
'fc' produce 'ෙජ', it should be 'ජෙ'
'fi' produce 'ෙස', it should be 'සෙ'
Comment 1 බිඟුවා 2011-10-29 08:14:07 UTC
(In reply to comment #0)
> Some combination of letteres with vowel are incorrect in  Sinhala Wijesekara
> scheme.
> 
> (These are mailed by Sinhala wikipedian)
> Currently:
> 'sf' produce 'ෙජ', it should be 'ජෙ'
> 'fc' produce 'ෙජ', it should be 'ජෙ'
> 'fi' produce 'ෙස', it should be 'සෙ'

This is not correct jv,
'sf' produce 'ෙජ', it should be 'ජෙ'

f=ෙ is correct.
but when it join with other letters like this

'fc' produce 'ෙජ', it should be 'ජෙ'
'fi' produce 'ෙස', it should be 'සෙ'
'fl' produce 'ෙක' it should be 'කෙ'
Comment 2 Thambaru Wijesekara (තඹරු විජේසේකර) 2011-10-29 13:13:58 UTC
Yes; Also letter combination isn't working.
eg: " ;|j " gives "ත|ව", instead of "ත්‍ව"
    " u|u " gives "ම|ම" instead of "ම‍්ම"
    " o|O " gives "ද|ධ" instead of "ද‍්ධ"
    " k\O " gives "න\ධ" instead of "න්‍ධ"
There're more cases on this pattern.
Comment 3 Junaid 2011-10-31 18:37:32 UTC
Wijesekara is a fixed layout so the character should be types as we want.

(In reply to comment #1)
> 'fc' produce 'ෙජ', it should be 'ජෙ'
> 'fi' produce 'ෙස', it should be 'සෙ'
> 'fl' produce 'ෙක' it should be 'කෙ'

I think the above typing is incorrect. We will be adding the vowel sign 'ෙ' to existing consonant(s). Rather than typing 'fc', 'fi', 'fl', we should type 'cf', 'if', 'lf' producing correct combinations 'ජෙ', 'සෙ', 'කෙ'.


(In reply to comment #2)
> Yes; Also letter combination isn't working.
> eg: " ;|j " gives "ත|ව", instead of "ත්‍ව"
>     " u|u " gives "ම|ම" instead of "ම‍්ම"
>     " o|O " gives "ද|ධ" instead of "ද‍්ධ"
>     " k\O " gives "න\ධ" instead of "න්‍ධ"
> There're more cases on this pattern.

From existing link, http://www.ucsc.lk/ltrl/services/layout/, I have no guide to implement this patterns. Are there any complete implementation of this layout as a help?
Comment 4 Santhosh Thottingal 2011-11-09 05:24:50 UTC
(In reply to comment #3)
> From existing link, http://www.ucsc.lk/ltrl/services/layout/, I have no guide
> to implement this patterns. Are there any complete implementation of this
> layout as a help?

GNU/Linux Distros ship this layout for Sinhala. We can refer its behavior or read the source code(package  m17n-contrib, File: /usr/share/m17n/si-wijesekera.mim)
Comment 5 Siebrand Mazeland 2012-01-16 15:36:46 UTC
What remains to be done to resolve this issue?
Comment 6 Siebrand Mazeland 2012-02-01 11:38:22 UTC
Changed priority and type.
Comment 7 Amir E. Aharoni 2012-07-12 11:53:48 UTC
I am starting to review this layout and to add tests for it. First commit: https://gerrit.wikimedia.org/r/15531 . If I find that the current implementation is correct according to the documents that describe the layout, I'll close the bug, and if not, then I'll fix them.
Comment 8 Santhosh Thottingal 2012-07-15 00:02:55 UTC
I just checked the m17n version of this input tool. 

> 'sf' produce 'ෙජ', it should be 'ජෙ'
> 'fc' produce 'ෙජ', it should be 'ජෙ'
> 'fi' produce 'ෙස', it should be 'සෙ'

All these are valid. This input method is smart to apply the vowel signs even it is written to the left or right. That means. both cf and fc produce ජෙ . or in otherwords, one can type in unicode order or visual order. Now this is something very complex and non standard. But it is implemented in m17n with lot of code. I doubt Narayam is capable of doing this, but we need to try with some regex to start with.
Comment 9 Amir E. Aharoni 2012-07-18 00:20:38 UTC
I'll use these references to check the correctness of the implementation:

* http://www.icta.lk/attachments/658_658_SLS%201134.pdf
* http://cvs.savannah.gnu.org/viewvc/m17n-db/MIM/si-wijesekera.mim?root=m17n&view=markup
Comment 10 Amir E. Aharoni 2012-07-19 04:39:30 UTC
I'm reading the documents and I think that I understand the problem now: It is supposed to be a visual layout, where typing order is not necessarily the same as the logical order. I'll write tests and implementation accordingly.
Comment 11 Amir E. Aharoni 2012-07-19 12:56:56 UTC
Partial patch submitted here:
https://gerrit.wikimedia.org/r/#/c/16019/

It only fixes the typing of consonants with inherent vowel, pure consonants and vowel letters. Other types of characters will be done later.
Comment 12 Junaid 2012-07-28 04:39:29 UTC
This can be better handled by callback(s): bug 35457
Comment 13 db [inactive,noenotif] 2012-12-01 10:48:53 UTC
(In reply to comment #11)
> https://gerrit.wikimedia.org/r/#/c/16019/
Status Merged

Merged Gerrit change #16254 links here

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links