Last modified: 2014-11-03 10:23:25 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T65918, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 63918 - Missing whitespaces between words in article's last line on hy.wp
Missing whitespaces between words in article's last line on hy.wp
Status: NEW
Product: MediaWiki
Classification: Unclassified
Parser (Other open bugs)
1.23.0
All All
: High normal (vote)
: ---
Assigned To: Parsoid Team
: i18n
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-04-14 22:52 UTC by Norayr Chilingarian
Modified: 2014-11-03 10:23 UTC (History)
7 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments
lost whitespaces in the last line. (270.27 KB, image/png)
2014-04-14 22:52 UTC, Norayr Chilingarian
Details

Description Norayr Chilingarian 2014-04-14 22:52:21 UTC
Created attachment 15103 [details]
lost whitespaces in the last line.

Whitespaces between last words of the page disappear sometimes. I have noticed the issue first in my own mediawiki's but today eventually have seen it online on Wikipedia.

This is the article for example: http://hy.wikipedia.org/wiki/%D5%87%D5%A1%D5%BC%D5%AC_%D4%B9%D5%A5%D5%B8%D5%A4%D5%B8%D6%80_%D4%B1%D5%B6%D6%80%D5%AB_%D5%A4%D5%A8_%D4%BF%D5%B8%D5%BD%D5%BF%D5%A5%D6%80

If one moves Categories to the upper part of the text, the problem disappears. I have noticed it since, I guess, 17th version of mediawiki.

Thank you
Comment 1 Andre Klapper 2014-04-15 18:33:05 UTC
Thanks for taking the time to report this!

Confirming that the text should be
Մահացել է 1879 թվականի մայիսի 7-ին, մենակ և կարիքավոր վիճակում։
but the last space is missing, using Firefox 28 on Fedora 20.
Comment 2 Andre Klapper 2014-07-03 21:48:27 UTC
Can still be seen on https://hy.wikipedia.org/wiki/Շառլ_Թեոդոր_Անրի_դը_Կոստեր

   Մահացել է 1879 թվականի մայիսի 7-ին, մենակ և կարիքավոր վիճակում։
can be seen in the wikitext when editing the page.

But the HTML source of the page says:
<p>Մահացել է 1879 թվականի մայիսի 7-ին, մենակ ևկարիքավորվիճակում։</p>

I can also confirm that removing this section fixes the problem:
{{DEFAULTSORT:դը Կոստեր, Շառլ}}
[[Կատեգորիա:Բելգիացի գրողներ]]
[[Կատեգորիա:1827 ծնունդներ]]
[[Կատեգորիա:Օգոստոսի 20 ծնունդներ]]
[[Կատեգորիա:1879 մահեր]]
[[Կատեգորիա:Մայիսի 7 մահեր]]
Comment 3 Andre Klapper 2014-08-05 22:55:22 UTC
Gabriel, James: As this seems to be Parser territory, any idea who could investigate here? See summary and testcase in comment 2.
Comment 4 James Forrester 2014-08-06 14:31:08 UTC
Adding Subbu as parser/Parsoid leader.
Comment 5 Andre Klapper 2014-09-22 09:49:38 UTC
Subbu: Any idea?
Comment 6 ssastry 2014-09-22 15:27:39 UTC
Confirmed that this is PHP parser only. http://parsoid-lb.eqiad.wikimedia.org/hywiki/%D5%87%D5%A1%D5%BC%D5%AC_%D5%A4%D5%A8_%D4%BF%D5%B8%D5%BD%D5%BF%D5%A5%D6%80?oldid=1468063 shows the right output from Parsoid.

This bug smells a bit like a bad regexp because of the presence of some characters.

Scott has traditionally poked around the PHP parser but he is currently occupied with PDF migration. We are currently stretched a bit thin, but we'll reassign to one of us if this is still not addressed in next 3 weeks (we have our quarterly review coming up). But, dont want to prematurely assign it to one of us if someone else can investigate.
Comment 7 Andre Klapper 2014-10-27 13:29:54 UTC
(In reply to ssastry from comment #6)
> we'll reassign to one of us if this is still not addressed in next 3 weeks

4 weeks are over. ssastry?
Comment 8 C. Scott Ananian 2014-10-28 16:26:47 UTC
I can probably take a look at this, but it's not high priority.

Can someone reproduce this on enwiki?  It would make writing a test case much easier if so.

(As a first step, copy-and-pasting the wikitext from hywiki to a sandbox on enwiki and looking at the render would let us know if the bug relates to a hywiki setting (ie, link prefix or link suffix).)
Comment 9 Xelgen 2014-11-03 10:23:25 UTC
I've made English only text, to check if this has something to do with page content language.

This doesn't reproduce on en:WP (https://en.wikipedia.org/wiki/User:Xelgen/sandbox)
This also doesn't reproduce on en:WP when you switch to Armenian UI language.

Same time issue is being observed on hy:WP with English UI language. (http://hy.wikipedia.org/wiki/%D5%84%D5%A1%D5%BD%D5%B6%D5%A1%D5%AF%D5%AB%D6%81:Xelgen/LastWhitespace)


But I doubt it's solely WMF configuration issues, as in first comment report mentioned he observed this on his own MediaWiki installation. 

For example:
http://chtesutyun.arnet.am/index.php/%D5%84%D5%A1%D5%BD%D5%B6%D5%A1%D5%AF%D5%AB%D6%81:Xelgen
http://grapaharan.org/index.php?title=%D5%84%D5%A1%D5%BD%D5%B6%D5%A1%D5%AF%D5%AB%D6%81:Xelgen/LastWhitespace

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links