Last modified: 2014-09-12 21:10:17 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T65273, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 63273 - Improve recognition of broken quoting in HTML attributes
Improve recognition of broken quoting in HTML attributes
Status: NEW
Product: Parsoid
Classification: Unclassified
General (Other open bugs)
unspecified
All All
: Low normal
: ---
Assigned To: Parsoid Team
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-03-30 10:35 UTC by Helder
Modified: 2014-09-12 21:10 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Helder 2014-03-30 10:35:44 UTC
On this edit
https://pt.wikipedia.org/w/index.php?diff=38540603
I just replaced the "o" by an "a", but Parsoid removed a </div> from other part of the page:
http://parsoid-lb.eqiad.wikimedia.org/_rt/ptwiki/?oldid=38538929
Comment 1 Gabriel Wicke 2014-03-31 18:33:18 UTC
The more relevant test is http://parsoid-lb.eqiad.wikimedia.org/_rtselser/ptwiki/?oldid=38538929, but that shows the same issue currently. Investigating.
Comment 2 ssastry 2014-03-31 18:40:58 UTC
<div style="background: "#ccccff"; color: #000000;" class="NavHead">Cronologia de Mandela (1918–2010) ... 

That <div> has bad quoting which cause the <div> to be parsed as plain text and somehow seems to throw off parsing and diffs. I haven't investigated why that causes diffs, but that should explain why the </div> is lost because the opening <div> is now unmatched.

Search for Cronologia de Mandela (1918–2010) on the page in Comment 1 that gwicke pasted.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links