Last modified: 2014-08-20 22:34:45 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T70146, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 68146 - XML output can contain invalid comment syntax copied from wikitext
XML output can contain invalid comment syntax copied from wikitext
Status: NEW
Product: Parsoid
Classification: Unclassified
General (Other open bugs)
unspecified
All All
: High normal
: ---
Assigned To: Parsoid Team
:
Depends on:
Blocks: 66659
  Show dependency treegraph
 
Reported: 2014-07-17 08:57 UTC by D Chan
Modified: 2014-08-20 22:34 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description D Chan 2014-07-17 08:57:19 UTC
Some articles contain de-facto HTML-style comments like <!--- foo ----> (with more than two dashes). Parsoid passes these unchanged into the output XML, which is then invalid.

Failure example:

  $ wget -q 'http://parsoid.wmflabs.org/enwiki/Bratislava?oldid=617085374' -O - | xmlwf
  STDIN:6:3: not well-formed (invalid token)"

Non-failure example:

  $ wget -q 'http://parsoid.wmflabs.org/enwiki/Bratislava?oldid=617286772' -O - | xmlwf

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links