Last modified: 2014-10-16 19:30:30 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links may be broken. See T48705, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 46705 - Parsoid-based wikitext "linting" tool for "buggy" / "deprecated" wikitext usage; keywords: broken wikitext information
Status: NEW
Product: Parsoid
Classification: Unclassified
Component: General
Version: unspecified
Hardware: All All
Importance: Low normal
Target Milestone: ---
Assigned To: Gabriel Wicke
Keywords: easy
Duplicates: 51945
Depends on:
Blocks:
Reported: 2013-03-29 20:50 UTC by ssastry
Modified: 2014-10-16 19:30 UTC (History)

Description ssastry 2013-03-29 20:50:47 UTC
While parsing and running its various transformations, Parsoid gathers enough information about buggy wikitext usage to pass it back to editors for fixing (e.g. content fostered out of table rows because of missing td-wikitext tags, missing newlines, etc.).
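
To make the table-row case concrete, here is a minimal sketch of such a check. It is a line-based heuristic written for illustration only, not Parsoid code: the function name `find_fostered_lines` is hypothetical, and it assumes the common one-marker-per-line table layout (`{|`, `|-`, `|`, `!`, `|}`). It flags text that sits inside a table but before any cell has been opened in the current row.

```python
def find_fostered_lines(wikitext):
    """Return (line_number, text) for table lines that belong to no cell.

    Heuristic sketch (assumption: table markup starts at the beginning
    of a line). Not a real wikitext parser.
    """
    findings = []
    depth = 0        # nesting depth of {| ... |} tables
    in_cell = False  # True once a cell or caption has started in this row
    for lineno, raw in enumerate(wikitext.splitlines(), start=1):
        line = raw.strip()
        if line.startswith("{|"):
            depth += 1
            in_cell = False
        elif depth == 0 or not line:
            continue                 # outside any table, or a blank line
        elif line.startswith("|}"):
            depth -= 1
            in_cell = depth > 0      # back inside the enclosing table's cell
        elif line.startswith("|-"):
            in_cell = False          # new row: no cell opened yet
        elif line.startswith(("|", "!")):
            in_cell = True           # data cell, header cell, or caption
        elif not in_cell:
            findings.append((lineno, line))
    return findings


sample = "{|\n|-\n| ok cell\nmore cell text\n|-\nstray text\n| late cell\n|}\noutside"
fostered = find_fostered_lines(sample)
```

On the sample, only "stray text" (line 6) is reported: it follows a row marker with no cell marker, whereas "more cell text" continues an open cell.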

This can also be a good way to slowly deprecate editors' reliance on edge-case behavior (e.g. multi-comment whitespace lines are treated differently from single-comment whitespace lines; this is just a side effect of the PHP parser code, and the parser should be made consistent once that usage has been deprecated).
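
As a rough illustration of what a deprecation check for the comment case could look like, the sketch below reports lines that consist only of whitespace and two or more HTML comments. The function name `find_multi_comment_lines` is hypothetical, only single-line comments are considered, and the sketch makes no claim about how either parser actually treats such lines.

```python
import re

# A single-line HTML comment; the tempered dot keeps one match from
# swallowing a "-->" and running on into the next comment.
COMMENT = r"<!--(?:(?!-->).)*-->"
# A line made up only of spaces/tabs and one or more comments.
COMMENT_ONLY_LINE = re.compile(rf"^[ \t]*(?:{COMMENT}[ \t]*)+$")


def find_multi_comment_lines(wikitext):
    """Return (line_number, comment_count) for whitespace-and-comment-only
    lines that hold at least two comments."""
    findings = []
    for lineno, line in enumerate(wikitext.splitlines(), start=1):
        if COMMENT_ONLY_LINE.match(line):
            count = len(re.findall(COMMENT, line))
            if count >= 2:
                findings.append((lineno, count))
    return findings


sample = "a\n<!-- one -->\n <!-- one --> <!-- two -->\n<!-- x --> text <!-- y -->\nb"
multi = find_multi_comment_lines(sample)
```

Only line 3 of the sample is reported. Line 2 has a single comment, and line 4 contains text between its comments, so neither is a multi-comment whitespace line.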

This is more of a longer-term goal and could be a good self-contained project for someone.
Comment 1 Gabriel Wicke 2013-08-14 00:06:14 UTC
Another thing we could potentially lint for is auto-inserted start / end tags as mentioned in bug 51945. These are fairly common, so some filter would be needed to narrow it down to cases that are likely to cause problems.
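
One possible shape for such a filter, sketched with Python's standard `html.parser` rather than Parsoid's own tokenizer: collect start tags that never receive an explicit end tag, then drop tag names whose end tag is routinely omitted. The names `UnclosedTagFinder`, `find_unclosed_tags` and the contents of `USUALLY_HARMLESS` are assumptions made for this example; which cases are actually "likely to cause problems" would need real data.

```python
from html.parser import HTMLParser

VOID_TAGS = {"br", "hr", "img", "input", "wbr", "meta", "link"}
# Assumption: end tags that are commonly omitted without visible breakage.
USUALLY_HARMLESS = {"li", "p", "td", "th", "tr", "dd", "dt"}


class UnclosedTagFinder(HTMLParser):
    """Tracks start tags that never get an explicit end tag."""

    def __init__(self):
        super().__init__()
        self.stack = []     # open tags as (tag, (line, column))
        self.unclosed = []  # tags implicitly closed by an outer end tag

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append((tag, self.getpos()))

    def handle_endtag(self, tag):
        # Find the nearest matching open tag; everything opened after it
        # was never closed explicitly.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                self.unclosed.extend(self.stack[i + 1:])
                del self.stack[i:]
                return
        # Stray end tags are ignored in this sketch.


def find_unclosed_tags(text, ignore=USUALLY_HARMLESS):
    """Return sorted ((line, column), tag) pairs for tags whose end tag
    would have to be auto-inserted, minus the ignored tag names."""
    finder = UnclosedTagFinder()
    finder.feed(text)
    finder.close()
    leftovers = finder.unclosed + finder.stack
    return sorted((pos, tag) for tag, pos in leftovers if tag not in ignore)


sample = "<div><b>bold<i>both</b></div>\n<ul><li>one<li>two</ul>\n<span>never closed"
report = find_unclosed_tags(sample)
```

On the sample, the unclosed `<i>` on line 1 and `<span>` on line 3 are reported, while the two `<li>` elements are filtered out as usually harmless.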
Comment 2 Gabriel Wicke 2013-08-14 00:07:49 UTC
*** Bug 51945 has been marked as a duplicate of this bug. ***
Comment 3 Elitre 2014-02-26 16:37:24 UTC
To help decide whether to be more lenient in accepting bad table-row wikitext, here's an example of how things can go wrong:
https://fr.wikipedia.org/w/index.php?title=Aquila_Italiana&diff=101605787&oldid=90404051
(From chatting with Subbu, it seems to be caused by unnecessary | marks.)
Comment 6 ssastry 2014-03-06 16:11:59 UTC
Actually https://en.wikipedia.org/wiki/Wikipedia_talk:WikiProject_Check_Wikipedia is a related project that we should incorporate into our discussion.

The following section of that page is relevant to this project:
https://en.wikipedia.org/wiki/Wikipedia_talk:WikiProject_Check_Wikipedia#Round_2
Comment 7 Helder 2014-03-06 18:03:38 UTC
Just in case there is something of interest there, see
https://pt.wikipedia.org/wiki/MediaWiki:Gadget-APC.js/List.js
https://pt.wikipedia.org/wiki/Wikipedia:APC
