Last modified: 2014-09-01 15:38:05 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T52513, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 50513 - VisualEditor: Set up mass round-trip testing infrastructure on real content
VisualEditor: Set up mass round-trip testing infrastructure on real content
Status: ASSIGNED
Product: VisualEditor
Classification: Unclassified
MediaWiki integration (Other open bugs)
unspecified
All All
: High normal
: ---
Assigned To: Krinkle
:
: 56330 (view as bug list)
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-07-01 18:49 UTC by Gabriel Wicke
Modified: 2014-09-01 15:38 UTC (History)
8 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Gabriel Wicke 2013-07-01 18:49:40 UTC
Automated mass round-trip testing on actual page content would be useful to ensure proper HTML round-tripping in VE. This is very similar to your existing DOM sanity check. Basically load it into DM, export it again and check that the result is identical.

You can probably reuse parts of our distributed test infrastructure for this (currently rt testing 160k pages from various wikis through Parsoid), and can directly use the cached HTML from production as the input.
Comment 1 James Forrester 2013-07-01 19:07:59 UTC
Timo,

Let's use this bug for what we discussed. As I suggested, we should probably run on:
* enwiki featured articles (~ 4k), fixed revision (so if we regress we notice)
* enwiki ~ 5k most recently-changed articles (Special:RecentChanges)
* {en,fr,de,it,es,nl,he,ru,ar,ja,ko,vi}wiki ~ 5k random articles (Special:Random)

Thoughts?
Comment 2 Helder 2013-07-01 19:33:28 UTC
(In reply to comment #1)
...
> * {en,fr,de,it,es,nl,he,ru,ar,ja,ko,vi}wiki ~ 5k random articles
Could you add also 'pt' to this list?
Comment 3 James Forrester 2013-07-01 20:00:49 UTC
(In reply to comment #2)
> (In reply to comment #1)
> ...
> > * {en,fr,de,it,es,nl,he,ru,ar,ja,ko,vi}wiki ~ 5k random articles
> Could you add also 'pt' to this list?

Sure. It was just writing a quick list rather than setting it in stone.

When we expand to cover language variants we'll want to expand the list further - for example, zh. :-)
Comment 4 Krinkle 2013-07-01 20:47:16 UTC
So based on discussions with Gabriel:

* Parsoid has a better organised infrastructure for this than we do, so let's use that as a base. Right now they periodically run their sets of roundtrip tests on a certain set of articles.

1) Change that set of articles to include and/or match James' specification.
2) Improve ve-dirtydiffbot to not just do parsoid-ve-ve-parsoid roundtrip but also parsoid-ve-ve rountrip (e.g. parsoid dom > ve linmod > ve dom; "sanity check")

2) Extend the test runner to include 2 pieces of information for each article in addition to the data parsoid gathers:
  - result of parsoid-dom > ve linmod > ve dom ("sanity check")
  - diff of parsoid-dom > ve linmod > ve dom > parsoid dom ("full wikitext roundtrip")[1]


[1] this is the one that ve-dirtydiffbot is currently doing.
Comment 5 James Forrester 2013-07-01 21:46:46 UTC
(In reply to comment #4)
> So based on discussions with Gabriel:
> 
> * Parsoid has a better organised infrastructure for this than we do, so let's
> use that as a base. Right now they periodically run their sets of roundtrip
> tests on a certain set of articles.
> 
> 1) Change that set of articles to include and/or match James' specification.

Include, not switch, please; the stuff that Parsoid is doing for RT tests should also be expanded, IMO.
Comment 6 James Forrester 2014-03-01 02:27:18 UTC
*** Bug 56330 has been marked as a duplicate of this bug. ***

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links