Last modified: 2013-12-13 20:08:49 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T59071, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 57071 - Round-trip test of itwiki/Campionato_mondiale_Supersport_2004 never finishes
Round-trip test of itwiki/Campionato_mondiale_Supersport_2004 never finishes
Status: RESOLVED FIXED
Product: Parsoid
Classification: Unclassified
General (Other open bugs)
unspecified
All All
: Unprioritized normal
: ---
Assigned To: Gabriel Wicke
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-11-14 19:30 UTC by Marc Ordinas i Llopis
Modified: 2013-12-13 20:08 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Marc Ordinas i Llopis 2013-11-14 19:30:22 UTC
node roundtrip-test.js --prefix it Campionato_mondiale_Supersport_2004

hasn't finished after 2 hours of CPU time. Its memoryn usage is stable between 650 and 730MB.
Comment 1 Marc Ordinas i Llopis 2013-11-14 21:22:38 UTC
After 4 hours, it finished with

FATAL ERROR: JS Allocation failed - process out of memory

so it may be related to bug #57069.
Comment 2 Arlo Breault 2013-11-15 08:53:26 UTC
I noticed that _rt in the API handles these pages just fine. The difference being that env.conf.parsoid.editMode = true there.

Setting that in roundtrip-test.js seems to solve the problem.

Not really sure what editMode is though.
Comment 3 ssastry 2013-11-15 16:12:16 UTC
Oh .. interesting .. and also odd. I rememeber clearly gabriel or I turning on edit mode for round trip testing as well. Did that regress? But, also indicates some bug when editMode is not set.

editMode indicates that we are pretending as if the document will be edited and the HTML should be serialized assuming it was edited (even though, in reality in all our commandline and roundtrip testing, it never is). editMode effectively prevents us from using certain kinds of source information about wikitext fixes that the parser makes (ex: if a closing tag was added automatically, that information is recorded in the autoInserted* flags in data-parsoid. If editMode=false, we can use that information to acccurately serialize the HTML. But if editMode=true, we cannot use that information unconditionally).
Comment 4 ssastry 2013-11-15 16:23:38 UTC
Or maybe not .. I think I was confused with turning it on for parse.js and parserTests. We know that setting editMode to true for roundtrip-test.js will generate a ton of noisy diffs which are irrelevant in production where selser suppresses all the noise.
Comment 5 Arlo Breault 2013-11-15 18:48:18 UTC
The output of `Util.parse()` is ~3x (for this test case). The difference seems to be that, in dom.cleanup.js, "mw:Placeholder/StrippedTag" metas aren't being removed when editMode is false.

Is that desirable?
Comment 6 ssastry 2013-11-15 19:55:26 UTC
3x! wow. that is a lot. That page must have a lot of stray closing tags then .. or we have a bug in our markBuilderCorrectedTags code.
Comment 7 ssastry 2013-11-15 19:56:00 UTC
And yes, with editMode false, we want them around so we can rerender the stray closing tag in html2wt mode.
Comment 8 Arlo Breault 2013-11-15 22:14:43 UTC
(In reply to comment #6)
> 3x! wow. that is a lot. That page must have a lot of stray closing tags then
> ..
> or we have a bug in our markBuilderCorrectedTags code.

It's not the quantity that's the problem, it's that the data-parsoid for some of them are getting stuffed with the entire page source. That's bloating the dom and reeking havoc on the algo. in jsDiff.
Comment 9 Gerrit Notification Bot 2013-11-16 00:56:01 UTC
Change 95747 had a related patch set uploaded by Arlolra:
tagId was incorrectly used as tsr info in TreeBuilderFixups

https://gerrit.wikimedia.org/r/95747
Comment 10 Gerrit Notification Bot 2013-11-16 03:55:22 UTC
Change 95747 merged by jenkins-bot:
tagId was incorrectly used as tsr info in TreeBuilderFixups

https://gerrit.wikimedia.org/r/95747
Comment 11 Gerrit Notification Bot 2013-12-13 19:54:49 UTC
Change 101300 had a related patch set uploaded by GWicke:
tagId was incorrectly used as tsr info in TreeBuilderFixups

https://gerrit.wikimedia.org/r/101300
Comment 12 Gerrit Notification Bot 2013-12-13 19:57:43 UTC
Change 101300 merged by GWicke:
tagId was incorrectly used as tsr info in TreeBuilderFixups

https://gerrit.wikimedia.org/r/101300

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links