Last modified: 2013-07-25 07:08:13 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from the display of bug reports and their history, links might be broken. See T35878, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 33878 - [Regression] HTML dump is missing all images due to new FileBackend code
Status: RESOLVED FIXED
Product: MediaWiki extensions
Classification: Unclassified
Component: DumpHTML
Version: unspecified
Hardware: All All
Importance: High normal
Target Milestone: MW 1.20 version
Assigned To: Nobody - You can work on this!
Reported: 2012-01-22 12:56 UTC by Thomas Bleher
Modified: 2013-07-25 07:08 UTC (History)

Description Thomas Bleher 2012-01-22 12:56:26 UTC
The new FileBackend code (merged in r106752) broke the file handling of DumpHTML completely. With current trunk, no referenced images are added to the HTML dump at all.

Whenever the dumper encounters an image, the following error occurs:
PHP Warning:  file_exists(): Unable to find the wrapper "mwstore" - did you forget to enable it when you configured PHP? in /srv/www/mediawiki/extensions/DumpHTML/dumpHTML.inc on line 1295

Can someone knowledgeable about the new FileBackend code please fix this?
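
The warning above suggests that file_exists() was handed an mwstore:// storage path, a scheme for which PHP has no registered stream wrapper. The following is a minimal sketch of how such a path could be detected before it reaches file_exists(). It is not taken from DumpHTML or MediaWiki: the helper name hasUsableWrapper() and the sample storage path are hypothetical.

```php
<?php
// Hypothetical helper (not part of DumpHTML or MediaWiki): returns true only
// when the path's scheme, if it has one, is a registered PHP stream wrapper.
function hasUsableWrapper( string $path ): bool {
    // Look for a leading "scheme://"; plain local paths have none.
    if ( !preg_match( '!^([a-z][a-z0-9+.-]*)://!i', $path, $m ) ) {
        return true; // ordinary filesystem path
    }
    return in_array( strtolower( $m[1] ), stream_get_wrappers(), true );
}

// Illustrative storage path; the backend, container, and file name are made up.
$storagePath = 'mwstore://local-backend/local-public/a/ab/Example.png';

var_dump( hasUsableWrapper( $storagePath ) );
var_dump( hasUsableWrapper( __FILE__ ) );
```

A check like this only avoids the warning. The images would still be missing from the dump unless the storage path is resolved to something the dumper can read.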
Comment 1 Aaron Schulz 2012-02-04 21:11:38 UTC
Try testing it after r110703.
Comment 2 Thomas Bleher 2012-02-05 09:48:32 UTC
Thanks a lot, it's working fine now!

I needed just one additional patch:
--- a/dumpHTML.inc
+++ b/dumpHTML.inc
@@ -1354,6 +1354,7 @@ class DumpHTML_ProxyFile {
                if ( isset( $thumb->url ) ) {
                        $thumb->url = $this->fixURL( $thumb->url );
                }
+               $thumb->file = $this;
                return $thumb;
        }
 

Without this, the links on image pages to the original image had an absolute path.
I think this is the correct fix, but it would be nice if you could check and then commit it.
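
One plausible reading of why the extra line matters is sketched below. It assumes that the link to the original image is built from whatever object the thumbnail's file property points at. Every class here is a simplified, hypothetical stand-in rather than MediaWiki or DumpHTML code, and the URL rewriting is only illustrative.

```php
<?php
// All classes below are simplified, hypothetical stand-ins.
class SketchFile {
    public function getUrl(): string {
        return '/w/images/a/ab/Example.png'; // absolute path, as on the live wiki
    }
}

class SketchThumb {
    public $file;
    public $url;
    public function __construct( $file, string $url ) {
        $this->file = $file;
        $this->url = $url;
    }
    // Assumption: the link to the original comes from whatever ->file points at.
    public function originalLink(): string {
        return $this->file->getUrl();
    }
}

class SketchProxyFile {
    private $file;
    public function __construct( SketchFile $file ) {
        $this->file = $file;
    }
    // Stand-in for fixURL(): turn an absolute wiki path into a dump-relative one.
    private function fixURL( string $url ): string {
        return '../' . preg_replace( '!^/w/!', '', $url );
    }
    public function getUrl(): string {
        return $this->fixURL( $this->file->getUrl() );
    }
    public function transform( bool $repointFile ): SketchThumb {
        $thumb = new SketchThumb( $this->file, '/w/images/thumb/a/ab/Example.png/120px-Example.png' );
        $thumb->url = $this->fixURL( $thumb->url );
        if ( $repointFile ) {
            $thumb->file = $this; // mirrors the one-line change in the patch
        }
        return $thumb;
    }
}

$proxy = new SketchProxyFile( new SketchFile() );
echo $proxy->transform( false )->originalLink(), "\n"; // link via the unwrapped file
echo $proxy->transform( true )->originalLink(), "\n";  // link via the proxy
```

In this sketch, the thumbnail that still points at the unwrapped file yields the absolute path, while the one repointed at the proxy yields the rewritten, relative path.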
Comment 3 Aaron Schulz 2012-02-05 20:22:13 UTC
That looks OK to commit.
Comment 4 Mark A. Hershberger 2012-03-02 21:51:51 UTC
(In reply to comment #3)
> That looks OK to commit.

Did you commit this?  Could you tag the revision 1.19?
Comment 5 Antoine "hashar" Musso (WMF) 2012-03-02 21:52:49 UTC
To fix this bug, we need:

- backport r110703 in REL1_19
- apply the fix from comment 2 in trunk and REL1_19
- test dumpHTML in REL1_19
Comment 6 Amadeus 2012-06-12 12:40:10 UTC
I would like to test, as I am affected by this bug.
Comment 7 Thomas Bleher 2012-08-24 19:14:09 UTC
You can check out https://gerrit.wikimedia.org/r/#/c/17697/, which fixes this bug and a number of other issues with the DumpHTML extension. The change is not yet merged into the DumpHTML master branch because it needs review by someone other than me (the author of the change).
Comment 8 Aaron Schulz 2013-06-06 07:22:50 UTC
(In reply to comment #7)
> You can check out https://gerrit.wikimedia.org/r/#/c/17697/, which fixes this
> bug and a number of other issues with the DumpHTML extension. The change is
> not yet merged into the DumpHTML master branch because it needs review by
> someone other than me (the author of the change).

This was merged.
