Last modified: 2013-06-12 06:36:59 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T36465, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 34465 - enable importing of edits from newly released historical English Wikipedia database dumps to the current enwiki database
enable importing of edits from newly released historical English Wikipedia da...
Status: NEW
Product: Wikimedia
Classification: Unclassified
General/Unknown (Other open bugs)
unspecified
All All
: Low enhancement with 2 votes (vote)
: ---
Assigned To: Nobody - You can work on this!
http://en.wikipedia.org/wiki/Wikipedi...
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2012-02-17 08:41 UTC by Graham87
Modified: 2013-06-12 06:36 UTC (History)
6 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Graham87 2012-02-17 08:41:51 UTC
Most edits from February 2002 onwards have survived intact in the Wikipedia database, but some have not, mostly due to deletion-related accidents. I've compiled a list of these at:
http://en.wikipedia.org/wiki/User:Graham87/Page_history_observations

I'd like to be able to use the newly released historical database dumps to re-add some of these missing edits. Ideally, these database dumps should be placed on read-only wikis, where any admin can use Special:Import to retrieve the necessary edits like the Nostalgia Wikipedia (see bug 20280).
Comment 1 Ariel T. Glenn 2012-02-17 13:59:32 UTC
On my todo list is to find all broken revisions on all projects and wade through the dumps to see what's recoverable. It's pretty far down on the list though :-(
Comment 2 Nemo 2013-04-04 22:33:22 UTC
(In reply to comment #0)
> I'd like to be able to use the newly released historical database dumps to
> re-add some of these missing edits. Ideally, these database dumps should be
> placed on read-only wikis, where any admin can use Special:Import to retrieve
> the necessary edits like the Nostalgia Wikipedia (see bug 20280).

Can't you "just" check them by hand and use Special:Import with importupload right (see [[m:Importer]])?
If the XML is too big you could also import it on some test wiki, delete the pages you don't need, re-export all the rest and import the XML.
Comment 3 Nemo 2013-04-04 22:35:06 UTC
Ah, remember that [[m:NWI]] are masters of such XML dumps jobs. ;-)
Comment 4 Graham87 2013-04-04 23:38:05 UTC
(In reply to comment #2)
> 
> Can't you "just" check them by hand and use Special:Import with importupload
> right (see [[m:Importer]])?
> If the XML is too big you could also import it on some test wiki, delete the
> pages you don't need, re-export all the rest and import the XML.

Hmmm, interesting idea (especially getting the help of the small-wiki importers from comment 3!). It's just that the dumps I'm most interested in (from May 2003 and 2002) aren't in XML format at all ... they're in the format of the old versions of MediaWiki and UseModWiki. They do have a warning on them that they shouldn't be wholesale dumped into the latest version of MediaWiki, after all ...
Comment 5 Nemo 2013-04-04 23:41:47 UTC
Hm, I'm not sure I understand what dumps you're talking about then: do you mean the content Tim Starling recovered from the diff txt files of UseModWiki, and which Reagle reworked a bit? http://reagle.org/joseph/blog/social/wikipedia/10k-redux.html
That may be tricky indeed. :/
Comment 6 Graham87 2013-04-05 04:59:41 UTC
Yes, it would be nice to restore the dumps from August 2001 that Tim Starling recovered, but for the purposes of this bug I'm specifically talking about these dumps:
http://dumps.wikimedia.org/archive/
Comment 7 Graham87 2013-06-12 06:36:59 UTC
I've finally bitten the bullet and imported the January 2003 dump to a local copy of MediaWiki (not an easy task for someone with almost no MySQL experience!) I did so with the help of MediaWiki 1.3 (using a skeleton database) and MediaWiki 1.5 (using its updater). I have requested import rights on Meta, so I can import some of the needed revisions, at Steward Requests/Permissions. My request is here:
http://meta.wikimedia.org/wiki/Steward_requests/Permissions#Miscellaneous_requests

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links