Last modified: 2014-01-03 15:32:11 UTC
This issue was converted from https://jira.toolserver.org/browse/DBQ-13. Summary: Create an SQL or XML dump of all the deleted pages in Wikipedia's (en) history. Issue type: Task - A task that needs to be done. Priority: Minor Status: Declined Assignee: (none) ------------------------------------------------------------------------------- From: Fred Benenson <fcb211@nyu.edu> Date: Wed, 13 Feb 2008 17:29:36 ------------------------------------------------------------------------------- I'm a graduate student doing research on Wikipedia and am interested in doing analysis on deleted pages. Ideally, I would like a raw dump of all the deleted pages in Wikipedia's history. From my understanding the SQL command would look something like this: SELECT * FROM archive
------------------------------------------------------------------------------- From: DaB. <dab@ts.wikimedia.org> Date: Wed, 13 Feb 2008 19:25:28 ------------------------------------------------------------------------------- The articles are deleted for good reasons. Maybe we can give you an list of deleted articles, but no versions or texts of corse.
------------------------------------------------------------------------------- From: Bryan Tong Minh <bryan@tools.wikimedia.de> Date: Wed, 13 Feb 2008 19:40:37 ------------------------------------------------------------------------------- Not a good idea according to Brion Vibber in IRC. I actually wonder why we have this private data available on the toolserver...
This bug was imported as RESOLVED. The original assignee has therefore not been set, and the original reporters/responders have not been added as CC, to prevent bugspam. If you re-open this bug, please consider adding these people to the CC list: Original assignee: (none) CC list: Bryan.TongMinh@Gmail.com, wikimedia-bugzilla@dabpunkt.eu, fcb@fredbenenson.com