
DBQ-13 Create an SQL or XML dump of all the deleted pages in Wikipedia's (en) history.
Closed, Declined | Public

Description

This issue was converted from https://jira.toolserver.org/browse/DBQ-13.
Summary: Create an SQL or XML dump of all the deleted pages in Wikipedia's (en) history.
Issue type: Task - A task that needs to be done.
Priority: Minor
Status: Declined
Assignee: (none)


From: Fred Benenson <fcb211@nyu.edu>

Date: Wed, 13 Feb 2008 17:29:36

I'm a graduate student doing research on Wikipedia and am interested in analyzing deleted pages. Ideally, I would like a raw dump of all the deleted pages in Wikipedia's history. From my understanding, the SQL command would look something like this:

SELECT * FROM archive


Version: unspecified
Severity: minor

Details

Reference
bz59269

Event Timeline

bzimport raised the priority of this task to Needs Triage. Nov 22 2014, 2:19 AM
bzimport set Reference to bz59269.

From: DaB. <dab@ts.wikimedia.org>

Date: Wed, 13 Feb 2008 19:25:28

The articles were deleted for good reasons. Maybe we can give you a list of deleted articles, but no versions or texts, of course.


From: Bryan Tong Minh <bryan@tools.wikimedia.de>

Date: Wed, 13 Feb 2008 19:40:37

Not a good idea, according to Brion Vibber on IRC. I actually wonder why we have this private data available on the toolserver...

This bug was imported as RESOLVED. The original assignee has therefore not been
set, and the original reporters/responders have not been added as CC, to
prevent bugspam.

If you re-open this bug, please consider adding these people to the CC list:
Original assignee: (none)
CC list: Bryan.TongMinh@Gmail.com, wikimedia-bugzilla@dabpunkt.eu, fcb@fredbenenson.com