Page MenuHomePhabricator

DBQ-20 database of participation in english wikipedia
Closed, ResolvedPublic

Description

This issue was converted from https://jira.toolserver.org/browse/DBQ-20.
Summary: database of participation in english wikipedia
Issue type: Task - A task that needs to be done.
Priority: Major
Status: Done
Assignee: (none)


From: zeyi He <wikipediathinker@googlemail.com>

Date: Thu, 24 Apr 2008 13:20:52

I want a database to show the participation in english wikipedia including user name, the number of edit, the number of edit articles, the number of discussin edit,started date, admin or not. I hope it can cover all registered users in english wikipedia. I am running a research to study on the participation pattern in english wikipedia which will help to use limited resources in wikipedia effectively.


Version: unspecified
Severity: major

Details

Reference
bz59276

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 2:20 AM
bzimport set Reference to bz59276.

From: Misza <misza1313@gmail.com>

Date: Sun, 13 Jul 2008 09:54:13

This query would be quite a strain on the servers.

There are 7.4M+ registered accounts in total, 2.5M+ of which have at least one edit (according to user_editcount field). Are you sure you want all of them? The database should be limited to those with edits of course (the rest provide no statistical information apart from their number) but maybe an even higher editcount cutoff would work as well for you?


From: zeyi He <wikipediathinker@googlemail.com>

Date: Wed, 16 Jul 2008 10:38:51

Hi, thanks for comment.

For my research, i need the database with username, a number of edit and start time of editing.

Is that possible if "edit" means more than 1 time edit excluding usercount? is that still too big to create? I really need this for my research, thanks!


From: SQL <sxwiki@gmail.com>

Date: Wed, 13 Aug 2008 18:13:16

If I understand you correctly, this file should have the information requested in your last comment:

http://toolserver.org/~sql/DBQ/20.txt.gz

If not, feel free to re-open this request.

This bug was imported as RESOLVED. The original assignee has therefore not been
set, and the original reporters/responders have not been added as CC, to
prevent bugspam.

If you re-open this bug, please consider adding these people to the CC list:
Original assignee: (none)
CC list: misza@misza.net, sxwiki@gmail.com