
DBQ-36 enwiki list of chatter
Closed, ResolvedPublic

Description

This issue was converted from https://jira.toolserver.org/browse/DBQ-36.
Summary: enwiki list of chatter
Issue type: Task - A task that needs to be done.
Priority: Major
Status: Done
Assignee: SQL <sxwiki@gmail.com>


From: zeyi He <wikipediathinker@googlemail.com>

Date: Wed, 13 Aug 2008 14:23:50

The list I want is a list of chatters on English Wikipedia. A chatter is someone who has edited only the discussion pages of Wikipedia. I would like the table to include their username (or IP address), the number of edits to discussion pages (whether user discussion pages or article discussion pages), the distribution of edits over time, and the mid length of their edits.

Is that possible?


Version: unspecified
Severity: major

Details

Reference
bz59292

Event Timeline

bzimport raised the priority of this task to Needs Triage. Nov 22 2014, 2:21 AM
bzimport set Reference to bz59292.

From: SQL <sxwiki@gmail.com>

Date: Wed, 13 Aug 2008 23:13:11

Done, please see http://toolserver.org/~sql/DBQ/36.txt.gz for the results.

I matched your specification as closely as I could. Unfortunately, I will not be able to re-run this query at a later date, as it took about five hours to run.

Hope this helps!

Here's the query in case anyone can do it better:

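-- Registered users whose every edit is in Talk (namespace 1) or User talk (namespace 3).
-- Anonymous (IP) edits carry rev_user = 0, so they never match a row in user.
SELECT CONCAT('* [[User:',user_name,']]'),
 -- number of edits to article talk pages
 (SELECT COUNT(*) FROM revision JOIN page ON rev_page = page_id WHERE rev_user = user_id AND page_namespace = 1) AS talk_edits,
 -- number of edits to user talk pages
 (SELECT COUNT(*) FROM revision JOIN page ON rev_page = page_id WHERE rev_user = user_id AND page_namespace = 3) AS utalk_edits
 FROM user
 WHERE user_editcount > 0
 -- at least one edit in namespace 1 or 3
 AND user_id IN (SELECT rev_user FROM revision JOIN page ON rev_page = page_id WHERE page_namespace = 1 OR page_namespace = 3)
 -- and no edit in any other namespace
 AND user_id NOT IN (SELECT rev_user FROM revision JOIN page ON rev_page = page_id WHERE page_namespace != 1 AND page_namespace != 3);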
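
For anyone who wants to try "doing it better", one possible rewrite is a single grouped pass with conditional aggregation in place of the four subqueries. The sketch below is not the query that produced the results; it runs against a tiny in-memory SQLite database whose tables and rows are invented for illustration, keeping only the column names used in the query above. Whether it would actually be faster on the real enwiki replica is untested. It also drops the `user_editcount > 0` check, on the assumption that a user with at least one joined revision satisfies it.

```python
import sqlite3

# Hypothetical rewrite: one grouped pass over revision, keeping only users
# with zero edits outside namespaces 1 (Talk) and 3 (User talk).
CHATTER_QUERY = """
SELECT u.user_name,
       SUM(CASE WHEN p.page_namespace = 1 THEN 1 ELSE 0 END) AS talk_edits,
       SUM(CASE WHEN p.page_namespace = 3 THEN 1 ELSE 0 END) AS utalk_edits
FROM revision AS r
JOIN page AS p ON r.rev_page = p.page_id
JOIN user AS u ON u.user_id = r.rev_user
GROUP BY u.user_id, u.user_name
HAVING SUM(CASE WHEN p.page_namespace NOT IN (1, 3) THEN 1 ELSE 0 END) = 0
ORDER BY u.user_name
"""


def build_toy_db():
    """Create a made-up miniature of the user/page/revision tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE user (user_id INTEGER PRIMARY KEY, user_name TEXT);
        CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_namespace INTEGER);
        CREATE TABLE revision (rev_id INTEGER PRIMARY KEY,
                               rev_page INTEGER, rev_user INTEGER);
        -- TalkOnly edits only talk pages, Mixed also edits an article,
        -- NoEdits has no revisions at all.
        INSERT INTO user VALUES (1, 'TalkOnly'), (2, 'Mixed'), (3, 'NoEdits');
        INSERT INTO page VALUES (10, 0), (11, 1), (12, 3);
        INSERT INTO revision VALUES
            (100, 11, 1), (101, 11, 1), (102, 12, 1),
            (103, 10, 2), (104, 11, 2);
    """)
    return conn


def find_chatters(conn):
    """Return (user_name, talk_edits, utalk_edits) for talk-only users."""
    return conn.execute(CHATTER_QUERY).fetchall()


if __name__ == "__main__":
    for row in find_chatters(build_toy_db()):
        print(row)
```

On the toy data only `TalkOnly` is returned, with two talk edits and one user talk edit. `Mixed` is filtered out by the `HAVING` clause, and `NoEdits` never appears because it has no revisions to join.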

From: zeyi He <wikipediathinker@googlemail.com>

Date: Mon, 25 Aug 2008 14:56:45

Thanks for your help! But I can't open the file you provided. Is it a valid link?


From: SQL <sxwiki@gmail.com>

Date: Mon, 25 Aug 2008 18:13:39

Very sorry about that! I mistyped the file name when I was saving it. That link will work now.

From: zeyi He <wikipediathinker@googlemail.com>

Date: Tue, 26 Aug 2008 14:22:04

Hi, SQL,

Thanks a lot. Yes, the link works now.

May I confirm that this text file has three columns: "user name", "number of edits to article discussion pages", and "number of edits to user discussion pages"?

Or maybe I am wrong?

And for research use, may I ask when this data was collected, for reference?

Thanks again.


From: zeyi He <wikipediathinker@googlemail.com>

Date: Tue, 02 Sep 2008 11:14:25

Hi, thanks.

I still need some clarification about the data, because I see something unusual in the text. Would you mind telling me what it means?

I found some "0*" entries. What do they mean? Thanks!


From: Misza <misza1313@gmail.com>

Date: Sun, 28 Sep 2008 16:49:34

The file uses Unix newline characters. If you open it in braindead editors like notepad.exe, you will see the 0 (user talk edits) from the previous line concatenated with the * (wiki bullet point) from the next line. Open the file in a more sophisticated editor.
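
To make the newline point concrete, here is a minimal sketch of two ways to work with the file without changing editors: convert the Unix newlines to Windows ones, or parse the rows directly. It assumes the three columns are tab-separated and that there is no header row, neither of which is confirmed in this thread. The function names and sample rows are made up for illustration.

```python
import gzip


def read_results(path):
    """Read the gzipped result file as text (assumes UTF-8)."""
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        return fh.read()


def normalize_newlines(text):
    """Convert Unix (LF) line endings to Windows (CRLF).

    Existing CRLF pairs are collapsed first so that running the
    function twice does not produce doubled carriage returns.
    """
    return text.replace("\r\n", "\n").replace("\n", "\r\n")


def parse_rows(text, sep="\t"):
    """Split each line into (user link, talk edits, user talk edits).

    The separator is an assumption; rsplit is used so that a user
    name containing the separator does not break the two counts.
    """
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        name, talk, utalk = line.rsplit(sep, 2)
        rows.append((name, int(talk), int(utalk)))
    return rows


if __name__ == "__main__":
    # Invented sample: with the newline not displayed, the trailing 0 of
    # the first row runs straight into the * of the second, giving "0*".
    sample = "* [[User:Example A]]\t3\t0\n* [[User:Example B]]\t1\t2\n"
    print(parse_rows(sample))
```

Parsing the rows this way also answers the column question directly: the first field is the wiki-formatted user link, the second is the article talk count, and the third is the user talk count.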

This bug was imported as RESOLVED. The original assignee has therefore not been
set, and the original reporters/responders have not been added as CC, to
prevent bugspam.

If you re-open this bug, please consider adding these people to the CC list:
Original assignee: sxwiki@gmail.com
CC list: misza@misza.net, sxwiki@gmail.com