DBQ-131 article random selection from English Wikipedia with quota for each grade
Closed, Resolved, Public

Description

This issue was converted from https://jira.toolserver.org/browse/DBQ-131.
Summary: article random selection from English Wikipedia with quota for each grade
Issue type: Task - A task that needs to be done.
Priority: Major
Status: Done
Assignee: Hoo man <hoo@online.de>


From: Minn Seok Choi <MinnSeok.Choi@gmail.com>

Date: Sat, 26 Mar 2011 04:14:43

I would like a list of English Wikipedia articles in which 100 articles are chosen at random for each WP quality grade (FA, A or GA, B, C, Start, Stub). In total, 600 articles should appear on the list, with one column for the full article name and one for the grade.


Version: unspecified
Severity: major

Details

Reference
bz59386

Event Timeline

bzimport raised the priority of this task from to Needs Triage. Nov 22 2014, 2:27 AM
bzimport set Reference to bz59386.

From: Hoo man <hoo@online.de>

Date: Sat, 26 Mar 2011 16:39:42

SQL:

SELECT DISTINCT page_title, 'A'
FROM (
    -- Pick up to 125 random subcategories of A-Class_articles that
    -- contain at least one member that is not itself a subcategory.
    SELECT page_title AS cat
    FROM page
    INNER JOIN categorylinks ON page_id = cl_from
    INNER JOIN category ON cat_title = page_title
    WHERE page_namespace = 14
      AND categorylinks.cl_to = 'A-Class_articles'
      AND cat_pages - cat_subcats > 0
    ORDER BY RAND()
    LIMIT 125
) AS categories
INNER JOIN categorylinks ON categories.cat = categorylinks.cl_to
INNER JOIN page ON categorylinks.cl_from = page.page_id
-- Skip members whose titles contain "-class".
WHERE page_title NOT LIKE '%-class%'
ORDER BY RAND()
LIMIT 100;

(Query for A-Class articles; the same query was used for every other grade, except FA.)
Result:
http://toolserver.org/~hoo/dbq/dbq-131.txt
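
For readers who want to try the per-grade quota idea without access to the Toolserver database, the following is a minimal sketch in Python using an in-memory SQLite database. It is not the query that was run above: the `assessed_page` table, the helper names `build_toy_db` and `sample_per_grade`, and the toy data are all assumptions made for illustration, and SQLite's `RANDOM()` stands in for MySQL's `RAND()`.

```python
import sqlite3


def build_toy_db(grades, pages_per_grade):
    """Create an in-memory database with a hypothetical (title, grade) table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE assessed_page (title TEXT PRIMARY KEY, grade TEXT NOT NULL)"
    )
    # Toy data only: real titles and grades would come from the wiki database.
    rows = [
        (f"{grade}_article_{i}", grade)
        for grade in grades
        for i in range(pages_per_grade)
    ]
    conn.executemany("INSERT INTO assessed_page VALUES (?, ?)", rows)
    return conn


def sample_per_grade(conn, grades, quota):
    """Return up to `quota` randomly chosen (title, grade) rows for each grade."""
    sample = []
    for grade in grades:
        cur = conn.execute(
            "SELECT title, grade FROM assessed_page "
            "WHERE grade = ? ORDER BY RANDOM() LIMIT ?",
            (grade, quota),
        )
        sample.extend(cur.fetchall())
    return sample


if __name__ == "__main__":
    # Six grade groups, as in the request ("A or GA" treated as one group here).
    grades = ["FA", "A/GA", "B", "C", "Start", "Stub"]
    conn = build_toy_db(grades, pages_per_grade=250)
    rows = sample_per_grade(conn, grades, quota=100)
    print(len(rows))  # 6 grades x 100 articles = 600
```

Running one `ORDER BY RANDOM() LIMIT ?` query per grade keeps each quota independent, so a grade with fewer pages than the quota simply returns everything it has instead of shrinking the other grades' samples.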


From: Minn Seok Choi <MinnSeok.Choi@gmail.com>

Date: Sun, 27 Mar 2011 08:00:09

Thanks a lot, Hoo man. You retrieved the article list perfectly. I need more data about the articles on it, so I will post another request for that query. Have a good day.

This bug was imported as RESOLVED. The original assignee has therefore not been
set, and the original reporters/responders have not been added as CC, to
prevent bugspam.

If you re-open this bug, please consider adding these people to the CC list:
Original assignee: hoo@online.de
CC list: hoo@online.de