This issue was converted from https://jira.toolserver.org/browse/DBQ-206.
Summary: List of English Wikipedia articles along with creation date, length, and categories
Issue type: Task - A task that needs to be done.
Priority: Major
Status: Open
Assignee: (none)
From: Raj Enfield <xrajah@yahoo.com>
Date: Thu, 23 May 2013 17:46:06
Dear Sir or Madam:
I would like a list of all ~4M articles in the English Wikipedia in the following format:
TITLE - CREATION_DATE - LENGTH - CATEGORIES
where LENGTH is the CHAR_LENGTH(articleText) in characters.
I found a similar query (without the length of the article):
SQL query was : select page_title,min(rev_timestamp) AS created,group_concat(distinct cl_to) from page,revision,categorylinks where page_id=rev_page and page_namespace=0 and page_is_redirect=0 and cl_from=page_id group by page_id
*from : http://getthedata.org/questions/317/how-can-i-compile-a-log-of-wikipedia-articles-by-date-of-creation/
Thanks, I appreciate it!
Version: unspecified
Severity: normal