
DBQ-163 Category-links and pagelinks for a given category only
Closed, ResolvedPublic

Description

This issue was converted from https://jira.toolserver.org/browse/DBQ-163.
Summary: Category-links and pagelinks for a given category only
Issue type: Task - A task that needs to be done.
Priority: Critical
Status: Done
Assignee: Hoo man <hoo@online.de>


From: Wikimux <mux0x55@gmail.com>

Date: Mon, 24 Oct 2011 15:55:34

I need the categorylinks and pagelinks dumps for Category:Software_engineering, in the same format as the huge files of the English dump. I don't know whether I have this right, or whether there is an easier way to do this with the MediaWiki API.
It would also be fine if you could give me only these columns for the given category:

cl_from,cl_to,cl_sortkey,cl_timestamp,cl_sortkey_prefix,cl_collation,cl_type (for categorylinks).

pl_from, pl_namespace, pl_title (for pagelinks).

Thanks and regards,
P


Version: unspecified
Severity: critical

Details

Reference
bz59430

Event Timeline

bzimport raised the priority of this task from to Needs Triage. (Nov 22 2014, 2:30 AM)
bzimport set Reference to bz59430.

From: Hoo man <hoo@online.de>

Date: Tue, 25 Oct 2011 18:44:51

One thing I didn't understand: do you need that data for the category itself or for the pages in it?

If you want to do that using the API, you may want to take a look at:
http://www.mediawiki.org/wiki/API:Backlinks
http://www.mediawiki.org/wiki/API:Categorymembers
http://www.mediawiki.org/wiki/API:Properties#categoryinfo_.2F_ci
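
For readers who want to try the API route, here is a minimal sketch that only builds the request URLs for the three modules linked above. The endpoint `https://en.wikipedia.org/w/api.php` and the helper names are assumptions for illustration; they do not come from this ticket. The sketch sends no requests and does not handle result continuation.

```python
from urllib.parse import urlencode

# Assumption: the English Wikipedia endpoint. The ticket does not name one.
API_ENDPOINT = "https://en.wikipedia.org/w/api.php"


def build_url(params):
    """Return a full API URL for the given query parameters."""
    query = {"action": "query", "format": "json"}
    query.update(params)
    return f"{API_ENDPOINT}?{urlencode(query)}"


def category_members_url(category, limit=500):
    """URL listing the pages in a category (API:Categorymembers)."""
    return build_url({
        "list": "categorymembers",
        "cmtitle": f"Category:{category}",
        "cmlimit": limit,
    })


def backlinks_url(title, limit=500):
    """URL listing the pages that link to a title (API:Backlinks)."""
    return build_url({
        "list": "backlinks",
        "bltitle": title,
        "bllimit": limit,
    })


def category_info_url(category):
    """URL returning page counts for a category (prop=categoryinfo)."""
    return build_url({
        "prop": "categoryinfo",
        "titles": f"Category:{category}",
    })


if __name__ == "__main__":
    print(category_members_url("Software_engineering"))
    print(backlinks_url("Category:Software_engineering"))
    print(category_info_url("Software_engineering"))
```

Note that API:Backlinks returns links pointing to a page, whereas the pagelinks table rows requested here (pl_from, pl_namespace, pl_title) describe links going out from a page, so the API output is not a drop-in replacement for the dump columns.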


From: Wikimux <mux0x55@gmail.com>

Date: Tue, 25 Oct 2011 19:27:35

I need a dump like the ones in the huge English dump (http://dumps.wikimedia.org/enwiki/latest/, categorylinks and pagelinks), but only for the given category. Is that possible? Sorry if I didn't understand how they work. I'll check your links, thank you anyway. I also checked your tools and used the Category tool; maybe it is what I need, but some 'columns' are missing... (I don't know if it is the right tool in my case).
Thank you again


From: Hoo man <hoo@online.de>

Date: Tue, 25 Oct 2011 19:44:00

Well, I still don't know exactly what you want, so I ran the queries for the pages in the category (and not for the category itself). Please reopen if you need something different.

SQL:

-- All categorylinks rows of every page that is in Category:Software_engineering
SELECT /* SLOW_OK */ categorylinks.*
FROM categorylinks AS page_list
INNER JOIN categorylinks ON categorylinks.cl_from = page_list.cl_from
WHERE page_list.cl_to = 'Software_engineering';

-- All pagelinks rows of every page that is in Category:Software_engineering
SELECT /* SLOW_OK */ pagelinks.*
FROM categorylinks AS page_list
INNER JOIN pagelinks ON pagelinks.pl_from = page_list.cl_from
WHERE page_list.cl_to = 'Software_engineering';

Result:
http://toolserver.org/~hoo/dbq/dbq-163_1.txt (categorylinks)
http://toolserver.org/~hoo/dbq/dbq-163_2.txt (pagelinks)
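
To make the shape of the two queries above easier to follow, here is a small sketch that runs the same joins against an in-memory SQLite database. Everything in it is an assumption for illustration: the tables are cut down to a few columns, the rows are invented toy data, and the joined categorylinks instance is aliased as `cl` for readability. It is not the Toolserver replica and does not reproduce the linked result files.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Toy schema: a reduced, hypothetical subset of the real MediaWiki columns.
conn.executescript("""
CREATE TABLE categorylinks (cl_from INTEGER, cl_to TEXT);
CREATE TABLE pagelinks (pl_from INTEGER, pl_namespace INTEGER, pl_title TEXT);

-- Invented rows: pages 1 and 2 are in Software_engineering, page 3 is not.
INSERT INTO categorylinks VALUES
    (1, 'Software_engineering'),
    (1, 'Computing'),
    (2, 'Software_engineering'),
    (3, 'Biology');
INSERT INTO pagelinks VALUES
    (1, 0, 'Compiler'),
    (2, 0, 'Debugger'),
    (3, 0, 'Cell');
""")

CATEGORY = "Software_engineering"

# page_list selects the member pages; the self-join then returns every
# category link of those pages, including links to other categories.
category_rows = conn.execute(
    """
    SELECT cl.cl_from, cl.cl_to
    FROM categorylinks AS page_list
    INNER JOIN categorylinks AS cl ON cl.cl_from = page_list.cl_from
    WHERE page_list.cl_to = ?
    ORDER BY cl.cl_from, cl.cl_to
    """,
    (CATEGORY,),
).fetchall()

# Same member list, joined to pagelinks to get the members' outgoing links.
link_rows = conn.execute(
    """
    SELECT pagelinks.pl_from, pagelinks.pl_namespace, pagelinks.pl_title
    FROM categorylinks AS page_list
    INNER JOIN pagelinks ON pagelinks.pl_from = page_list.cl_from
    WHERE page_list.cl_to = ?
    ORDER BY pagelinks.pl_from
    """,
    (CATEGORY,),
).fetchall()

print(category_rows)
print(link_rows)
```

In this toy data, page 1 also appears with its 'Computing' row, which shows that the first query returns all category links of the member pages rather than only the rows pointing at the requested category.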

This bug was imported as RESOLVED. The original assignee has therefore not been
set, and the original reporters/responders have not been added as CC, to
prevent bugspam.

If you re-open this bug, please consider adding these people to the CC list:
Original assignee: hoo@online.de
CC list: hoo@online.de