Page MenuHomePhabricator

DBQ-148 interlanguage links
Closed, DeclinedPublic

Description

This issue was converted from https://jira.toolserver.org/browse/DBQ-148.
Summary: interlanguage links
Issue type: Task - A task that needs to be done.
Priority: Major
Status: Done
Assignee: Hoo man <hoo@online.de>


From: Thomasmp <tm.petzold@gmail.com>

Date: Wed, 03 Aug 2011 12:45:07

Please take the 10 percent of the most viewed articles of each Wikipedia, and construct a two-way linking table of inter-language links. Please leave out languages with less than 1000 articles and generate the output in csv or txt format. Thank you!


Version: unspecified
Severity: major

Details

Reference
bz59409

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 2:29 AM
bzimport set Reference to bz59409.

From: Hoo man <hoo@online.de>

Date: Wed, 03 Aug 2011 12:50:51

Ok, first of all, I don't think that we have an (at least half) up to date list of the most viewed articles. Furthermore I'm not sure what you mean with two-way linking table of inter-language links.

If you have answers to that, we may can help you, but for now, I don't see a chance ![][1]

[1]: https://jira.toolserver.org/images/icons/emoticons/wink.gif

From: Thomasmp <tm.petzold@gmail.com>

Date: Wed, 03 Aug 2011 13:11:29

Re recency of most viewed articles: no problem, just let me know the date of the respective list (or, if possible, use a point in time common amongst most-viewed-article-lists)

Re 2: Oops, it should read table of inter-language links ...


From: Merlijn van Deen <valhallasw@arctus.nl>

Date: Wed, 03 Aug 2011 13:17:48

Could you create an example of what you mean? For instance, using the top 10 articles of a few languages.


From: Thomasmp <tm.petzold@gmail.com>

Date: Wed, 03 Aug 2011 14:08:30

An example would be to take the top 10 articles (most viewed) of the Spanish, French and Hindu Wikipedia and extract what inter-language links these articles show, i.e. to which other languages the top 10 articles of the Spanish Wikipedia, the French Wikipedia etc. link.


From: Hoo man <hoo@online.de>

Date: Tue, 16 Aug 2011 17:23:47

Ok, I see two problems in here, first of all 10% of all articles in all wikis are still about a million (I guess), which is quite a lot. Getting ALL iw-links for those will be a lot of data. Furthermore, you need to give me a list of the most viewed articles, I can't see that from the toolserver database ![][1]

[1]: https://jira.toolserver.org/images/icons/emoticons/wink.gif

From: Hoo man <hoo@online.de>

Date: Mon, 24 Oct 2011 20:20:07

Closed, no response in over two months, feel free to reopen if the data is still needed ![][1]

[1]: https://jira.toolserver.org/images/icons/emoticons/wink.gif

This bug was imported as RESOLVED. The original assignee has therefore not been
set, and the original reporters/responders have not been added as CC, to
prevent bugspam.

If you re-open this bug, please consider adding these people to the CC list:
Original assignee: hoo@online.de
CC list: hoo@online.de, valhallasw@arctus.nl