Page MenuHomePhabricator

DBQ-52 Chronological categorization of backlinks to individual article
Closed, DeclinedPublic

Description

This issue was converted from https://jira.toolserver.org/browse/DBQ-52.
Summary: Chronological categorization of backlinks to individual article
Issue type: Task - A task that needs to be done.
Priority: Major
Status: Done
Assignee: (none)


From: Yuzhong <zy0329111111@gmail.com>

Date: Mon, 17 Nov 2008 13:03:41

I am doing a study about the backlinks to individual article in English Wikipedia. But it seems that through the "what links here" button we can only check the latest information of the backlinks to one article. Can I get more historical information about the backlinks to the article? For instance, if an article named "A" started from 2002 till now, can we get the information about how many backlinks to "A" in year 2002 and what is the respective backlink numbers in 2003, 2004, 2005, 2006...?

It seems that we can only get the result by querying the "pagelinks" dumps. But now I can only download the latest one. Could anyone do me a favor to query the outdated data dump to get the yearly backlinks information for the articles list ed below? Thanks a lot!
Most of the articles are started from late 2001:
1837
Analysis of algorithms
Ballroom dance
Bluetooth
Boron_nitride
Brackish_water
Casino
Computing
conquest_of
Cue_sport
European anchovy
Extreme poverty
File_archiver
Glossary of American football
IBM AIX (operating system)
International Atomic Time
International Union of Pure and Applied Chemistry
List of animated television series
List of anthropologists
Minute of arc
Motor neurone disease
Neolithic
Patriotism
The Amazing Spider-Man
Unit of alcohol
Pattern welding
Communications in Angola
Dasyproctidae
Abstract algebra
Amoeboid
Avionics
Aztlan Underground
Bell_Labs
Binary operation
Blue_law
Board_game
Bomber
Cereal
Characters in Atlas Shrugged
Cobble Hill Tunnel
Coco Chanel
Cuisine of the United States
Delian League
Dual_wield
Ecology of Africa
Ericsson
Friction
Heredity
Industry in Alberta
Joule
Miss Marple
Navy
Silicon Valley
Snake oil
Tank destroyer
The_Bush_(Alaska)
Wheel
Zoology
Axiom of choice
Black_Sea
Blindness
BMW
Civil engineering
Computer keyboard
Czech Republic
Damascus steel
Economy_of_Belize
Electromagnetic radiation
Engineering
Euclidean space
Group action
Hercule Poirot
Ibn
Intelligence quotient
Latitude
Lynx_(console)
Mouthwash
Nuclear fission
Number
Operating system
Pinyin
Primate
Radium
Sid_Meier's_Alpha_Centauri
States and territories of Australia
The_Birth_of_a_Nation
The_Bronx
Utilitarianism
Vim (text editor)
X-ray
Kolmogorov complexity
Maxwell's equations
Quantum mechanics
Britney_Spears
Bill Clinton
Computer
Jewellery
Standard conditions for temperature and pressure
Autism
Oxygen


Version: unspecified
Severity: major

Details

Reference
bz59307

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 2:22 AM
bzimport set Reference to bz59307.

From: Bryan Tong Minh <bryan@tools.wikimedia.de>

Date: Mon, 17 Nov 2008 13:28:12

Not possible. MediaWiki does not store historical data about pagelinks.


From: Yuzhong <zy0329111111@gmail.com>

Date: Mon, 17 Nov 2008 13:52:48

There are "pagelinks" dumps retrieved from different time may keep the backlink information.
See http://download.wikimedia.org/enwiki/
But only the "pagelink" dumps from March 2008 to October 2008 are available. Anywhere I can find the early dumps?


From: Yuzhong <zy0329111111@gmail.com>

Date: Mon, 17 Nov 2008 14:08:17

Alternatively, I have a rough idea.
I use the "what links here" function of article "A" to get the list of articles linked to article "A". And then search the key word "A" in historical versions of thoes articles linked to "A" to see when were they linked to article "A". I am now doing it manaually. It is very slow and tedious. Anyidea to make it automatic or semi-automatic? Thanks a lot for your suggestion!


From: Misza <misza1313@gmail.com>

Date: Mon, 17 Nov 2008 19:19:43

I use the "what links here" function of article "A" to get the list of articles linked to article "A". And then search the key word "A" in historical versions of thoes articles linked to "A" to see when were they linked to article "A".

Unfortunately, to my knowledge, the toolserver databases do not store the text of historical revisions.


From: Betacommand <phoenixoverride@gmail.com>

Date: Thu, 12 Mar 2009 02:17:38

This is something that the database on the toolserver cannot do

This bug was imported as RESOLVED. The original assignee has therefore not been
set, and the original reporters/responders have not been added as CC, to
prevent bugspam.

If you re-open this bug, please consider adding these people to the CC list:
Original assignee: (none)
CC list: Bryan.TongMinh@Gmail.com, misza@misza.net, phoenixoverride@gmail.com