
DBQ-138 Need to find list of articles which contains <!-- WikiBhasha v=X time=YYYY-MM-DD hh:ii:ss:mmm-->
Closed, Declined | Public

Description

This issue was converted from https://jira.toolserver.org/browse/DBQ-138.
Summary: Need to find list of articles which contains <!-- WikiBhasha v=X time=YYYY-MM-DD hh:ii:ss:mmm-->
Issue type: Task - A task that needs to be done.
Priority: Major
Status: Done
Assignee: Hoo man <hoo@online.de>


From: WikiBhasha MSR <wbmsri@microsoft.com>

Date: Tue, 03 May 2011 07:42:34

Hi,

I need to find the list of revisions that contain <!-- WikiBhasha v=X time=YYYY-MM-DD hh:ii:ss:mmm-->
I have toolserver access, but when I run the following SQL query

SELECT old_text FROM revision r INNER JOIN text t ON r.rev_text_id = t.old_id WHERE t.old_text LIKE '%WikiBhasha%' LIMIT 2;

I get the following old_text values:
DB://cluster22/144813013
DB://cluster22/123998601

In the above pattern (e.g. DB://cluster22/144813013), what are 'cluster22' and '144813013'?
I figured out that this pattern points to an external DB server. How can I access this external DB server?
How do I query the content?

regards
anilande


Version: unspecified
Severity: major

Details

Reference
bz59395

Event Timeline

bzimport raised the priority of this task from to Needs Triage. Nov 22 2014, 2:28 AM
bzimport set Reference to bz59395.

From: Hoo man <hoo@online.de>

Date: Tue, 03 May 2011 13:52:09

Sorry, this is the wrong place to ask, but I'll answer anyway.

The revision text isn't available on the toolserver (to save resources). If you only need a few revisions, you have to load them by hand from the live servers (over HTTP, using ?action=raw). If you need all revisions, please use a dump from http://dumps.wikimedia.org/backup-index.html; those XML dumps seem to be the right thing for your needs ;)
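Both routes suggested in this reply can be sketched in Python. This is a minimal illustration, not a tested tool. The regex is a guess at the concrete shape of the marker in the task title. `raw_revision_url` assumes the revision is selected with the `oldid` parameter alongside `action=raw`. `revisions_with_marker` assumes the usual `<page>`/`<revision>`/`<text>` layout of the XML dumps. The function names and the example.org URL are hypothetical.

```python
import re
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Assumed concrete form of "<!-- WikiBhasha v=X time=YYYY-MM-DD hh:ii:ss:mmm-->".
MARKER_RE = re.compile(
    r"<!--\s*WikiBhasha\s+v=\S+\s+"
    r"time=\d{4}-\d{2}-\d{2}\s+\d{2}:\d{2}:\d{2}:\d{3}\s*-->"
)


def raw_revision_url(index_php_url, rev_id):
    """Build the URL that returns the raw wikitext of a single revision."""
    return index_php_url + "?" + urlencode({"action": "raw", "oldid": rev_id})


def revisions_with_marker(dump_file):
    """Yield (page title, revision id) for revisions whose text has the marker.

    dump_file is a path or a binary file object holding an uncompressed XML dump.
    """
    title = None
    for _event, elem in ET.iterparse(dump_file, events=("end",)):
        # Tags look like "{namespace}name"; split off the namespace if present.
        ns, _, tag = elem.tag.rpartition("}")
        ns = ns + "}" if ns else ""
        if tag == "title":
            title = elem.text
        elif tag == "revision":
            text = elem.findtext(ns + "text") or ""
            if MARKER_RE.search(text):
                # Direct child <id> of <revision> is the revision id.
                yield title, elem.findtext(ns + "id")
            elem.clear()  # free the revision text once it has been checked
```

For a handful of revisions, fetch `raw_revision_url(...)` over HTTP and apply `MARKER_RE.search` to the response body. For a full scan, pass the decompressed dump file to `revisions_with_marker` and iterate over the results.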

This bug was imported as RESOLVED. The original assignee has therefore not been
set, and the original reporters/responders have not been added as CC, to
prevent bugspam.

If you re-open this bug, please consider adding these people to the CC list:
Original assignee: hoo@online.de
CC list: wbmsri@microsoft.com, hoo@online.de