Last modified: 2014-01-03 16:02:03 UTC
This issue was converted from https://jira.toolserver.org/browse/DBQ-138. Summary: Need to find list of articles which contains <!-- WikiBhasha v=X time=YYYY-MM-DD hh:ii:ss:mmm--> Issue type: Task - A task that needs to be done. Priority: Major Status: Done Assignee: Hoo man <hoo@online.de> ------------------------------------------------------------------------------- From: WikiBhasha MSR <wbmsri@microsoft.com> Date: Tue, 03 May 2011 07:42:34 ------------------------------------------------------------------------------- Hi, I need to find list of revisions which contains <!-- WikiBhasha v=X time=YYYY-MM-DD hh:ii:ss:mmm--> I have toolserver access but on querying the following sql Select old_text from revision r inner join text t on r.rev_text_id = t.old_id where t.old_text like "%WikiBhasha%" limit 2; I am getting the old_text as DB://cluster22/144813013 DB://cluster22/123998601 In the above pattern/string ( ex. DB://cluster22/144813013) what is 'cluster22' and '144813013'? I figured out that the above pattern indicates an external db server. How can i access this external db server? How to query the content? regards anilande
------------------------------------------------------------------------------- From: Hoo man <hoo@online.de> Date: Tue, 03 May 2011 13:52:09 ------------------------------------------------------------------------------- Sorry, but first of all, this is the wrong place to ask, but I'll answer it anyway. The revision text isn't available on the toolserver (to save resources). You have to per hand load it (using http and ?action=raw) from the live servers (if you only need a few revisions). If you need all revisions please use a dump from http://dumps.wikimedia.org/backup-index.html those xml dumps seem to be the right thing for your need ![][1] [1]: https://jira.toolserver.org/images/icons/emoticons/wink.gif
This bug was imported as RESOLVED. The original assignee has therefore not been set, and the original reporters/responders have not been added as CC, to prevent bugspam. If you re-open this bug, please consider adding these people to the CC list: Original assignee: hoo@online.de CC list: wbmsri@microsoft.com, hoo@online.de