Last modified: 2013-10-13 09:23:23 UTC
I run interwiki.py Many existing categories and it's interwiki links are reported as missing https://cs.wikinews.org/w/index.php?title=Kategorie:Srpen_2013&curid=7375&diff=34600&oldid=34561 https://cs.wikinews.org/w/index.php?title=Kategorie:21._%C4%8Dervenec_2013&curid=7378&diff=34599&oldid=34505 and many others
the same for wiktionary main namespace https://cs.wiktionary.org/w/index.php?title=%D1%81%D0%B5%D0%BE%D1%81%D0%BA%D0%B8&curid=41303&diff=437508&oldid=172166 other changes I stopped
Could you give any hints, tracebacks, messages while processing iw.py?
Created attachment 13448 [details] log from interwiki py See attached log. But now seems to work correctly
try interwiki.py -lang:tr -family:wikinews -subcatsr:2013 Although all categories have all members, in february some categories "does not exist" When I run -subcatsr:2013/02 exist all It seems that bot takes only first 50 pages from some languages, because on interwiki.py -lang:cs -family:wikinews -new -namespace:14 it deleted some links and these languages now have only 50 pages to work. Additionally there is bug https://bugzilla.wikimedia.org/show_bug.cgi?id=55374 very slow run - loading about 1 page per second and error messages every few minutes ------------ interwiki.py -lang:tr -family:wikinews -subcatsr:2013 ... NOTE: [[tr:Kategori:2013/02/18]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/19]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/20]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/21]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/22]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/23]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/24]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/25]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/26]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/27]] does not exist. Skipping. ... NOTE: The first unfinished subject is [[tr:Kategori:2013/01]] NOTE: Number of pages queued is 50, trying to add 60 more. Getting [[Kategori:2013/02/27]] list... Getting [[Kategori:2013/02/28]] list... Getting [[Kategori:2013/03]] list... Getting [[Kategori:2013/03/01]] list... Getting [[Kategori:2013/03/02]] list... Getting [[Kategori:2013/03/03]] list... Getting [[Kategori:2013/03/04]] list... Getting [[Kategori:2013/03/05]] list... Getting [[Kategori:2013/03/06]] list... Getting [[Kategori:2013/03/07]] list... Getting [[Kategori:2013/03/08]] list...
*** Bug 55655 has been marked as a duplicate of this bug. ***
I can add that the interwikis removed are somewhat random. In two consecutive runs, interwiki.py readds the interwikis removed in the previous run, and sometimes also removes others it didn't remove in the previous run. The problem is definitely related to the "NOTE: [[**:***]] does not exist. Skipping." message, which sometimes isn't correct.
Got it: https://no.wiktionary.org/w/api.php?action=query&prop=info&format=xml&inprop=protection|subjectid&titles=calaria|calariam|calarias|calarmos|calar%25C3%25A1|calar%25C3%25A1s|calar%25C3%25A3o|calar%25C3%25ADamos|calar%25C3%25ADeis|calas|calasse|calassem|calasses|calaste|calastes|calava|calavam|calavas|calaveira|calavera|cala%25C3%25A7a|calcadela|calcadura|calcanhar|calcanhar%2520de%2520Aquiles|calcar|calcarizar|calcariz%25C3%25A1mos|calce|calcearius|calced%25C3%25B3nia|calced%25C3%25B4nia|calceo|calcer|calcetai|calcetais|calcetam|calcetamos|calcetando|calcetar|calcetara|calcetaram|calcetaras|calcetardes|calcetarei|calcetareis|calcetarem|calcetaremos|calcetares|calcetaria|calcetariam|calcetarias|calcetarmos|calcetar%25C3%25A1|calcetar%25C3%25A1s|calcetar%25C3%25A3o|calcetar%25C3%25ADamos|calcetar%25C3%25ADeis|calcetas|calcetasse returns results (just 50...) but also: <warnings> <query xml:space="preserve">Too many values supplied for parameter 'titles': the limit is 50</query> </warnings> Default query value in pywiki is 60... Setting it to 50 or lower (-query:50) should do the trick. This is probably some new change to the API.
Change 89500 had a related patch set uploaded by Xqt: (Bug 55414) Initial bugfix for non existing pages https://gerrit.wikimedia.org/r/89500
Change 89500 merged by Xqt: (Bug 55414) Initial bugfix for non existing pages https://gerrit.wikimedia.org/r/89500
Decreased maxquerysize to 50 which is the same value as in core