Last modified: 2013-10-13 09:23:23 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T57414, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 55414 - Existing pages does not exist
Existing pages does not exist
Status: RESOLVED FIXED
Product: Pywikibot
Classification: Unclassified
interwiki.py (Other open bugs)
compat-(1.0)
All All
: High critical
: ---
Assigned To: xqt
:
: 55655 (view as bug list)
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-10-07 16:16 UTC by JAn Dudík
Modified: 2013-10-13 09:23 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments
log from interwiki py (365.29 KB, application/octet-stream)
2013-10-07 18:19 UTC, JAn Dudík
Details

Description JAn Dudík 2013-10-07 16:16:57 UTC
I run interwiki.py

Many existing categories and it's interwiki links are reported as missing
https://cs.wikinews.org/w/index.php?title=Kategorie:Srpen_2013&curid=7375&diff=34600&oldid=34561

https://cs.wikinews.org/w/index.php?title=Kategorie:21._%C4%8Dervenec_2013&curid=7378&diff=34599&oldid=34505

and many others
Comment 1 JAn Dudík 2013-10-07 16:17:49 UTC
the same for wiktionary main namespace
https://cs.wiktionary.org/w/index.php?title=%D1%81%D0%B5%D0%BE%D1%81%D0%BA%D0%B8&curid=41303&diff=437508&oldid=172166

other changes  I stopped
Comment 2 xqt 2013-10-07 16:31:25 UTC
Could you give any hints, tracebacks, messages while processing iw.py?
Comment 3 JAn Dudík 2013-10-07 18:19:00 UTC
Created attachment 13448 [details]
log from interwiki py

See attached log. 
But now seems to work correctly
Comment 4 JAn Dudík 2013-10-07 18:56:36 UTC
try
interwiki.py -lang:tr -family:wikinews -subcatsr:2013

Although all categories have all members, in february some categories "does not exist"

When I run -subcatsr:2013/02 exist all

It seems that bot takes only first 50 pages from some languages, because on 
interwiki.py -lang:cs -family:wikinews -new -namespace:14
it deleted some links and these languages now have only 50 pages to work.

Additionally there is bug
https://bugzilla.wikimedia.org/show_bug.cgi?id=55374
very slow run - loading about 1 page per second  and error messages every few minutes

------------
interwiki.py -lang:tr -family:wikinews -subcatsr:2013
...
NOTE: [[tr:Kategori:2013/02/18]] does not exist. Skipping.
NOTE: [[tr:Kategori:2013/02/19]] does not exist. Skipping.
NOTE: [[tr:Kategori:2013/02/20]] does not exist. Skipping.
NOTE: [[tr:Kategori:2013/02/21]] does not exist. Skipping.
NOTE: [[tr:Kategori:2013/02/22]] does not exist. Skipping.
NOTE: [[tr:Kategori:2013/02/23]] does not exist. Skipping.
NOTE: [[tr:Kategori:2013/02/24]] does not exist. Skipping.
NOTE: [[tr:Kategori:2013/02/25]] does not exist. Skipping.
NOTE: [[tr:Kategori:2013/02/26]] does not exist. Skipping.
NOTE: [[tr:Kategori:2013/02/27]] does not exist. Skipping.
...
NOTE: The first unfinished subject is [[tr:Kategori:2013/01]]
NOTE: Number of pages queued is 50, trying to add 60 more.
Getting [[Kategori:2013/02/27]] list...
Getting [[Kategori:2013/02/28]] list...
Getting [[Kategori:2013/03]] list...
Getting [[Kategori:2013/03/01]] list...
Getting [[Kategori:2013/03/02]] list...
Getting [[Kategori:2013/03/03]] list...
Getting [[Kategori:2013/03/04]] list...
Getting [[Kategori:2013/03/05]] list...
Getting [[Kategori:2013/03/06]] list...
Getting [[Kategori:2013/03/07]] list...
Getting [[Kategori:2013/03/08]] list...
Comment 5 Malafaya 2013-10-12 20:31:03 UTC
*** Bug 55655 has been marked as a duplicate of this bug. ***
Comment 6 Malafaya 2013-10-12 20:33:38 UTC
I can add that the interwikis removed are somewhat random. In two consecutive runs, interwiki.py readds the interwikis removed in the previous run, and sometimes also removes others it didn't remove in the previous run.
The problem is definitely related to the "NOTE: [[**:***]] does not exist. Skipping." message, which sometimes isn't correct.
Comment 8 Gerrit Notification Bot 2013-10-13 09:18:40 UTC
Change 89500 had a related patch set uploaded by Xqt:
(Bug 55414) Initial bugfix for non existing pages

https://gerrit.wikimedia.org/r/89500
Comment 9 Gerrit Notification Bot 2013-10-13 09:19:45 UTC
Change 89500 merged by Xqt:
(Bug 55414) Initial bugfix for non existing pages

https://gerrit.wikimedia.org/r/89500
Comment 10 xqt 2013-10-13 09:23:23 UTC
Decreased maxquerysize to 50 which is the same value as in core

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links