Last modified: 2014-11-06 15:50:52 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links might be broken. See T56537, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 54537 - pagegenerators: follow redirects, intersection, exclusion
Status: NEW
Product: Pywikibot
Classification: Unclassified
Component: pagegenerators
Version: unspecified
Hardware: All All
Importance: Unprioritized enhancement
Target Milestone: ---
Assigned To: John Mark Vandenberg
Keywords: patch
Depends on:
Blocks:

Reported: 2013-09-24 22:03 UTC by Kunal Mehta (Legoktm)
Modified: 2014-11-06 15:50 UTC (History)
CC List: 5 users



Attachments
Copy of attachment from http://sourceforge.net/p/pywikipediabot/patches/625/attachment/pagegenerators.py.diff (6.49 KB, text/plain)
2013-09-24 22:03 UTC, Kunal Mehta (Legoktm)

Description Kunal Mehta (Legoktm) 2013-09-24 22:03:42 UTC
Originally from: http://sourceforge.net/p/pywikipediabot/patches/625/
Reported by: andreasjs
Created on: 2013-08-24 21:57:56.794000
Subject: Pagegenerator: follow redirects, intersection, exclusion
Original description:
I added three new arguments:


-followredirects  Used with other arguments that specify a set of pages.
                  If a specified page is a redirect page, work on its
                  target page.

-intersecting     Argument to be used between two other arguments.
                  Work only on pages normally specified by both the
                  previous and the next argument.

-excluding        Argument to be used between two other arguments.
                  Work only on pages normally specified by the
                  previous argument but not by the next argument.

For example, one could want to find the pages edited by a specific user whose titles contain a certain keyword.
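
The semantics of the three arguments can be illustrated with a minimal Python sketch. This is not the code from the attached patch: the function names and the resolve_redirect callable are hypothetical, and plain title strings stand in for page objects. It assumes the second page set is small enough to hold in memory.

```python
def follow_redirects(pages, resolve_redirect):
    """Yield each page, replaced by its redirect target when it has one.

    resolve_redirect is a hypothetical callable that returns the target
    of a redirect page, or None when the page is not a redirect.
    """
    for page in pages:
        target = resolve_redirect(page)
        yield page if target is None else target


def intersecting(previous, following):
    """Yield pages from `previous` that `following` also produces."""
    wanted = set(following)  # consumes the second generator up front
    for page in previous:
        if page in wanted:
            yield page


def excluding(previous, following):
    """Yield pages from `previous` that `following` does not produce."""
    unwanted = set(following)
    for page in previous:
        if page not in unwanted:
            yield page


# Stand-in data: titles edited by a user, and titles containing a keyword.
edited_by_user = ["Python (language)", "Old Python page", "Perl"]
title_has_keyword = ["Python (language)", "Monty Python", "Old Python page"]
redirects = {"Old Python page": "Python (language)"}

both = list(intersecting(edited_by_user, title_has_keyword))
only_user = list(excluding(edited_by_user, ["Perl"]))
resolved = list(follow_redirects(edited_by_user, redirects.get))
```

Both set operations keep the order of the first generator and only materialise the second one. Following redirects can yield the same target more than once, so a duplicate filter would still be needed afterwards.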

A few other suggestions:
Exclude sections, even on files.
Compare pages via the Page.__cmp__ method to exclude duplicate pages, instead of building the string key
u"%s:%s:%s" % (page._site.family.name, page._site.lang, page._title).
This would be more transparent and easier to maintain.
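
As a rough illustration of the duplicate-filtering suggestion, the sketch below lets the page object define its own equality, so the filter no longer needs to know how a page is identified. The Page class and unique_pages function here are hypothetical stand-ins, not Pywikibot code. Because __cmp__ exists only in Python 2, the sketch uses __eq__ and __hash__, which serve the same purpose for set membership.

```python
class Page:
    """Hypothetical stand-in for a wiki page, not the Pywikibot class."""

    def __init__(self, family, lang, title):
        self.family = family
        self.lang = lang
        self.title = title

    def _key(self):
        # The identity of a page lives in one place, inside the class.
        return (self.family, self.lang, self.title)

    def __eq__(self, other):
        if not isinstance(other, Page):
            return NotImplemented
        return self._key() == other._key()

    def __hash__(self):
        return hash(self._key())


def unique_pages(pages):
    """Yield each distinct page once, preserving first-seen order."""
    seen = set()
    for page in pages:
        if page not in seen:
            seen.add(page)
            yield page


pages = [
    Page("wikipedia", "en", "Foo"),
    Page("wikipedia", "en", "Foo"),
    Page("wikipedia", "de", "Foo"),
]
unique = list(unique_pages(pages))
```

With this arrangement the filter relies only on the page's own comparison, so a change to how pages are identified would be made in the class rather than in every generator that formats a key string.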
Comment 1 Kunal Mehta (Legoktm) 2013-09-24 22:03:47 UTC
Created attachment 13365
Copy of attachment from http://sourceforge.net/p/pywikipediabot/patches/625/attachment/pagegenerators.py.diff
Comment 2 Merlijn van Deen (test) 2014-02-01 17:04:50 UTC
The patch does not apply cleanly to either core or compat.
Comment 3 John Mark Vandenberg 2014-08-05 01:22:34 UTC
-excluding would be very helpful for working around bugs where a single page causes a problem, such as bug 69133.
Comment 4 John Mark Vandenberg 2014-11-06 15:50:52 UTC
Patch is by Andreas, if I am understanding the old sf.net history correctly.

Mpaa is creating 'intersecting' with https://gerrit.wikimedia.org/r/#/c/170832/
