Last modified: 2014-11-01 07:05:20 UTC
This issue was converted from https://jira.toolserver.org/browse/PYWP-9. Summary: create library for diffs, expand userlib Issue type: New Feature - A new feature of the product, which has yet to be developed. Priority: Minor Status: Open Assignee: David <dkalkin@gmail.com> On Fri, 29 Aug 2008 23:49:25, David <dkalkin@gmail.com> opened the following bug: > Add a new library, editlib, for parsing diffs. It will parse the > diff html page, extracting info like editor, time, edit summary, > etc, and creating lists of the lines and words marked as added and > deleted. More details can be found in the docstrings. For use with > editlib, some new functions are also added to userlib. These will > get the user's registration date, the number of edits the user has > made, and the number of vandalism warnings on the user's talk page. > Only Wikipedia:en is supported - some of the regular expressions > rely on English text. Within that scope, I've done reasonably > thorough testing on the editlib functions, and somewhat less > thorough testing on the new functions for userlib. All of this is > designed for use with automated vandalism detection. I am working > on a bot to do just that - User:Kalbot - but it's still in an early > experimental stage. I am User:DKalkin on the English Wikipedia
On Fri, 29 Aug 2008 23:50:06, David <dkalkin@gmail.com> wrote: > Patch for review.
I think we don't need to parse HTML diffs, we use the API to get info about revisions and users, and difflib to show text diffs. Detection of the number of vandalism warnings on users' talk pages is interesting, but should not be bundled in the framework unless it is easily localized. Also, the patch does not appear to exist anymore on JIRA.