Last modified: 2013-07-15 19:30:33 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T49495, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 47495 - Ability to match text based on a negative lookbehind/lookahead regex
Ability to match text based on a negative lookbehind/lookahead regex
Status: NEW
Product: MediaWiki extensions
Classification: Unclassified
AbuseFilter (Other open bugs)
unspecified
All All
: Lowest enhancement (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks: 47497
  Show dependency treegraph
 
Reported: 2013-04-22 05:04 UTC by Mahitgar
Modified: 2013-07-15 19:30 UTC (History)
5 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Mahitgar 2013-04-22 05:04:48 UTC
Hi,

Abuse filter need to have a feature where in it would provide a unique code to be included (using <!---exceptioncode--> next to given string in article.

Sub feature exception or a word chosen in wiki language should be standard software should suggest randamly generated code to be added next to word <!--exceptionThecode-->


Reason: In certain context certain abusive words remain encyclopedic. For example in artcile http://en.wikipedia.org/wiki/13_June  1934 entry about meeting between  Hitler and mussolini when asked to historians certain abusive words do have encyclopedic value.So we may need to give exception at that place only and no where else.This will save abuse filter managers and patrollers.





Thanks
Comment 1 Nemo 2013-06-09 09:49:36 UTC
You can already exclude pages containing a string, very easily. Are you saying that this kind of regular expressions you're looking for don't work with AbuseFilter? (I don't remember what subset of regex we use exactly.)

Anyway, your approach seems wrong: you can't determine context reliably with regex, you should tag such edits adding words you don't like and then check them manually.
Comment 2 Mahitgar 2013-06-09 10:27:03 UTC
(In reply to comment #1)
> You can already exclude pages containing a string, very easily. Are you
> saying
> that this kind of regular expressions you're looking for don't work with
> AbuseFilter? (I don't remember what subset of regex we use exactly.)
> 
> Anyway, your approach seems wrong: you can't determine context reliably with
> regex, you should tag such edits adding words you don't like and then check
> them manually.

:The logic is exemption is needed only in a particular line in particular page.Use anywhere else is abuse and can be disallowed strait away.

:This will increase work for abuse filter only initially.Once the people know they can not cheat the system easily they will automatically desist.Once people desist lesser work for filters,filter managers and patrollers.

:Tagging only continues patrollers work.Wiki's like en wiki has large manpower available to do that.Most of the wikis in wikimedia umbrella are smaller wikis and we have less man power .If we save our time on patrolling our editors will have more time at disposal for content contribution
Comment 3 Jesús Martínez Novo (Ciencia Al Poder) 2013-06-09 10:35:01 UTC
Mahitgar, you can achieve that easily by changing the condition to

 ( ( YOURCONDITION ) & article_text != '13 June' )
Comment 4 Mahitgar 2013-06-12 11:06:56 UTC
(In reply to comment #3)
> Mahitgar, you can achieve that easily by changing the condition to
> 
>  ( ( YOURCONDITION ) & article_text != '13 June' )


Yes I know, this exempts the complete page. 

Thanks

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links