Last modified: 2013-07-15 19:30:33 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T49495, the corresponding Phabricator task for complete and up-to-date bug report information.

Bug 47495 - Ability to match text based on a negative lookbehind/lookahead regex


Summary:	Ability to match text based on a negative lookbehind/lookahead regex

Status:	NEW

Product:	MediaWiki extensions
Classification:	Unclassified
Component:	AbuseFilter (Other open bugs)
Version:	unspecified
Hardware:	All All

Importance:	Lowest enhancement (vote)
Target Milestone:	---
Assigned To:	Nobody - You can work on this!

URL:
Whiteboard:
Keywords:

Depends on:
Blocks:	47497
	Show dependency tree / graph

Reported:	2013-04-22 05:04 UTC by Mahitgar
Modified:	2013-07-15 19:30 UTC (History)
CC List:	5 users (show)

See Also:
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---

Attachments
Add an attachment (proposed patch, testcase, etc.)

Description Mahitgar 2013-04-22 05:04:48 UTC

Hi,

Abuse filter need to have a feature where in it would provide a unique code to be included (using <!---exceptioncode--> next to given string in article.

Sub feature exception or a word chosen in wiki language should be standard software should suggest randamly generated code to be added next to word <!--exceptionThecode-->


Reason: In certain context certain abusive words remain encyclopedic. For example in artcile http://en.wikipedia.org/wiki/13_June  1934 entry about meeting between  Hitler and mussolini when asked to historians certain abusive words do have encyclopedic value.So we may need to give exception at that place only and no where else.This will save abuse filter managers and patrollers.





Thanks

Comment 1 Nemo 2013-06-09 09:49:36 UTC

You can already exclude pages containing a string, very easily. Are you saying that this kind of regular expressions you're looking for don't work with AbuseFilter? (I don't remember what subset of regex we use exactly.)

Anyway, your approach seems wrong: you can't determine context reliably with regex, you should tag such edits adding words you don't like and then check them manually.

Comment 2 Mahitgar 2013-06-09 10:27:03 UTC

(In reply to comment #1)
> You can already exclude pages containing a string, very easily. Are you
> saying
> that this kind of regular expressions you're looking for don't work with
> AbuseFilter? (I don't remember what subset of regex we use exactly.)
> 
> Anyway, your approach seems wrong: you can't determine context reliably with
> regex, you should tag such edits adding words you don't like and then check
> them manually.

:The logic is exemption is needed only in a particular line in particular page.Use anywhere else is abuse and can be disallowed strait away.

:This will increase work for abuse filter only initially.Once the people know they can not cheat the system easily they will automatically desist.Once people desist lesser work for filters,filter managers and patrollers.

:Tagging only continues patrollers work.Wiki's like en wiki has large manpower available to do that.Most of the wikis in wikimedia umbrella are smaller wikis and we have less man power .If we save our time on patrolling our editors will have more time at disposal for content contribution

Comment 3 Jesús Martínez Novo (Ciencia Al Poder) 2013-06-09 10:35:01 UTC

Mahitgar, you can achieve that easily by changing the condition to

 ( ( YOURCONDITION ) & article_text != '13 June' )

Comment 4 Mahitgar 2013-06-12 11:06:56 UTC

(In reply to comment #3)
> Mahitgar, you can achieve that easily by changing the condition to
> 
>  ( ( YOURCONDITION ) & article_text != '13 June' )


Yes I know, this exempts the complete page. 

Thanks

Wikimedia Bugzilla is closed!

Search

Personal tools

Navigation

Links