Last modified: 2011-10-21 23:03:14 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T33747, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 31747 - Generate email addresses as images to "cloak" from email harvesters
Generate email addresses as images to "cloak" from email harvesters
Status: RESOLVED WONTFIX
Product: MediaWiki extensions
Classification: Unclassified
Extensions requests (Other open bugs)
unspecified
All All
: Unprioritized normal (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2011-10-16 08:13 UTC by とある白い猫
Modified: 2011-10-21 23:03 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description とある白い猫 2011-10-16 08:13:45 UTC
It has came to my attention that a lot of people seem to be posting their email addresses in an text<image>text format. Bots are able to read emails from that. Instead perhaps an image should be generated with an extension displaying the text as an image.

I was thinking of something like <math> but am not sure the best way to execute this. Perhaps the magic word would display whatever email user is confirmed with and post that provided user enables this in settings.

This is intended as an anti-spam measure.
Comment 1 p858snake 2011-10-16 08:19:40 UTC
This would kill usability in a few ways... And there would be nothing stopping these bots from indexing/OCRing the images.

If users don't want their address public and/or indexable, they can list the Special:EmailUser method ([[Special:EmailUser/<USERNAME>]]) on the page and users can contact them via that.
Comment 2 とある白い猫 2011-10-18 07:03:47 UTC
Kill usability how? People already use images as email... forexample the use of @ image. If people want to use the current method they would still be able to. This just provides a third method.

If what you suggest (OCRing of images) happens, that would be a wonderful thing. It means the spam bots will be too busy OCRing images on userpages. This would waste their resources which is a positive thing for us.
Comment 3 p858snake 2011-10-18 07:39:24 UTC
(In reply to comment #2)
> Kill usability how? People already use images as email... forexample the use of
> @ image. If people want to use the current method they would still be able to.
That template (or image) when done properly (en.wikipedia) already gives "@" as the alt text so when you copy and paste it still outputs the email address properly.

If the whole address was a image, we would still want the alt text to be the full email address for users that don't display images (or using a screen reader package) so it would be no different than not using the image because the address would still be intact.

Example of how they would be laid out:
Without any image: joe@somebody.id.au
With at symbol as image: joe<img src="..." alt="@" />somebody.id.au
Full with alt: <img src="..." alt="joe@somebody.id.au" />
Comment 4 とある白い猫 2011-10-19 07:57:50 UTC
The idea here is to offer a third way for people who want to display their emails on their userpage but not want to be spammed like crazy. The idea is to confuse the bots. If people want to use the current method text<image of @>text.ext that would still be fine.

Why do we want alt description? We could do without that particularly for emails. (and who doesn't display images really). If I do not want an alt description for my email, I should not be forced to display it.

It could be crafted in a manner that the image becomes unreadable to bots. Image could be fragmented to make OCR impossible or very difficult. For instance OCR will have great difficulty recognizing a character if the character is broken in to two or more images. Or it could be one image per character. Also it is possible to add fake/invisible images in the mix that machines would pick up but not people.

The more difficult it is for bots the better for us. I am fully aware there are ways around every precaution but that shouldn't mean we should surrender to spam bots.
Comment 5 Daniel Friesen 2011-10-19 08:04:24 UTC
(In reply to comment #4)
> (and who doesn't display images really)
- Blind users
- People on slow connections who turn off images because it speeds things up
- Likewise people on slow connections who defer to text browsers because they have less to load

> It could be crafted in a manner that the image becomes unreadable to bots.
> Image could be fragmented to make OCR impossible or very difficult. For
> instance OCR will have great difficulty recognizing a character if the
> character is broken in to two or more images. Or it could be one image per
> character. Also it is possible to add fake/invisible images in the mix that
> machines would pick up but not people.
> 
> The more difficult it is for bots the better for us. I am fully aware there are
> ways around every precaution but that shouldn't mean we should surrender to
> spam bots.
And real users should lose out on a valid bit of usability, the ability to not have to manually type in an e-mail address and instead click a link to send an e-mail. Because of spam?
...imho, that's like saying Wikipedia should turn anon editing off because of vandals.
Comment 6 とある白い猫 2011-10-19 10:06:57 UTC
People can still use wikipedias own email system mentioned if they desire to not deal with images or if they are blind.

The email on userpage is just an additional means. The other option for people like me is to display nothing which is in essence less usable.

Your example about ip edits does not correlate with this proposal, please stay on topic.
Comment 7 Brion Vibber 2011-10-21 23:03:14 UTC
I don't think this is something we'd really need/want as a generalized feature in MediaWiki itself or on any Wikimedia sites.

You can of course devise your own image that's as convenient or inconvenient as you like and upload it.

If you're interested in writing an extension to help build it in for your own sites, we'd be more than happy to make sure it's hosted with our code repositories for extensions.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links