Last modified: 2014-07-23 08:31:00 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T19663, the corresponding Phabricator task for complete and up-to-date bug report information.

Bug 17663 - Parser interpretes <bXY> as <b> if XY begins with non-ascii character when $wgUseTidy=true


Summary:	Parser interpretes <bXY> as <b> if XY begins with non-ascii character when $w...

Status:	REOPENED

Product:	MediaWiki
Classification:	Unclassified
Component:	Parser (Other open bugs)
Version:	unspecified
Hardware:	All All

Importance:	Low minor (vote)
Target Milestone:	---
Assigned To:	C. Scott Ananian

URL:	http://de.wikipedia.org/w/index.php?t...
Whiteboard:
Keywords:	need-parsertest, parser

Duplicates:	40670 52022 68127 (view as bug list)
Depends on:
Blocks:
	Show dependency tree / graph

Reported:	2009-02-25 14:07 UTC by Church of emacs
Modified:	2014-07-23 08:31 UTC (History)
CC List:	7 users (show)

See Also:	52022
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---

Attachments
Add an attachment (proposed patch, testcase, etc.)

Description Church of emacs 2009-02-25 14:07:44 UTC

<bXY> is parsed as <b> if XY is a non-ascii character.
Examples (also included in the URL):
<b→> doesn't work! </b>
<bä> doesn't work! </b>
<boo> works fine </b>

Comment 1 Dan Collins 2012-03-18 20:42:00 UTC

This is easy enough, but remind me exactly why <bXY> should be parsed as <b>?

Comment 2 C. Scott Ananian 2013-08-06 15:22:43 UTC

*** Bug 52022 has been marked as a duplicate of this bug. ***

Comment 3 C. Scott Ananian 2013-08-06 15:23:01 UTC

*** Bug 40670 has been marked as a duplicate of this bug. ***

Comment 4 Gerrit Notification Bot 2013-08-06 15:25:50 UTC

Change 77907 had a related patch set uploaded by Cscott:
Non-word characters don't terminate tag names.

https://gerrit.wikimedia.org/r/77907

Comment 5 C. Scott Ananian 2013-08-06 15:31:47 UTC

*** Bug 52022 has been marked as a duplicate of this bug. ***

Comment 6 C. Scott Ananian 2013-08-06 15:36:25 UTC

*** Bug 40670 has been marked as a duplicate of this bug. ***

Comment 7 Gerrit Notification Bot 2013-08-06 16:04:26 UTC

Change 77907 merged by jenkins-bot:
Non-word characters don't terminate tag names.

https://gerrit.wikimedia.org/r/77907

Comment 8 Andre Klapper 2013-08-14 12:43:44 UTC

Patch merged. Closing as FIXED.

Comment 9 C. Scott Ananian 2013-08-15 17:12:11 UTC

I was hoping to verify the fix on the deployed wiki.  This patch hasn't been deployed yet.  (Although it should happen today.)

Comment 10 C. Scott Ananian 2013-08-15 18:40:38 UTC

Fixed in the sanitizer, but html-tidy appears to still have a bug.

Comment 11 C. Scott Ananian 2013-08-15 20:58:25 UTC

See bug 52899 for a better way to document behavior which varies when tidy is being used.  The bug has been reopened.  Still need to fix tidy to ensure these tags aren't swallowed.

Comment 12 Gabriel Wicke 2013-11-08 23:05:28 UTC

Tidy bug https://sourceforge.net/p/tidy/bugs/946/

Comment 13 db [inactive,noenotif] 2014-07-23 08:31:00 UTC

*** Bug 68127 has been marked as a duplicate of this bug. ***

Wikimedia Bugzilla is closed!

Search

Personal tools

Navigation

Links