Last modified: 2014-07-23 08:31:00 UTC
<bXY> is parsed as <b> if XY is a non-ascii character. Examples (also included in the URL): <b→> doesn't work! </b> <bä> doesn't work! </b> <boo> works fine </b>
This is easy enough, but remind me exactly why <bXY> should be parsed as <b>?
*** Bug 52022 has been marked as a duplicate of this bug. ***
*** Bug 40670 has been marked as a duplicate of this bug. ***
Change 77907 had a related patch set uploaded by Cscott: Non-word characters don't terminate tag names. https://gerrit.wikimedia.org/r/77907
Change 77907 merged by jenkins-bot: Non-word characters don't terminate tag names. https://gerrit.wikimedia.org/r/77907
Patch merged. Closing as FIXED.
I was hoping to verify the fix on the deployed wiki. This patch hasn't been deployed yet. (Although it should happen today.)
Fixed in the sanitizer, but html-tidy appears to still have a bug.
See bug 52899 for a better way to document behavior which varies when tidy is being used. The bug has been reopened. Still need to fix tidy to ensure these tags aren't swallowed.
Tidy bug https://sourceforge.net/p/tidy/bugs/946/
*** Bug 68127 has been marked as a duplicate of this bug. ***