Last modified: 2014-03-08 16:06:42 UTC
The letter 'Æ' is currently normalized inconsistently between its two cases: the upper case form is normalized to 'AE', the lower case form to 'A'.
The root cause is the equivalence file on our wiki, http://www.mediawiki.org/wiki/AntiSpoof/Equivalence_sets, which is then copied to maintenance/equivset.in.

The file uses the format:

    <hexadecimal codepoint> <character> => [<hexadecimal codepoint>] <character>

The relevant part:

    E6 æ => C6 Æ
    E6 æ => 41 A
    4D4 Ӕ => C6 Æ
    4D5 ӕ => C6 Æ

Running maintenance/generateEquivset.php generates a PHP array from the list, using the character as the key. The codepoint E6 has two entries; I guess only the second one is taken into account.
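
The suspected overwrite can be illustrated with a short sketch. This is Python rather than the PHP of generateEquivset.php, and it does not reproduce that script: the names LINES, parse_line and build_equivset are hypothetical, and the parsing is an assumption based only on the line format quoted above. It shows what happens when a mapping keyed by the source character receives the same key twice.

```python
# Illustrative sketch only: NOT the actual generateEquivset.php logic.
# LINES, parse_line and build_equivset are hypothetical names.

# The four entries quoted from the equivalence file.
LINES = [
    "E6 æ => C6 Æ",
    "E6 æ => 41 A",
    "4D4 Ӕ => C6 Æ",
    "4D5 ӕ => C6 Æ",
]


def parse_line(line):
    """Return (source character, target character) for one entry.

    The source is decoded from its hexadecimal codepoint. The target
    codepoint is optional in the file format, so the target is taken
    from the last token on the right-hand side.
    """
    left, right = line.split("=>")
    source = chr(int(left.split()[0], 16))
    target = right.split()[-1]
    return source, target


def build_equivset(lines):
    """Build a mapping keyed by source character and record conflicts."""
    equivset = {}
    conflicts = []
    for line in lines:
        source, target = parse_line(line)
        if source in equivset and equivset[source] != target:
            # Same key seen before with a different target.
            conflicts.append((source, equivset[source], target))
        # A later entry silently replaces an earlier one.
        equivset[source] = target
    return equivset, conflicts


equivset, conflicts = build_equivset(LINES)
print(equivset["æ"])
print(conflicts)
```

With a last-entry-wins mapping like this one, 'æ' ends up mapped to 'A' and the earlier 'Æ' target is lost, which matches the behaviour described above. Collecting the conflicts, as the sketch does, is one way such duplicate keys could be detected when the list is generated.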
I have removed the always-failing test in https://gerrit.wikimedia.org/r/#/c/55553/