Last modified: 2014-10-16 12:11:21 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T65771, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 63771 - Log for Korean sent with wrong charset
Log for Korean sent with wrong charset
Status: UNCONFIRMED
Product: Utilities
Classification: Unclassified
WM-Bot (Other open bugs)
unspecified
All All
: Lowest normal (vote)
: ---
Assigned To: Peter Bena
http://bots.wmflabs.org/~wm-bot/logs/...
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-04-10 14:50 UTC by Revi
Modified: 2014-10-16 12:11 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Revi 2014-04-10 14:50:14 UTC
See URL, and you can see Korean is broken... (English is shown as normal.)
Comment 1 Bawolff (Brian Wolff) 2014-04-10 17:44:40 UTC
Looks like the http header just has to be changed from:

 content-type: text/plain

to

 content-type: text/plain; charset=utf-8
Comment 2 Peter Bena 2014-06-13 08:38:11 UTC
that didn't help it seems to be inserted to Mysql in wrong format, but I don't know how to fix that, mysql library for .net is pretty undocumented, maybe it doesn't even support this
Comment 3 Bawolff (Brian Wolff) 2014-06-13 18:52:46 UTC
(In reply to Peter Bena from comment #2)
> that didn't help it seems to be inserted to Mysql in wrong format, but I
> don't know how to fix that, mysql library for .net is pretty undocumented,
> maybe it doesn't even support this

Hmmm. If the mysql tables were set to iso 8859-1, I would expect data to be lost (Since it would discard invalid character points. I would think). In this case its clear the data isn't lost, just marked incorrectly. So it might just be the connection and not the db tables. (But if it is the db tables, you can use the Alter command to change the character set on them)

Googling suggests something along the lines of

new MySqlConnection( "Various parameters you need;CharSet=utf8;" );

----

Anyhow, the actual text content being served is fine, its just marked incorrectly. So that should be the layer above your database.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links