mameha1977
01-15-2007, 03:02 AM
I am having trouble displaying Unicode data stored in my MySQL database.
It displays OK on screen in the browser, and the data also looks OK in phpMyAdmin. I copied and pasted the data (Korean and Chinese text) into phpMyAdmin from a Notepad file saved as UTF-8.
Everything is fine until I try to read the website using other tools.
For example, I have installed an open source search spider, and when it indexes my pages it handles English OK, but all Korean etc. is stored as garbage (i.e. it looks like garbage in phpMyAdmin, and also when output to my search results pages, which are in UTF-8).
Another tool that reads the site as garbage is Sitescore (http://sitescore.silktide.com/).
This leads me to think that although things look OK in my browser, something is not quite right. I am worried that when we release the site soon, it will not be indexed properly by search engines.
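To illustrate the kind of garbage described above: text typically looks like this when UTF-8 bytes are decoded as latin1 and then stored again as UTF-8. The sketch below is only a guess at the mechanism, assuming that is what the spider does; the sample string is made up and nothing here is taken from the actual site.

```python
# Hypothetical sample text; any Korean or Chinese string behaves the same way.
original = "한국어"

# The bytes a page sends if it is encoded as UTF-8 (3 bytes per character here).
utf8_bytes = original.encode("utf-8")

# What a tool ends up with if it assumes those bytes are latin1 (ISO-8859-1):
# every byte becomes one unrelated character.
misread = utf8_bytes.decode("latin-1")

# Storing the misread text in a UTF-8 table or page encodes it a second time.
double_encoded = misread.encode("utf-8")

print(len(original), len(utf8_bytes), len(misread), len(double_encoded))

# The damage is reversible as long as no bytes were dropped along the way.
recovered = misread.encode("latin-1").decode("utf-8")
print(recovered == original)
```

If stored garbage can be recovered this way, that would suggest a charset mismatch somewhere between the page and the tool rather than corrupted data.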
Some variable values from MySQL (session value first, global value in parentheses where one was listed):
character set client: utf8 (global value: latin1)
character set connection: utf8 (global value: latin1)
character set database: latin1
character set filesystem: binary
character set results: utf8 (global value: latin1)
character set server: latin1
character set system: utf8
character sets dir: /usr/local/mysql/share/mysql/charsets/
collation connection: utf8_unicode_ci (global value: latin1_swedish_ci)
collation database: latin1_swedish_ci
collation server: latin1_swedish_ci
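One possible reading of the listing above is that the utf8 values belong only to the session that displayed them, while the server-wide defaults are latin1. The short sketch below just picks out the variables where the two differ. The values are copied from the listing; the function name is made up for illustration, and whether this mismatch is the actual cause is an assumption.

```python
# Values copied from the listing: name -> (session value, global value).
# None means no separate global value was shown for that variable.
variables = {
    "character set client": ("utf8", "latin1"),
    "character set connection": ("utf8", "latin1"),
    "character set database": ("latin1", None),
    "character set filesystem": ("binary", None),
    "character set results": ("utf8", "latin1"),
    "character set server": ("latin1", None),
    "character set system": ("utf8", None),
    "collation connection": ("utf8_unicode_ci", "latin1_swedish_ci"),
    "collation database": ("latin1_swedish_ci", None),
    "collation server": ("latin1_swedish_ci", None),
}


def session_differs_from_global(values):
    """Return the names whose session value differs from the listed global value."""
    return [
        name
        for name, (session, global_value) in values.items()
        if global_value is not None and session != global_value
    ]


for name in session_differs_from_global(variables):
    session, global_value = variables[name]
    print(f"{name}: session={session}, global={global_value}")
```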
Can anyone solve this mystery?