cancel
Showing results for 
Search instead for 
Did you mean: 

UTF-8 Character encoding issue

sbalasubramania
Champ in-the-making
Champ in-the-making
Hello,
I am very new to Alfresco and have some basic questions related to UTF-8 character set.

I am trying to create a folder/discussion topic using "Tamil" language. It is showing ????" marks after creation of the folder.

Following things where done by me:
————————————————-
Installed community version of alfresco. Installed mysql with UTF-8  encoding.
I checked that tomcat server.xml file contains URIencoding as "UTF-8".
I created language files under: tomcat/webapps/alfresco/WEB-INF/classes/alfresco/messages directory specific to Tamil.  I made sure that  "encoding_utf8=UTF-8" is present in /webclient_ta_TA.properties.

I have a plugin which allows me to enter "Tamil" characters using firefox browser. I used that plugin to enter "Tamil" text while creating the folder and discussion topic. But after entering the text, it stores the data  and display back as "????" marks.
Search functionality is working fine:  If I do search, it is retrieving correctly. but it only displays "????" marks.


How can I configure Alfresco to allow for "Tamil"  characters?

I appreciate any help to resolve the issue.

Thank you very much
Sriram Balasubramanian
21 REPLIES 21

kevinr
Star Contributor
Star Contributor
Have you ensured your database is using utf-8 character set? The search will work because we index using Lucene which stores its indexes separately from the database - so it sounds like the database character set.

Kevin

sbalasubramania
Champ in-the-making
Champ in-the-making
Thanks for the information. I will check the database and will get back to you. Can you please let me know, which table  contains "space name", so that I can verify the table data.

I appreciate your response

kevinr
Star Contributor
Star Contributor
alf_node_properties contains all node properties including cm:name etc.

Kevin

sbalasubramania
Champ in-the-making
Champ in-the-making
I checked mysql database (alfresco schema). I looked into the table you have mentioned alf_node_properties; the table fields Collation property for varchar, strings are showing as "utf8_bin" . I remember setting the collation property with "utf8_bin" when I created the alfresco database.

Is there anything I need to do. I can try out from start, if required ( meaning setting up the database etc),

Can you please point me to any URL link (if you have) for setting up the database to handle UTF8 character set ?

You can reach me by mail at " sbalasubramanian@spikesource.com "
I took the snapshot of the table structure, but not able to attach the file

Thanks
Sriram[/img]

sbalasubramania
Champ in-the-making
Champ in-the-making
while browsing the table data, I came across one table called " alf_child_assoc " having qname and child_node_name with "????" marks.  ({http://www.alfresco.org/model/content/1.0}???????…)

I think problem is during the data insertion.

Any help is appreciated.
thanks
sriram

kevinr
Star Contributor
Star Contributor
UTF-8 is good for Tamil.  So try using UTF-8 (plain) for the DB upon creation.

Thanks,

Kevin

ismail_sidiya
Champ in-the-making
Champ in-the-making
You may try editing the jdbc url:
db.url=jdbc:mysql://localhost/alfresco?useUnicode=true&characterEncoding=utf-8
Hope this help

musper
Champ in-the-making
Champ in-the-making
True:

You may try editing the jdbc url:
db.url=jdbc:mysql://localhost/alfresco?useUnicode=true&characterEncoding=utf-8
Hope this help

This helped me when tried to migrate from Linux/HSQLDB…I just wasn+t able to get my users and spaces names to show certain unicode characters in Croatian language.

Many thanks!

iblazev
Champ in-the-making
Champ in-the-making
Hello!
I have a similar problem with Croatian characters but given solution doesn't work since I'm working on Linux server. Transition to Windows is not an option. Is there a way to correct this? To use ISO 8859-2 or cp1250?