postfix-users October 2010 archive
Main Archive Page > Month Archives  > postfix-users archives
postfix-users: Character corruption for Chinese (simple and trad

Character corruption for Chinese (simple and traditional) and Korean texts

From: Sharma, Ashish <ashish.sharma3_at_nospam>
Date: Tue Oct 05 2010 - 06:48:26 GMT
To: postfix users <postfix-users@postfix.org>

Hi,

I have a setup, where emails received by mail server(postfix) are taken on and the resulting email's body(html or plain text) and attachments are parsed to separate files and saved, for this I use javax mail api.

The problem occurs for email body when it is in Chinese (simple and traditional) (charset GB2312, as per email header) or Korean (charset ks_c_5601-1987, as per email header),

the resulting parsed email bodies show character corruption (the characters are displayed as '?').

Also even if I am explicitly saving the charset to be the one as suggested by email header the problem remains same.

I am unable to understand why rest of the programs like Google mail, Outlook can parse the mail body right while my code could not.

Please suggest what am I doing wrong?

thanks in advance

Ashish