Page 1 of 1

Unicode characters in exported HTML

Posted: Wed Feb 13, 2013 7:10 pm
by mstaszew
Hi there,

I'm exporting my text as HTML and Unicode characters are exported using decimal HTML character entity such as...

稿稸稕稜窔稸

I'd like to keep have the characters exported in UTF-16 little endian to match Delphi's string format. I'm sending the HTML to a server and cannot have it in the decimal representation it's currently in.

Is this possible?

Thanks,
Michael

Posted: Thu Feb 14, 2013 8:18 am
by Sergey Tkachenko
UTF-16 - no.
But UTF-8 - yes. Include rvsoUTF8 in the Options parameter of SaveHTML/SaveHTMLEx.
(if you need UTF-16, you can save as UTF8, then convert it to UTF-16 using Utf8Decode function, then replace the first occurence of 'UTF-8' to 'UTF-16' (two times, if you saved XHTML)).

Posted: Thu Feb 14, 2013 3:00 pm
by mstaszew
Perfect, works as expected. Thanks.