|Date||Mon, 21 Mar 2005 12:42:10 -0300|
|Subject||[greenstone-users] Wrong conversion from html entities to utf-8 (was: Html pages in iso-8859-1 are badly converted toutf-8 (double conversion?))|
|I think I found the problem:
a) if the page has the iso-8859-1 "□" (i acute), then the utf-8 sequence is ok and I get the accented "i" when the browser is in utf-8 mode.
b) if the page has the html entity í then I get a "double" utf-8 sequence with four characters, wich is of course wrongly displayed.
The program seems to translate:
a) iso-8859-1 -> utf-8 (right)
b) entity -> utf-8 -> utf-8 (wrong)
Anybody knows where can I fix that in the program?