[greenstone-users] Html pages in iso-8859-1 are badly converted to utf-8 (double conversion?)

From Eduardo Trápani
DateFri, 18 Mar 2005 14:22:30 -0300
Subject [greenstone-users] Html pages in iso-8859-1 are badly converted to utf-8 (double conversion?)
Hi,

I have a problem. My html documents are in iso-8859-1, when I try to see them after a search, for example, I get all the accented characters wrong. I actually see an utf-8 sequence, the two characters that start with ~A. In fact in the source I have a sequence of four utf-8 characters that is rendered as the sequence that would have to show the accented character:

source: □
html: ~AX~AY
browser: ~AX

if I type ~AZ in a UTF-8 then I get □ again (X, Y and Z could be any character).

HTMLPlugin has the input-encoding in iso-8859-1, anybody knows what could be wrong?

Eduardo.

PD: It's not a browser problem, if chose iso-8859-1 I get four characters per accented characer, if I choose UTF-8 then I get only two, the ~A and something else. The source has a four character utf-8 sequence for each accented character.