Re: pb with french accents under windows 2000 with the last release

From John R. McPherson
Date Tue, 09 Jul 2002 13:08:44 +1200
Subject Re: pb with french accents under windows 2000 with the last release
In-Reply-To (6812526428-20020709014933-free-fr)
eou wrote:
>
> Hello greenstone,
>
> I would like to know how to correctly include .doc (Word) files in
> the latest release, because I don't get the French accents
> (□,□,□,□,□, etc.), and the .doc-to-HTML conversion is very bad.
> The same thing happens with the search module, which ignores some
> characters (□,□,□,□,□, etc.).

Hi,
We use the third-party program "wvWare" to convert Word documents
to HTML, and it can handle various character sets, converting
them into Unicode (UTF-8).

What language encoding do your MS Word files have? (I think Word
takes this from your default Windows regional or "keyboard"
settings.) If you are using Unicode (UCS-2) or Western (cp1252)
encodings then wvWare should be fine.
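
To illustrate what "converting into UTF-8" means for those two
encodings, here is a minimal Python sketch. It is only my own
illustration, not wvWare's actual code, and the sample string is a
made-up example rather than text from your documents. It shows that
the same accented characters stored as cp1252 or as UCS-2 end up as
identical UTF-8 bytes once they are decoded with the right encoding.

```python
# Hypothetical sample text (an assumption, not from the original files).
sample = "é è à ç ù"

# The same text as it might be stored in the two encodings mentioned above.
cp1252_bytes = sample.encode("cp1252")
# For these characters, UCS-2 little-endian is byte-identical to UTF-16-LE.
ucs2_bytes = sample.encode("utf-16-le")

# Decode with the correct source encoding, then re-encode as UTF-8.
utf8_from_cp1252 = cp1252_bytes.decode("cp1252").encode("utf-8")
utf8_from_ucs2 = ucs2_bytes.decode("utf-16-le").encode("utf-8")

# Both routes give the same UTF-8 bytes.
assert utf8_from_cp1252 == utf8_from_ucs2
```

If the source encoding is guessed wrongly at the decode step, the
accents are lost before the text is ever stored or indexed.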

The other possibility is that the files are converted properly,
but Greenstone itself isn't displaying the accented characters
properly. If you go to the Preferences page in Greenstone, you
can set the encoding used to send data back to your browser.
Either a Western encoding or UTF-8 should show French accents
properly, if they were encoded properly in the first place.
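
The display problem can be sketched in Python as well. Again this is
just an illustration under my own assumptions (the word "café" is a
made-up sample, and none of this is Greenstone code): correctly
encoded bytes still look wrong when the reader assumes a different
encoding from the one that was used to send them.

```python
# Hypothetical sample word (an assumption for illustration only).
text = "café"

utf8_bytes = text.encode("utf-8")      # b'caf\xc3\xa9'
cp1252_bytes = text.encode("cp1252")   # b'caf\xe9'

# UTF-8 data displayed as Western (cp1252): each accent becomes two characters.
utf8_read_as_western = utf8_bytes.decode("cp1252")
assert utf8_read_as_western == "cafÃ©"

# Western data displayed as UTF-8: the accent is an invalid byte sequence,
# which is shown as the replacement character U+FFFD.
western_read_as_utf8 = cp1252_bytes.decode("utf-8", errors="replace")
assert western_read_as_utf8 == "caf\ufffd"

# Matching encodings on both sides round-trip cleanly.
assert utf8_bytes.decode("utf-8") == text
assert cp1252_bytes.decode("cp1252") == text
```

So if you see doubled odd characters or placeholder boxes where the
accents should be, a mismatch between the encoding Greenstone sends
and the one the browser assumes is one plausible cause.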

John McPherson