[greenstone-users] Accents in PDF and Word files

From Pier.Luigi.Rossi@bondy.ird.fr
Date Wed, 29 Sep 2004 15:26:27 +0200
Subject [greenstone-users] Accents in PDF and Word files
Hi,
I have just become a member of the list.

I have to build collections from PDF files in French.

Each file contains a title with accents and text with accents.

When I build a collection using pdfplug I have 2 choices:

If I use pdfplug with input_encoding iso_8859_1:
the accents in the title are shown correctly when I consult the collection:
échelle is shown as échelle
the accents in the text are not shown correctly in the text document (they are
fine in the PDF document!): unité appears with garbled characters, not as unité
but all the documents are indexed in the collection

If I use pdfplug with input_encoding auto:
the accents in the title are not shown correctly when I consult the collection:
échelle is shown as chelle
the accents in the text are shown correctly in the text document (and in the
PDF document!): unité is shown as unité
but not all the documents are indexed in the collection
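
The two symptoms above look like a mismatch between the encoding the text was written in and the encoding it is read with. Here is a minimal sketch in plain Python (not Greenstone code) that reproduces both effects. The helper name misread is hypothetical, and whether pdfplug behaves this way internally is only an assumption.

```python
def misread(text: str, written_as: str, read_as: str) -> str:
    """Encode text with one charset, then decode the bytes with another."""
    raw = text.encode(written_as)
    # errors="ignore" silently drops bytes that are invalid in read_as
    return raw.decode(read_as, errors="ignore")


# UTF-8 bytes read as ISO-8859-1: each accented letter becomes two characters
print(misread("unité", "utf-8", "iso-8859-1"))    # unitÃ©

# ISO-8859-1 bytes read as UTF-8: the accented letter is dropped
print(misread("échelle", "iso-8859-1", "utf-8"))  # chelle
```

If this is what is happening, the title and the body text of the same file would be reaching the plugin in two different encodings, so neither setting can suit both.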

When I search the collection, finding documents is very hard.
If I search for documents about unité I have to type the word in its garbled
form (and I can't with my PC).
The difficulty is explaining that to all the users.

I tried changing the preferences to UTF-8 or to iso_8859_1, but if
I search for unité I can't find unité in the index.

Maybe people working with accents in other languages have the same problems?

Is it possible to build the index without accents and to filter the search
entry so that it has no accents either?
If I search for unité, a filter translates it to unite, and the index contains
just unite.
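
The idea in this question (often called accent folding) can be sketched as follows. This is standalone Python using the standard unicodedata module, not a Greenstone option or API. The function names fold_accents, build_index and search, and the two sample documents, are made up for illustration.

```python
import unicodedata
from collections import defaultdict


def fold_accents(text):
    """Return text with accents removed, e.g. 'unité' -> 'unite'."""
    # NFD splits an accented letter into a base letter plus combining marks
    decomposed = unicodedata.normalize("NFD", text)
    # Keep only the characters that are not combining marks
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def build_index(docs):
    """Map each accent-folded, lowercased word to the documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.split():
            index[fold_accents(word).lower()].add(doc_id)
    return index


def search(index, query):
    """Apply the same folding to the query before looking it up."""
    return index.get(fold_accents(query).lower(), set())


# Hypothetical sample documents
docs = {"doc1": "unité de mesure", "doc2": "échelle de la carte"}
index = build_index(docs)
```

With this sketch, search(index, "unité") and search(index, "unite") both return the set containing doc1, because the same folding is applied when indexing and when searching. It only helps once the text has been decoded correctly in the first place.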


Regards

Pier Luigi Rossi