Searching non-ASCII text

From Tim Finney
DateTue, 28 Jan 2003 10:37:53 -0500
Subject Searching non-ASCII text
In-Reply-To (002101c2c687$308655f0$7bf4d982-scms-waikato-ac-nz)
I have a number of Greek and Latin manuscript texts with English
commentary. I would like to use Greenstone to build a digital library
that can search and display the documents along with images of the
manuscripts.

Does anyone have any idea how I should do this? I would like to use
Unicode for the Greek but I don't know how a Unicode search string can
be entered using the Greenstone search interface. I can use a font that
maps ASCII characters to Greek ones, but then whoever uses the library
will need to install the font.

Also, the primary documents are in XML. I can use a stylesheet to
convert everything to HTML before importing the documents into
Greenstone. However, I would like to be able to use XML. Is Greenstone
ever going to be able to work directly with XML encoded documents? I
know that it uses XML internally, but I would like to use XML from start
to finish, with stylesheets being used for display? It would be great to
be able index the contents of individual elements (e.g. bibliographic
references, names, authors, etc.)

Best

Tim Finney