Re: [greenstone-users] Can't read unicode data when library is built on Linux.

From Chaitra Rao
Date Tue, 19 Oct 2004 11:12:29 +0530 (IST)
Subject Re: [greenstone-users] Can't read unicode data when library is built on Linux.
In-Reply-To (41743725-2AD8FC7D-cs-waikato-ac-nz)
Thanks, Michael. I compared the doc.xml files on both Windows and Linux.
There wasn't much of a difference, except that the content seemed to be
encoded differently. The contents of the doc.xml file on Windows were as
follows:

<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE Archive SYSTEM
"http://greenstone.org/dtd/Archive/1.0/Archive.dtd">
<Archive>
<Section>
<Description>
<Metadata name="gsdlsourcefilename">C:\Program
Files\gsdl\collect\developm\import\Bhav Puchiye\Names.txt</Metadata>
<Metadata name="gsdldoctype">indexed_doc</Metadata>
<Metadata name="Language">hi</Metadata>
<Metadata name="Encoding">utf8</Metadata>
<Metadata name="Plugin">TEXTPlug</Metadata>
<Metadata name="Source">Names.txt</Metadata>
<Metadata name="dc.Language">hi</Metadata>
<Metadata name="dc.Title">Bhav Puchiye</Metadata>
<Metadata name="dls.Language">hi</Metadata>
<Metadata name="dc.Description">Bhav Puchiye</Metadata>
<Metadata name="dc.Subject">Bhav Puchiye Documents</Metadata>
<Metadata name="Title"> - Apple
</Metadata>
<Metadata name="Identifier">HASHdbc194e7f23dac7f1078e3</Metadata>
</Description>
<Content>&lt;pre&gt;
- Apple

- Cabbage

- Geranium

- Melon

-Mushroom

- Potato

- Soybean

- Peas

-Tomato


&lt;/pre&gt;</Content>
</Section>
</Archive>


The contents of doc.xml in Linux were as follows:

<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE Archive SYSTEM
"http://greenstone.org/dtd/Archive/1.0/Archive.dtd">
<Archive>
<Section>
<Description>
<Metadata
name="gsdlsourcefilename">/home/gsdl/gsdl/collect/developm/import/bhav
puchiye/Names.txt</Metadata>
<Metadata name="gsdldoctype">indexed_doc</Metadata>
<Metadata name="Language">hi</Metadata>
<Metadata name="Encoding">utf8</Metadata> - Cabbage
<Metadata name="Plugin">TEXTPlug</Metadata>
<Metadata name="Source">Names.txt</Metadata>
<Metadata name="dc.Language">hi</Metadata>
<Metadata name="dc.Title">Bhav Puchiye</Metadata>
<Metadata name="dls.Language">hi</Metadata>
<Metadata name="dc.Description">Bhav Puchiye</Metadata>
<Metadata name="dc.Subject">Bhav Puchiye Documents</Metadata>
<Metadata name="Title"> - Apple^M</Metadata>
<Metadata name="Identifier">HASH01d65d493c96e473509e0373</Metadata>
</Description>
<Content>&lt;pre&gt;
- Cabbage

- Geranium
- Melon
-Mushroom
- Potato
- Soybean
- Peas
-Tomato

&lt;/pre&gt;</Content>
</Section>
</Archive>


As you said, there wasn't much difference apart from the paths. What could
be the problem?
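
Comparing the two files by eye can miss invisible characters such as the
carriage return shown as ^M in the Linux Title above. Below is a minimal
sketch in Python of how the comparison could be made at the byte level. It
is not part of Greenstone: the function names and the list of ignored
metadata fields (gsdlsourcefilename, Identifier) are assumptions chosen for
illustration, and it only counts carriage returns and checks whether the
two files match once line endings and machine-specific fields are set aside.

```python
import re

# Assumption: these two fields are expected to differ between machines
# (source path and hash identifier), so they are ignored in the comparison.
IGNORED = re.compile(
    rb'<Metadata\s+name="(?:gsdlsourcefilename|Identifier)">.*?</Metadata>',
    re.DOTALL,
)


def normalise(data: bytes) -> list:
    """Drop machine-specific metadata, unify line endings, skip blank lines."""
    data = IGNORED.sub(b"", data)
    data = data.replace(b"\r\n", b"\n").replace(b"\r", b"\n")
    return [line.strip() for line in data.split(b"\n") if line.strip()]


def compare_docs(windows_doc: bytes, linux_doc: bytes) -> dict:
    """Report carriage-return counts and whether the rest of the text matches."""
    return {
        "windows_cr": windows_doc.count(b"\r"),
        "linux_cr": linux_doc.count(b"\r"),
        "same_after_normalising": normalise(windows_doc) == normalise(linux_doc),
    }
```

To use it, read each doc.xml in binary mode, for example
open(path, "rb").read(), and pass both byte strings to compare_docs. Working
on raw bytes is deliberate: an XML parser would normalise line endings and
hide the carriage returns.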


Regards,
Chaitra

On Tue, 19 Oct 2004, Michael Dewsnip wrote:

> Hi,
>
> You need to try to narrow down where the problem is. You should compare the
> doc.xml files in the archives folder of the collection built on both Windows
> and Linux. If the doc.xml files are different (except for file paths) on
> Windows and Linux then it is a building problem, otherwise it is a runtime
> problem. Once you know which of these areas it is in, the source of the
> problem can be narrowed down further.
>
> Regards,
>
> Michael
>
>
>
> Chaitra Rao wrote:
>
> > Hi,
> > I was not able to read Unicode data in the browser (IE) when I built my
> > library on Linux. I had some Unicode content in a text file which I wanted
> > to place in my collection. I was able to do the same on my Windows machine
> > without many changes to the text plugin. I just changed the text
> > plugin and set the default language to Hindi (the language in which I have
> > my Unicode data) and set the input encoding to UTF-8.
> > Can someone help? Do I need to configure the Apache server settings to
> > enable it to serve Unicode content?
> >
> > Regards,
> > Chaitra