Re: [greenstone-devel] Import fails with Russian metadata

From John R. McPherson
DateThu, 22 Apr 2004 10:36:07 +1200
Subject Re: [greenstone-devel] Import fails with Russian metadata
In-Reply-To (20040421184843-GA26436-mercycorps-org)
Doug Carter wrote:
> Hi all,
> I've got a problem importing a collection that has Russian characters
> in the metadata.xml file. I thought that there was support for foreign
> character sets, so I don't know how to go about fixing this.
> When I import, the RecPlug dies with a parse error:
> Uncaught exception from user code:
> RecPlug: ERROR /usr/local/gsdl/collect/progdev/import/metadata.xml is not a well formed metadata.xml file (
> not well-formed (invalid token) at line 4486, column 44, byte 190213 at /usr/local/gsdl-build/perllib/cpan/XML/ line 187
> )
> The character it doesn't like is the *second* Russian character in
> the metadata field.
> Any ideas?

I think that the metadata.xml files must be encoded in unicode UTF-8.
You can have Russian (or anything) as long as it is utf-8, and not in a
Cyrillic encoding (eg koi8 or windows-1251).

John McPherson