Re: [greenstone-devel] Metadata.xml not well formed

From John R. McPherson
DateWed, 6 Aug 2003 09:52:47 +1200
Subject Re: [greenstone-devel] Metadata.xml not well formed
In-Reply-To (Pine-GSO-4-21-0308051652530-5405-100000-info-lanic-utexas-edu)
On Tue, Aug 05, 2003 at 05:00:31PM -0500, Bethany Letalien wrote:
> Helio, you can't use accents in the metadata files like that. Try removing
> them. Make it run, then do it again with the accents in place, but use
> UNICODE (http://www.unicode.org/). You're set to UTF-8, which is right.
> You need to start the statements with &#x and then end them with ; if
> you're using the sets at http://www.unicode.org/charts/PDF/U0080.pdf (also
> look at http://www.unicode.org/charts/PDF/U0000.pdf for special character
> it won't process if typed normally). For example, informacao becomes
>
> informaçõo
>
> I just got through converting Portuguese accents to UNICODE in a
> metadata.xml file, so feel free to e-mail me off list as well in either
> language. Also, I'd be curious to hear about your work at IBICT....

Hi,
you don't have to use the html &#...; codes. You can use the accented
characters, as long as they are using utf-8 codes, and not latin
(iso-8859 or windows codepage 1252) characters sets.

I'm not sure how to do this with windows text editors - on linux
you can use the "iconv" program to convert files from one encoding to
another.

John McPherson