Thanks for your help. I just found out that my item files actually worked, they just didn't show up in my browsing list. I could search for them. Your item file works as well. It was my problem. I had to add "no_metadata_formatting" to the title browsing classifier. Once I did that, everything works on my Linux server. On my Windows XP computer, the XML item file still doesn't work, though. It would bomb perl, every time...
From: Katherine Don [mailto:firstname.lastname@example.org]
Sent: Tuesday, March 21, 2006 8:38 PM
To: Bin Zhang
Subject: Re: [greenstone-users] PagedImage plugin
I have just tried this with an item file containing UTF-8 chinese metadata (metadata values, not metadata names).
And it built fine.
Greenstone currently has a problem with unicode metadata *names*, so if this is what you are trying to do, it won't work.
But unicode *values* should.
Are you sure the file is encoded in UTF-8?
What version of greenstone are you using?
I have attached my item file. You could try with this file and see if your greenstone can handle it. If not, you may need a newer version of greenstone.
Bin Zhang wrote:
> I used utf-8. Any idea why this is happening?
> -----Original Message-----
> From: John R. McPherson [mailto:email@example.com]
> Sent: Tuesday, March 21, 2006 4:42 PM
> To: Bin Zhang
> Cc: firstname.lastname@example.org
> Subject: Re: [greenstone-users] PagedImage plugin
> On Tue, Mar 21, 2006 at 03:15:29PM -0800, Bin Zhang wrote:
>>I was trying to setting up a collection of scanned images of Chinese
>>books. I first tried to use the plain format for the document
>>metadata. If I use Chinese in the .item file, GLI will not process
>>the file, but I add some English text in front of Chinese, it would
>>work. Then I tought maybe if I use the XML format it would work.
>>Wrong. XML format doesn't work at all, even if I only have Chinese
>>text in the .item file. Perl would die a fatal death.
> What encoding(s) did you use? I'm not sure but I wouldn't be surprised if the plugin assumes that the .item file is in utf-8, instead of big5 or gbk.
> greenstone-users mailing list