Re: [greenstone-users] Problem with metadata.xml

From Michael Dewsnip
Date Sun, 07 Mar 2004 13:49:55 +1300
Subject Re: [greenstone-users] Problem with metadata.xml
In-Reply-To (1078529761-40490ee171cea-lakshmi-info-science-uiowa-edu)
Hi,

Good to hear you've had some success building a small test collection with Greenstone.
Regarding your use of metadata.xml files, it looks like you're doing everything right. A
couple of things to check:

- Have you remembered to rerun import.pl on your collection? (Metadata is assigned at import
time).
- Check the filenames in your metadata.xml files carefully (they are case-sensitive, at least on
some operating systems).
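
The filename check above could be scripted roughly as follows. This is only a sketch: the
function name check_filenames is made up for illustration, and it treats each FileName entry as
a literal file name in the same directory as metadata.xml, which may not cover every way
Greenstone can match FileName values.

```python
import os
import xml.etree.ElementTree as ET


def check_filenames(metadata_path):
    """Compare FileName entries in a metadata.xml file with the files next to it.

    Hypothetical helper, not part of Greenstone. Assumes each FileName holds a
    plain file name rather than a pattern.
    Returns (missing, case_mismatches).
    """
    directory = os.path.dirname(os.path.abspath(metadata_path))
    on_disk = set(os.listdir(directory))
    # Map lower-cased names to their real spelling to spot case-only differences.
    by_lower = {name.lower(): name for name in on_disk}

    missing, case_mismatches = [], []
    for elem in ET.parse(metadata_path).getroot().iter("FileName"):
        name = (elem.text or "").strip()
        if name in on_disk:
            continue  # exact match, nothing to report
        if name.lower() in by_lower:
            case_mismatches.append((name, by_lower[name.lower()]))
        else:
            missing.append(name)
    return missing, case_mismatches
```

Calling check_filenames("import/metadata.xml") (the path is an example) and printing both
lists should show any entry that differs from the file on disk only by case.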

Also, after importing, you can check whether the metadata has been assigned correctly by
looking at the doc.xml files in the archives directory. These show the metadata assigned to
the file -- you should see your Keyword metadata here.
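
If there are many documents, a small script can do this check for you. The sketch below is an
illustration only: find_metadata is a made-up name, and it assumes the doc.xml files record
metadata as <Metadata name="...">value</Metadata> elements, in the same form as metadata.xml.
It uses a plain text search rather than an XML parser so that it does not depend on the rest
of the file.

```python
import os
import re


def find_metadata(archives_dir, name="Keyword"):
    """Map each doc.xml under archives_dir to the values found for one metadata name.

    Hypothetical helper, not part of Greenstone. Assumes metadata is stored as
    <Metadata name="...">value</Metadata> elements.
    """
    pattern = re.compile(
        r'<Metadata\b[^>]*\bname="%s"[^>]*>(.*?)</Metadata>' % re.escape(name),
        re.DOTALL,
    )
    found = {}
    for root, _dirs, files in os.walk(archives_dir):
        if "doc.xml" in files:
            path = os.path.join(root, "doc.xml")
            with open(path, encoding="utf-8", errors="replace") as handle:
                found[path] = [v.strip() for v in pattern.findall(handle.read())]
    return found
```

Any doc.xml that maps to an empty list did not pick up the Keyword metadata, which points
back to the import step or the filenames.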

Regards,

Michael

Padmini Srinivasan wrote:

> Apologies for the duplicate posting. I did not have a subject in my previous
> email and so am resending. Padmini
>
> ------
> Hello All,
>
> I am a newbie to Greenstone. I have thankfully succeeded in using OSX to build
> a small test collection of mainly pdf files. All is fine as far as the build
> is concerned. My question is regarding metadata.
>
> I have a file called metadata.xml in the same dir as the pdf files for the
> library. It looks like:
>
> <?xml version="1.0" ?>
> <!DOCTYPE GreenstoneDirectoryMetadata SYSTEM
> "http://greenstone.org/dtd/GreenstoneDirectoryMetadata/1.0/GreenstoneDirectoryMetadata.dtd"
> >
> <DirectoryMetadata>
> <FileSet>
> <FileName>FinancialTextmining.pdf</FileName>
> <Description>
> <Metadata name="Keyword">Finance </Metadata>
> </Description>
> <FileName>gapscoreBioInfo.pdf</FileName>
> <Description>
> <Metadata name="Keyword">Biology</Metadata>
> </Description>
> </FileSet>
> </DirectoryMetadata>
>
> Next I have in my configuration file the following changes to the default
> lines.
>
> RecPlug -use_metadata_files
>
> I also have a line
> classify "AZList" "-metadata" "Keyword"
>
> and my indexes are:
>
> indexes "document:text" "document:Title" "document:Source" "document:Keyword"
>
> My question is: I am not able to search on the Keyword field
>
> Also the Listing for Keyword is not alphabetical by Keyword.
>
> Finally, the pull down set of options for searching does not offer Keyword,
> instead it offers a _dmy_ field which in any case never retrieves any documents
> irrespective of what I search for.
>
> I have also tried putting the explicit path information into the metadata.xml
> file for each input file. So I am misunderstanding something completely about
> how to supply metadata from an external file. Since these are pdf files, I am
> not sure how to supply metadata any other way.
>
> Your input is greatly appreciated.
>
> Best wishes, Padmini
>
> Padmini Srinivasan
> Univ. of Iowa