Re: [greenstone-users] Extracted Subject terms from HTML

From Katherine Don
Date Mon, 15 Nov 2004 08:53:44 +1300
Subject Re: [greenstone-users] Extracted Subject terms from HTML
In-Reply-To (4194F09F-6070207-ubcic-bc-ca)
Hi Jenn

Each classifier has a slightly different way of handling metadata - one
of these days we will get around to standardising them. AZList and
AZSectionList only use the first value of the metadata, but the
CompactLists can use all values.

Try the following to see if they do what you want:

classify AZCompactList -metadata Subject -allvalues -doclevel section
classify AZCompactSectionList -metadata Subject -allvalues

I'm not sure what the difference between AZCompactList -doclevel section
and AZCompactSectionList is.
This will group items with common subjects into a subfolder. Use the
-mingroup option to control this. E.g. set mingroup to 1 to make the first
vertical list all subfolders, or if you don't want any subfolders, set
it to something large.
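
As a rough sketch of the mingroup suggestion, assuming -mingroup takes a
number on the same classify line in collect.cfg (the values 1 and 1000 are
only illustrative, not recommended settings):

classify AZCompactList -metadata Subject -allvalues -doclevel section -mingroup 1
classify AZCompactList -metadata Subject -allvalues -doclevel section -mingroup 1000

The first line should put every subject into its own subfolder; the second
should in practice leave you with no subfolders, as long as no subject is
shared by that many items.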

Katherine Don

Jenn Cole wrote:
> Hello,
> I am trying to get Greenstone to extract section subject metadata that
> is coded in the source HTML document while using the AZSectionList
> browsing classifier. I have several subject headings for each section,
> however, when I build and preview the database only the last subject
> term for each section appears. How do I tell Greenstone to display all
> of the extracted subject terms?
> Thanks,
> Jenn Cole
> UBCIC Library Technician