Re: Extracting HTML meta data

From Eric Lease Morgan
DateWed, 25 Sep 2002 07:17:24 -0500
Subject Re: Extracting HTML meta data
In-Reply-To (006901c2644b$b70fdc70$7bf4d982-scms-waikato-ac-nz)
On 9/24/02 11:26 PM, "Stefan Boddie" <sjboddie@cs.waikato.ac.nz> wrote:

>> How do I edit my collect.cfg file to extract the meta data correctly? I am
>> specifically interested in the values of the subject and abstract tags.
>
> You should be able to do this by adding a "-metadata_fields
> Creator,Title,Abstract,Date,Subject" option to the HTMLPlug line in your
> collect.cfg file. That is, make it look something like the following (all on
> one line):
>
> plugin HTMLPlug -metadata_fields Creator,Title,Abstract,Date,Subject
>
> To alter the way your search results are displayed you'll need to add a
> "format SearchVList" line (or edit the existing one if it's already there).
>
> To display subject metadata you'd give it a value something like the following
> (all on one line):
>
> format SearchVList "<td>[link][icon][/link]</td><td>[Subject]</td>"

Thank you for the prompt reply. This is exactly what I needed to know, and
it not only works as advertised but works very well.

It seems that the key to displaying information in the search results list
is to have meta data embedded in the meta data tags of my HTML documents.

--
Eric Lease Morgan
http://www.infomotions.com/