[greenstone-users] creating a metadata field using an attribute of a span tag

From Greenstone Team
DateMon May 7 14:24:16 2012
Subject [greenstone-users] creating a metadata field using an attribute of a span tag
In-Reply-To (6F8C0499CDDFE04485C8AD3E353DCE46AADC70D01A-gcc-exch07vm1)
Hi,

I don't have experience with using span tags in conjunction with the
HTMLPlugin, but the Greenstone tutorials say that you can next <meta>
elements in the <head> element and configure the HTMLPlugin to extract
those. The disadvantage is that you can't add this in just any locations
within the body of the document, they really go into the head section in
particular.

See
http://wiki.greenstone.org/wiki/gsdoc/tutorial/en/large_html_collection.htm
The sample document boleyn.html that the tutorial refers to, for
instance contains elements like:

<meta name="author" content="Marilee Mongello">

The tutorial then explains how to configure the HTMLPlugin to grab the
values for it.

You may want to do something similar for person and place:
<meta name="person" content="Tom Jones">
<meta name="place" content="UK">

Something to try out is assigning meta for multiple persons or places
for a single document, and checking whether they all get extracted by
the plugin.

Regards,
Anupama


On 05/05/12 00:01, John Fitzgibbon wrote:
>
> Hi,
>
> I am using Greenstone to index a number of HTML files. The files have
> tags like <span class='person'>Tom Jones</span>. I am hoping to create
> an index of persons and places. I have configured the HTMLPlugin so
> that metadata fields displays Title, span class='place', span
> class='person'. However, this isn't working.
>
> Should the tag read <person>Tom Jones</person>, or can I get it to
> work with 'person' as an attribute of the span element?
>
> Any help would be much appreciated.
>
> Regards
>
> John
>
> Regards,
>
> John
>
> John Fitzgibbon
>
> w: www.galwaylibrary.ie
>
> e: info@galwaylibrary.ie
>
> p: 00 353 91 562471
>
> f: 00 353 91 565039
>
> ------------------------------------------------------------------------
> <http://www.householdcharge.ie> <http://www.householdcharge.ie>
>
> This e-mail message has been scanned for content and cleared by
> MailMarshal Hosted at Galway County Council
>
> T□ an teachtaireacht r□omhphoist seo scan□ilte d'□bhar agus glanta ag
> MailMarshal at□ □st□lta i gComhairle Chontae na Gaillimhe.
>
> Correspondance is welcome in Irish or in English.
>
> T□ m□le f□ilte roimh chomhfhreagras i nGaeilge n□ i mB□arla.
>
> T□ eolas at□ pr□obh□ideach agus r□nda sa r□omhphost seo agus aon iat□n
> a ghabhann leis agus is leis an duine/na daoine sin amh□in a bhfuil
> siad seolta chucu a bhaineann siad. Mura seola□ th□, n□l t□ □daraithe
> an r□omhphost n□ aon iat□n a ghabhann leis a l□amh, a ch□ip□il n□ a
> □s□id. M□ t□ an r□omhphost seo faighte agat tr□ dhearmad, cuir an
> seolt□ir ar an eolas thr□ aischur r□omhphoist agus scrios ansin □ le
> do thoil.
>
> This e-mail and any attachment contains information which is private
> and confidential and is intended for the addressee only. If you are
> not an addressee, you are not authorised to read, copy or use the
> e-mail or any attachment. If you have received this e-mail in error,
> please notify the sender by return e-mail and then destroy it.
>
> If you need this email in an alternative format please contact the sender
>
> M□ t□ an r□omhphost seo ag teast□il uait i bhform□id eile d□an
> teagmh□il leis an duine a sheol chugat □
>
> ------------------------------------------------------------------------
>
>
> _______________________________________________
> greenstone-users mailing list
> greenstone-users@list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users

-------------- next part --------------
An HTML attachment was scrubbed...
URL: https://list.scms.waikato.ac.nz/mailman/private/greenstone-users/attachments/20120507/669c095b/attachment.html