Re: [greenstone-users] Metadata For Header Page (Paged Image Collection)

From Gavin Spomer
Date Tue, 14 Aug 2007 09:33:23 -0700
Subject Re: [greenstone-users] Metadata For Header Page (Paged Image Collection)
As promised, I am (finally) replying to this thread, and I see there has been another post on this subject from Georges Braoudakis.

Richard definitely got me going in the right direction. In particular:

1. Displaying metadata for the header page is done in the DocumentHeading format string
2. In the .item XML file, the metadata for the header page goes after your <PagedDocument> tag (obviously) but before your <PageGroup> tag.
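
To make point 2 concrete, here is a minimal sketch rather than a copy of the attached file. The metadata names and values, the file names, and the Page attributes (pagenum, imgfile, txtfile) are illustrative assumptions modelled on the Paged Image example collection. The small Python helper only confirms which Metadata elements sit before <PageGroup>; it is not part of Greenstone.

```python
import xml.etree.ElementTree as ET

# Hypothetical .item file: document-level metadata sits directly under
# <PagedDocument> and before <PageGroup>; page-level metadata sits inside <Page>.
ITEM_XML = """\
<PagedDocument>
  <Metadata name="dc.Title">Example Document</Metadata>
  <Metadata name="dc.Date">18810515</Metadata>
  <PageGroup>
    <Page pagenum="1" imgfile="images/example_1.gif" txtfile="text/example_1.txt">
      <Metadata name="dc.Title">Page 1</Metadata>
    </Page>
  </PageGroup>
</PagedDocument>
"""


def header_metadata(xml_text):
    """Return the Metadata entries that appear before <PageGroup>."""
    root = ET.fromstring(xml_text)
    found = {}
    for child in root:
        if child.tag == "PageGroup":
            break  # everything from here on belongs to the paged images
        if child.tag == "Metadata":
            found[child.get("name")] = child.text
    return found


print(header_metadata(ITEM_XML))
```

Running a check like this over your own .item files can show quickly whether the header metadata ended up in the wrong place, for example inside <PageGroup>.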

To Georges Braoudakis: I've attached an example .item file to illustrate #2 above. Email me if the list doesn't allow attachments. (I'm on so many lists that I can't remember which ones allow them and which don't.)

Also, to make the metadata show up on the header page, I use the [Parent:dc.*] variables, and for the image pages I use [dc.*]. For example, for title metadata I use [Parent:dc.Title] in the DocumentHeading format string and [dc.Title] in the DocumentText format string.
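
As a rough illustration of that pairing, the sketch below builds the two format lines as plain strings. The helper names and the field list are made up for this example, and the {If}{[field],shown-if-set,} pattern is borrowed from the Auburn format statement quoted further down this thread. I have not tested the generated lines in a collection, so treat them as a starting point only.

```python
def if_row(variable, label):
    """One conditional row: printed only when the metadata variable has a value."""
    return "{If}{[" + variable + "],<b>" + label + ":</b> [" + variable + "]<br />,}"


def format_statement(name, prefix, fields):
    """Build a collect.cfg-style format line for the given Dublin Core fields."""
    body = "".join(if_row(prefix + "dc." + field, field) for field in fields)
    return "format " + name + ' "' + body + '"'


FIELDS = ["Title", "Date"]  # illustrative choice of fields

# Header page: Parent: prefix, as described above.
heading = format_statement("DocumentHeading", "Parent:", FIELDS)
# Image pages: plain dc.* variables.
text = format_statement("DocumentText", "", FIELDS)

print(heading)
print(text)
```

The generated HTML contains no double quotes, so nothing inside the quoted format string needs escaping.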

I hope this helps other people and big thanks to Richard for helping me.

Gavin Spomer
Systems Programmer
Brooks Library
Central Washington University

>>> Gavin Spomer <spomerg@cwu.EDU> 06/26/07 10:41 AM >>>
... and will come back to it as soon as I have time. I just have heaps of stuff on my plate right now and don't have the time to fully sort through and test this yet.

- Gavin

>>> Richard Managh <richard@dlconsulting.co.nz> 06/25/07 2:54 PM >>>
The short answer is that you use the DocumentHeading format statement to
display the metadata you want to import, and you include metadata in
your .item files as in the following example import .item file.

http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/23__2.item

<Metadata name="Series">Matariki 1881</Metadata>
<Metadata name="Date">18810515</Metadata>


I've included a description of how I arrived at this conclusion below:


I had a look at the example collection you gave and its collect.cfg file.

http://diglib.auburn.edu/gsdl/cgi-bin/library?c=sheetmus

collect.cfg (
http://diglib.auburn.edu/gsdl/collect/sheetmus/etc/collect.cfg )

and noticed that the DocumentHeading format statement controls what
metadata is displayed on the header page.

format DocumentHeading "<center><table width=537><tr>{If}{[NumPages],<td width="100%" align="center"><h3>[Title]</h3></td></tr>,<td width="10px"></td><td>[parent(Top):Title], [parent(Top):Date]<br />[parent(Top):Creator]</td><td align="right">[DocTOC]</td><td width="10px"></td></tr>}{If}{[NumPages], <tr><td class="bib"></tr></table><table><tr><td rowspan="12"><img src="[DocImage]" alt="cover to piece" /></td><td rowspan="12" width="10"></td><td colspan="2"><u><h4>Bibliographic Information</h4></u></td></tr>{If}{[Title],<tr><td valign="top"><b>Title:</b></td><td>[Title]</td></tr>,}{If}{[Uniformtitle],<tr><td valign="top"><b>Uniform Title:</b></td><td>[Uniformtitle]</td></tr>,}{If}{[Series],<tr><td valign="top"><b>Series Title:</b></td><td>[Series]</td></tr>,}{If}{[Creator],<tr><td valign="top"><b>Composer:</b></td><td>[Creator]</td></tr>,}{If}{[Publisher],<tr><td valign="top"><b>Publisher:</b></td><td>[Publisher]</td></tr>,}{If}{[Date],<tr><td valign="top"><b>Date:</b></td><td>[Date]</td></tr>,}{If}{[Contributor],<tr><td valign="top"><b>Contributor(s):</b></td><td valign="top">[Contributor]</td></tr>,}{If}{[Format],<tr><td valign="top"><b>Format:</b></td><td>[Format]</td></tr>,}{If}{[Callnumber],<tr><td valign="top"><b>Call Number:</b></td><td>[Callnumber]</td></tr>,}{If}{[Description],<tr><td valign="top"><b>Notes:</b></td><td>[Description]</td></tr>,}{If}{[Subject],<tr><td valign="top"><b>Subject(s):</b></td><td>[Subject],}, </td></tr></table>}"


I then built a little test collection using one of the import documents
(url is below) in the

www.nzdl.org Paged Image Example collection (pagedimg-e)

using the Paged Image Examples collect.cfg file (url is below)


I changed the DocumentHeading format statement in the Paged Image
Example's collect.cfg to match the one in your example collection,

http://diglib.auburn.edu/gsdl/cgi-bin/library?c=sheetmus

which you gave above.

Once this collection was built (details below) I looked at the document
imported in the built collection, and it did display *series title* and
*date* metadata on the *heading page.*

Note that *Series* and *Date* metadata are included in this .item file,
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/23__2.item,
which is the one that I imported.

pagedimg-e collect.cfg:
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/etc/collect.cfg
(I modified the DocumentHeading format statement in this file to be the
same as given above)

Import document - This item file:

http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/23__2.item

Which contains series and date metadata that should appear on the
document heading using the above format statement.

I imported the above item file by placing it in the import directory of
my new test collection with its associated files in a directory
structure similar to how it appears in the example pagedimg-e collection.

I included all its files:

<collection dir>/import/xml/23/23__2.item
downloaded from
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/23__2.item

<collection dir>/import/xml/23/abstracts/23__2abstract.txt
downloaded from
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/abstracts/23__2abstract.txt

<collection dir>/import/xml/23/images/23__2_1.gif
downloaded from
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/images/23__2_1.gif

<collection dir>/import/xml/23/images/23__2_2.gif
downloaded from
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/images/23__2_2.gif

<collection dir>/import/xml/23/images/23__2_3.gif
downloaded from
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/images/23__2_3.gif

<collection dir>/import/xml/23/images/23__2_4.gif
downloaded from
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/images/23__2_4.gif

<collection dir>/import/xml/23/text/23__2_1.txt
downloaded from
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/text/23__2_1.txt

<collection dir>/import/xml/23/text/23__2_2.txt
downloaded from
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/text/23__2_2.txt

<collection dir>/import/xml/23/text/23__2_3.txt
downloaded from
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/text/23__2_3.txt

<collection dir>/import/xml/23/text/23__2_4.txt
downloaded from
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/text/23__2_4.txt


and the modified collect.cfg:
<collection dir>/etc/collect.cfg
*modified*
http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/etc/collect.cfg
*(modified DocumentHeading format statement)*
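
The import layout listed above can be summarised as a small download plan. This is only a sketch: the function names and the collection directory are placeholders of mine, and nothing is fetched. It just pairs each source URL with the local path it should be saved to.

```python
from pathlib import PurePosixPath

# Source directory of the example document in the pagedimg-e collection.
BASE_URL = "http://nzdl.sadl.uleth.ca/gsdl/collect/pagedimg-e/import/xml/23/"


def item_files(stem="23__2", pages=4):
    """Relative paths of the .item file and the files it refers to."""
    files = [stem + ".item", "abstracts/" + stem + "abstract.txt"]
    files += ["images/%s_%d.gif" % (stem, n) for n in range(1, pages + 1)]
    files += ["text/%s_%d.txt" % (stem, n) for n in range(1, pages + 1)]
    return files


def download_plan(collection_dir):
    """Pair each source URL with its destination under <collection dir>/import/xml/23."""
    dest_root = PurePosixPath(collection_dir) / "import" / "xml" / "23"
    return [(BASE_URL + rel, str(dest_root / rel)) for rel in item_files()]


# "mytest" is a placeholder collection directory.
for url, dest in download_plan("mytest"):
    print(dest, "<-", url)
```

The collect.cfg is left out on purpose, since it is edited by hand rather than copied unchanged.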

I then imported and built the collection and looked at the particular
document's header page in the collection.

Hope this helps,

Richard.

--
DL Consulting
Greenstone Digital Library and Digitisation Specialists
contact@dlconsulting.com
www.dlconsulting.com


Gavin Spomer wrote:

>Genuinely big thanks for responding to my email, Richard. :) The list has been somewhat inactive lately.
>
>Yes, the first link you have below is the header page. The header page is the one that doesn't contain any of the images that are being "paged". It's the one that precedes the numbered pages, thus will usually have a "next button" with a 1 to go to the first of the paged images. And thanks for the (second) link to the Paged Image collection, but as I said in my first email, this is one of the many things I've studied already.
>
>Auburn University has a collection that illustrates what I want to do:
>
> http://diglib.auburn.edu/gsdl/cgi-bin/library?site=localhost&a=p&p=about&c=sheetmus&ct=0&l=en&w=utf-8
>
>Here is a header page that has metadata on it: (Title, Composer, Publisher, Date, Contributor, Format, Call Number, Notes & Subject(s))
>
> http://diglib.auburn.edu/gsdl/cgi-bin/library?e=d-000-00---0sheetmus--00-0-0--0prompt-10---4------0-1l--1-en-50---20-about---00031-001-1-0utfZz-8-00&a=d&c=sheetmus&cl=CL1.1&d=18195590
>
>
>What I want to do is have Dublin Core metadata on the header page. The problem isn't how/where to actually put it on the page; that's easy. You just put [dc.metadataitem], where metadataitem can be Subject, Title, etc., in one of the format strings, such as the DocumentText one. The problem is where/how to put it into the XML-based .item file that the PagedImgPlug processes. I have tried various things, but nothing works.
>
>I have emailed Auburn University, but they referred me to someone who hasn't gotten back to me yet. From reading through some of the list archives, I believe Katherine Don (sp?) is the one who developed the PagedImgPlug, but a recent email says she's on maternity leave, so out of respect I don't want to bother her.
>
>Thanks again for your reply. :)
>
>Gavin Spomer
>Systems Programmer
>Brooks Library
>Central Washington University
>
>
>
>>>>Richard Managh <richard@dlconsulting.co.nz> 06/21/07 6:07 PM >>>
>>>>
>>>>
>Hi Gavin,
>
>Gavin Spomer wrote:
>
>
>
>>Hello,
>>
>>I just can't figure out how to add metadata for the header page in a paged image collection, but I know it's possible because other people have done it. Where and how do I add it to my XML-based .item file?
>>
>>
>>
>>
>I'm not sure exactly what you mean by a header page, do you mean a page
>like this?
>
>http://puka.cs.waikato.ac.nz/cgi-bin/library?e=d-00000-00---off-0pagedimgZz-e--00-0--0-10-0---0---0prompt-10---4-------0-1l--11-en-50---20-preview---00-0-1-00-0-0-11-1-0utfZz-8-00&a=d&c=pagedimg-e&cl=CL1.1&d=HASHecd552ed3c2d5f1f6a620f
>
>This is the collection for the above link:
>http://puka.cs.waikato.ac.nz/cgi-bin/library?e=d-00000-00---off-0pagedimgZz-e--00-0--0-10-0---0---0prompt-10---4-------0-1l--11-en-50---20-preview---00-0-1-00-0-0-11-1-0utfZz-8-00&a=p&p=about
>
>
>Could you give us a link to an example of what you mean when you say
>other people have done it?
>
>I'm not sure exactly what you want to do, and where. On which page do you
>want to display the metadata? Where do you want to display it?
>
>
>Regards,
>
>Richard.
>
>
>
>------------------------------------------------------------------------
>
>_______________________________________________
>greenstone-users mailing list
>greenstone-users@list.scms.waikato.ac.nz
>https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>
>


<<attachment>>
Type: application/octet-stream
Filename: 1114.item
