[greenstone-users] Harvesting metadata and documents from OAI

From j.eustis@neu.edu
Date Fri Aug 21 01:26:48 2009
Subject [greenstone-users] Harvesting metadata and documents from OAI
In-Reply-To (4A8C73BD-5010404-cs-waikato-ac-nz)
Dear Katherine,

I am using Greenstone 2.81. I gathered both the oai and document files.
Then I built my new collection. The metadata appeared as extracted
metadata, and I also saw this in the archive files.

I also tried exploding the files. However, I was not given the
option of exploding to dc metadata. I could assign metadata to the
files. This took me to the set filter option automatically. I chose the
advanced option, where dc.Title equals ex.Title. Then I exploded the metadata.

However, this did not delete the oai files. Also, the metadata remained in
the extracted metadata fields.

Is this because I'm using 2.81? Or am I just missing a step?

Thanks,
Jennifer
*********************************
Jennifer M. Eustis
Catalog/Metadata Librarian
260 Snell Library
Northeastern University
360 Huntington Ave.
Boston, MA 02115
Tel: 617-373-7102
Email: j.eustis@neu.edu
~~~~~~~~~~~~~~~~~~~~~~~~~~~~
This message may contain confidential information, and is intended only
for the addressee. If you are not the named addressee, or if you have
received this message in error, we ask your cooperation to refrain from
disseminating, distributing or copying this e-mail, and request that you
delete it from your computer or messaging device.
Thank you.


Katherine Don <kjdon@cs.waikato.ac.nz>
08/19/2009 05:49 PM
To j.eustis@neu.edu
cc greenstone-users@list.scms.waikato.ac.nz
Subject Re: [greenstone-users] Harvesting metadata and documents from OAI


Hi Jennifer

When you looked in the downloaded files section, did you get .oai files
and document files? You need to add both to your collection.
If you look at the documents in the Enrich panel, you won't see the
metadata, as it is in the oai records.
Once you have built the collection, depending on which version of
Greenstone you are using, you may see the metadata as extracted
metadata, associated with the documents. Later versions of Greenstone
will not show the metadata in the Enrich panel, as it is saved as dc
metadata but cannot be edited.

After building, take a look at the archive files in the collection. You
should see some metadata in there.
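
If you would rather check the archive files with a script than by eye,
something like the following may help. It is only a rough sketch: it
assumes the archive file is XML and that metadata is stored in
<Metadata name="..."> elements, and the sample record and the function
name list_metadata are made up for illustration.

```python
import xml.etree.ElementTree as ET


def list_metadata(xml_text):
    """Return (name, value) pairs for every <Metadata name="..."> element.

    Assumption: the archive file is XML and keeps each metadata value in a
    <Metadata> element with a "name" attribute.
    """
    root = ET.fromstring(xml_text)
    pairs = []
    for elem in root.iter("Metadata"):
        name = elem.get("name")
        if name is not None:
            pairs.append((name, (elem.text or "").strip()))
    return pairs


# Hypothetical sample record, not taken from a real collection.
SAMPLE = """<Archive>
  <Section>
    <Description>
      <Metadata name="Title">An example record</Metadata>
      <Metadata name="dc.Creator">Example, A.</Metadata>
    </Description>
  </Section>
</Archive>"""

if __name__ == "__main__":
    for name, value in list_metadata(SAMPLE):
        print(f"{name}: {value}")
```

To run it against a real archive file, read the file's text and pass it
to list_metadata in place of SAMPLE.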

Alternatively, explode the oai files by right-clicking on them and
choosing explode metadata database. You will need to set the
document_field option. I can't remember off the top of my head what the
value should be. Take a look at the oai files and you should see which
field has the document name in it. For 2.82, it should be gi.Sourcedoc, I
think.
Choose the dc metadata set to explode into.
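
If there are many fields in the oai files, a small script can narrow
down which one holds the document name. The sketch below assumes the
oai file is well-formed XML; the sample record, the list of file
extensions, and the function names are all hypothetical, so adjust them
to what your files actually contain.

```python
import re
import xml.etree.ElementTree as ET

# Assumption: a document name ends in one of these common extensions.
FILENAME_RE = re.compile(r"\.(pdf|docx?|html?|txt|jpe?g|png)$", re.IGNORECASE)


def local_name(tag):
    """Strip an XML namespace such as '{http://...}title' down to 'title'."""
    return tag.rsplit("}", 1)[-1]


def candidate_document_fields(xml_text):
    """Return (field, value) pairs whose value looks like a file name."""
    root = ET.fromstring(xml_text)
    hits = []
    for elem in root.iter():
        text = (elem.text or "").strip()
        # Only leaf elements carry a single metadata value.
        if len(elem) == 0 and FILENAME_RE.search(text):
            hits.append((local_name(elem.tag), text))
    return hits


# Hypothetical record; a real oai file may be laid out differently.
SAMPLE = """<record>
  <title>An example record</title>
  <identifier>http://example.org/item/1</identifier>
  <gi.Sourcedoc>item1.pdf</gi.Sourcedoc>
</record>"""

if __name__ == "__main__":
    for field, value in candidate_document_fields(SAMPLE):
        print(f"{field}: {value}")
```

Whichever field name it reports for your own files is the likely value
for document_field.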

If exploding has gone correctly, you should see the metadata attached to
the documents in the Enrich panel.

Note that exploding deletes the oai file, so you may want to save a copy
first.
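
One way to save those copies in bulk is a short script like the one
below. It is a minimal sketch that only assumes the files end in .oai;
the function name and any directory paths you pass in are your own
choice, not something Greenstone defines.

```python
import shutil
from pathlib import Path


def backup_oai_files(source_dir, backup_dir):
    """Copy every *.oai file under source_dir into backup_dir.

    The sub-folder layout is preserved. Returns the list of copies made.
    """
    source = Path(source_dir)
    backup = Path(backup_dir)
    backup.mkdir(parents=True, exist_ok=True)
    copied = []
    # sorted() collects the full file list before any copying starts.
    for oai_file in sorted(source.rglob("*.oai")):
        target = backup / oai_file.relative_to(source)
        target.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(oai_file, target)
        copied.append(target)
    return copied
```

Call it with the folder that holds the oai files and a backup folder
outside the collection, so the copies are not picked up when the
collection is rebuilt.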

Regards,
Katherine
j.eustis@neu.edu wrote:
>
> I need some help with downloading documents over OAI.
>
> I tried to download several documents and their metadata over OAI and
> was unsuccessful. I followed the directions in the tutorial from the
> wiki
> (http://wiki.greenstone.org/wiki/gsdoc/tutorial/en/OAI_downloading.htm).
>
> -Click on the Download tab
> -Select OAI
> -Put in the URL (I was using URLs that I found from this website:
> http://www.openarchives.org/Register/BrowseSites)
> -Check the box "Get document"
> -Click Download
>
> When I added the documents to my new collection and tried to enrich
> them, there were only the documents and no metadata.
>
> Perhaps I am missing a step or am using the wrong URLs. Does anyone
> have any suggestions or helpful hints?
>
> Thank you,
> Jennifer
