Search in for of the words  

 

About this collection

This collection is based on records exported from a DSpace institutional repository. The source files can be downloaded as part of the Greenstone Tutorial package.

The documents in this collection have been produced by members of the Department of Computer Science, University of Waikato. The University of Waikato holds copyright. They may be distributed freely, without any restrictions.

How to recreate this collection

  1. Start the Greenstone Librarian Interface:

    Start → All Programs → Greenstone Digital Library Software → Greenstone Librarian Interface

    After a short pause a startup screen appears, and then after a slightly longer pause the main Greenstone Librarian Interface appears. (A command prompt is also opened in the background.)

  2. First, change to Library System Specialist (or Expert) mode (using File → Preferences), because you will need to change the order of plug-ins in the Design panel.

  3. Start a new collection called "DSpace Example" and fill out its fields appropriately. Leave the metadata set at Dublin Core, the default.

  4. Switch to the Design panel and select the Document Plugins section on the left-hand side. Remove TEXTPlug, HTMLPlug and EMAILPlug. Strictly speaking we do not need to remove these, however it reduces clutter.

  5. Now add DSpacePlug. Leave the plugin options at their defaults and press <OK>.

  6. Using the up and down arrows, Move the position of DSpacePlug to above GAPlug and below ZIPPlug.

  7. In the Gather panel, locate the folder sample_files\dspace\exported_docs. It contains five example items exported from a DSpace institutional repository. Copy them into your collection by dragging them over to the right-hand side of the panel.

  8. Build the collection and preview it to see the basic defaults exhibited by a DSpace collection.

If you browse by titles a-z, you will find 7 documents listed, though only 5 items were exported from DSpace. Two of the original items had alternative forms in their directory folder. DSpace plug-in options control what happens in such situations: the default is to treat them as separate Greenstone documents.

Below we use a plug-in option (first_inorder_ext) to fuse the alternative forms together. This option has the effect of treating documents with the same filename but different extensions as a single entity within a collection. One of the files is viewed as the primary document—it is indexed, and metadata is extracted from it if possible—while the others are handled as 'associated files.'

The first_inorder_ext option takes as its argument a list of file extensions (separated by commas): the first one in the list that matches becomes the primary document.

  1. Select DSpacePlug and click <Configure Plugin...>. Switch on its configuration option first_inorder_ext. Set its value to pdf,doc,rtf in the popup window that appears and press <OK>.

  2. Build and preview the collection.

There are now only 5 documents, because only one version of each document has been included—the primary version.

The DSpace exported files contain Dublin Core metadata for title and author (amongst other things).

Adding indexing and browsing capabilities to match DSpace's

  1. In the Design panel, select Search Indexes. Delete the ex.Title and ex.Source indexes, and add one for dc.Title called 'titles' and another for dc.Contributor called "authors".

  2. Staying within the Design panel, select Browsing Classifiers and delete both AZList classifiers (ex.Title and ex.Source). Add an AZList classifier for dc.Title and an AZCompactList classifier for dc.Contributor.

  3. Now select the Format Features section of the Design panel and replace the VList format statement with this:

    <td valign=top>[link][icon][/link]</td>
    <td valign=top>
      [srclink][srcicon][/srclink]
    </td>
    <td valign=top>
      [highlight]{Or}{[dc.Title],[ex.Title],Untitled}[/highlight]
      {If}{[ex.Source],<br><i>([ex.Source])</i>}
      {If}{[equivlink],<br>Also available as:[equivlink]}
    </td>

    You will find this text in the file format_tweak.txt in the dspace folder of sample_files, and you can copy and paste this. Remember to press <Replace Format> when finished.

  4. Also, let's add a format statement for the classifier based on dc.Contributor metadata. In the Choose Feature menu (under Format Features on the Design panel), select the item that says:

    CL2: AZCompactList -metadata dc.Contributor

  5. Leave VList as the Affected Component and edit the text in the HTML Format String box so that it looks like:

    <td valign=top>[link][icon][/link]</td>
    <td valign=top>
      [srclink][srcicon][/srclink]
    </td>
    <td valign=top>
      [highlight]
        {If}{[numleafdocs],([numleafdocs]) [ex.Title],[dc.Title]}
      [/highlight]
      {If}{[ex.Source],<br><i>([ex.Source])</i>}
      {If}{[equivlink],<br>Also available as:[equivlink]}
    </td>

    and click <Add Format>. This will display the number of documents for each bookshelf in the authors classifier.

    You will find this text in the file format_contributor.txt in the dspace folder of sample_files, and you can copy and paste this.

  6. Build collection once again and preview it.

There are still only 5 documents, but against some of the entries appears the line "Also available as," followed by icons that link to the alternative representations.

How to find information in the DSpace Example collection

There are 3 ways to find information in this collection:

  • search for particular words that appear in the text by clicking the Search button
  • browse documents by Title by clicking the Titles button
  • browse documents by Contributor by clicking the Contributors button