[greenstone-users] CDS/ISIS collection building

From John Rose
DateSat Mar 7 21:35:53 2009
Subject [greenstone-users] CDS/ISIS collection building
In-Reply-To (20090307005535-23E9C3F02D1-mail-alumni-caltech-edu)
Dear Renate,

The problem with the explode option for CDS/ISIS in version 2.81 was
discussed previously on the list, and a fix was given on 4 February:

********
1. Please go into your Greenstone installation folder's
gli/src/org/greenstone/gatherer/util
And make a copy of the file "RemoveContentBeforeRootElementXMLReader.java"
Rename this copy as RemoveContentBeforeRootElementXMLReader.java_orig

2. Download the updated file from:
http://svn.greenstone.org/gli/trunk/src/org/greenstone/gatherer/util/RemoveContentBeforeRootElementXMLReader.java

3. Put the downloaded file in gli/src/org/greenstone/gatherer/util to
replace the original.

4. Open a Windows DOS console or Linux xterm, go into your Greenstone
installation folder. Then change into your gli directory:
> cd gli

5. Now you're inside the GLI folder, compile up GLI by typing the
following in a Windows DOS prompt:
> makegli.bat
Or by typing the following in a Linux x-term:
> ./makegli.sh

6. Try to run GLI and explode that CDS/ISIS database again.
********
Please do try this fix and report if there are still problems. I'm
asking the Waikato team to confirm that this fix has been
incorporated in the nightly release snapshots, and also to inform
whether the problem is restricted to exploding with CDS/ISIS or might
occur when exploding other database formats as well.

Concerning the problem of errors when rebuilding an "as is" CDS/ISIS
collection (bibliographic data only without exploding), you first
said that it did not work when the comma default for the
subfield_separator parameter is set in the ISISPlugin but now you are
also implying that there is a problem with the default of <br> for
the entry_separator. These parameters are only used to format the
data in the built collection for screen output, to separate subfields
and occurrences of repeatable fields respectively. When I try to
build the CDS/ISIS example collection with 2.81, I get the error "The
bad argument occurred in plugin ISISPlugin." which goes away when I
unselect the subfield_separator parameter (not when I change its
value). This seems to be a bug, asking Waikato colleagues to check.
The collection builds with the default entry_separator, could you
please explain if you meant to say that there is also a problem with
this parameter?

Best regards, John


>From: Renate Morgenstern <rmorgenstern@unam.na>
>To: greenstone-users@list.scms.waikato.ac.nz
>Date: Fri, 06 Mar 2009 17:19:32 +0100
>Subject: [greenstone-users] CDS/ISIS collection building
>
>
>Hello,
>
>I was just updating an old collection configured and built with a
>previous version of Greenstone. I am NOT exploding my database. It
>is in the ISIS plugin where formerly I had defined the entry
>separator as <br>, which now gives the error and is does not build
>the collection. (an error has occurred which will prevent the
>collection being previewed)
>
>I also noticed that in the latest Greenstone version, the explode
>database option for an ISIS database is not working anymore. How can
>we now update existing databases?
>Regards
>Renate
>
>
>>Dear Renate,
>>
>>Follow the steps mentioned below.
>>
>>(1) drag .mst, .fdt and .xrf files into greenstone collection
>>(2) right click on .mst file and choose "Explode Metadata Database" then
>>the "Explode Parameters" window will be appeared
>>(3) Select Input_encoding as "dos_850 Codepage 850 (Latin 1)"
>>(4) Provide the field name for example FULLTEXT (which you have used in
>>WINISIS for pdf, word, mepg, jpg etc files) against "Document_Field
>>(5) Then explode the collection and build the collection
>>
>>Pl. see the following link for more info.
>>
>>http://wiki.greenstone.org/wiki/images/e/e9/WINISIS_to_GSDL_Guide.pdf
>>
>>Kind regards,
>>
>>Lavji Zala
>>MICA-KEIC
>>
>>
>> > Good day,
>> >
>> > I wanted to update an existing and working Greenstone collection with
>> > new data added in Winisis.
>> > I copied the .mst and .xrf file, but when rebuilding one gets an
>> > error message, and it just stops. In the ISIS plugin I then deleted
>> > the comma as the subfield delimiter, and then it works. Are the
>> > subfields separated by a comma by default?
>> > Thanks in advance.
>> > Renate
>> >
>> >
>> > Internal Virus Database is out of date.
>> > Checked by AVG - http://www.avg.com
>> > Version: 8.0.176 / Virus Database: 270.10.7/1892 - Release Date:
>> > 2009-01-13 20:04
>> >
>> > Mrs Renate Morgenstern
>> > University of Namibia Library
>> > Windhoek, Namibia
>> > Tel.: +264 61 2063869
>> > rmorgenstern@unam.na
>> >
>> > _______________________________________________
>> > greenstone-users mailing list
>> > greenstone-users@list.scms.waikato.ac.nz
>> > https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>> >
>
>Mrs Renate Morgenstern
>University of Namibia Library
>Windhoek, Namibia
>Tel.: +264 61 2063869
>rmorgenstern@unam.na