[greenstone-users] Re: Contents of greenstone-users digest Vol 49, Issue 28 and 31

From biblioteca@iedlapaz.edu.co
DateMon, 30 Apr 2007 00:08:54 -0300 (ART)
Subject [greenstone-users] Re: Contents of greenstone-users digest Vol 49, Issue 28 and 31
Hi friendly list, thanks for your time to explain to me your ideas. I
reply to you now (excuse me if i forget somebody)

1) reply to rajankila@hotmail.com
2) reply to suri@nic.in
3) reply to xiao@cs.waikato.ac.nz


1) in reply to: rajankila@hotmail.com

> ------------------------------
> Message: 5
> Date: Thu, 26 Apr 2007 17:36:24 +0530
> From: "Rajan" <rajankila@hotmail.com>
> Subject: Subject: [greenstone-users] CDS/ISIS into Greenstone
> To: <greenstone-users@list.scms.waikato.ac.nz>
> Message-ID: <BAY128-DAV74F40BC56278C6A401681B6480@phx.gbl>
> Content-Type: text/plain; charset="iso-8859-1"
> Dear Friend,
> Please export your CDS/ISIS data into .iso format.
> Again import it again into .mst format and then convert it into into
> Then you may be able to get the entire records in Greenstone. Please try
> Regards,
> K Rajasekharan
> Librarian , Kerala Institute of Local Administration
> Mulagunnathukavu, Thrissur -680581
> Ph 0487- 2204097 (O), 2201428 ( R)

I export de ISIS database from winisis to ISO and i import again, and use
these result to import into greenstone but the process now stop here:

GAPlug: processing HASH0140/972496a0/4f9cdda3/f162s333/87.dir/doc.xml
Stats (Creating index text;AUTORPERSONAL^all;EDICION^all;TITULO^all;)
Total bytes in collection: 1644962
Total bytes in text;AUTORPERSONAL^all;EDICION^all;TITULO^all;: 374237
mgpp_perf_hash_build : Unable to generate the perfect hash function. This
is probably because there aren't enough words

inverting the text (mgpp_passes -I2)
mgpp_passes : Unable to read in hash data for word dictionary
ArcPlug: processing /usr/local/gsdl/collect/bjfr/archives/archives.inf
GAPlug: processing HASH0140.dir/doc.xml
GAPlug: processing HASH0140/972496a0.dir/doc.xml
GAPlug: processing HASH0140/972496a0/4f9cdda3.dir/doc.xml
GAPlug: processing HASH0140/972496a0/4f9cdda3/f162s4.dir/doc.xml
GAPlug: processing HASH0140/972496a0/4f9cdda3/f162s5.dir/doc.xml
GAPlug: processing HASH0140/972496a0/4f9cdda3/f162s6.dir/doc.xml

what its mean?

if it is useful to catch the problem, here paste the result of my first
import.pl xxx -debug (before try with .iso):

tag=610 data=^aCo863.44^bV234
tag=860 data=^aA
tag=873 data=C1995
Database exception: xrf file, invalid statusXRFFile.cpp Line=259Database
exception: Error opening mf fileIsisDb.cpp Line=175</Metadata>
<Metadata name="FileFormat">CDS/ISIS</Metadata>
<Metadata name="Identifier">HASH7f0081d85d240b24216abbs172</Metadata>
<Content>&lt;table cellpadding=&quot;4&quot;
valign=top&gt;Visheda, Alvaro Valencia
PERSONAL&lt;/b&gt;&lt;/nobr&gt;&lt;/td&gt;&lt;td valign=top&gt;Valencia
Tovar, Alvaro, 1923-&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td
valign=top&gt;&lt;nobr&gt;&lt;b&gt;CIUDAD Y
EDITORIAL&lt;/b&gt;&lt;/nobr&gt;&lt;/td&gt;&lt;td valign=top&gt;Santafe de
Bogota, Planeta&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td
FISICA&lt;/b&gt;&lt;/nobr&gt;&lt;/td&gt;&lt;td valign=top&gt;491 p,
valign=top&gt;Co863.44, V234&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td

Import complete
* 172 documents were considered for processing
* 172 were processed and included in the collection

****************************************************************** 2) in
reply to suri@nic.in

> Message: 1
> Date: Wed, 25 Apr 2007 17:08:43 +0530 (IST)
> From: suri@nic.in
> Subject: [greenstone-users] Re CDS/ISIS Record Display
> To: greenstone-users@list.scms.waikato.ac.nz
> Message-ID: <34022.>
Content-Type: text/plain;charset=iso-8859-1
> While I was converting cds/isis data to GSDL, The same problem occured
> me. I realized cds/isis mst file may have contain some special character
which GSDL may not able to parse.
> I exported the whole database into iso2709 format. I checked the data
> found that in number of records, field separator and record separator
> something else. I replace it with # for field separator and ## for
> separator. You may please check this way. It may help.
> regards
> Surinder Kumar
> National Informatics Centre
> New Delhi
> suri@nic.in

Excuse me, but i see the iso file and do not see where/what are the
problem, is cryptic to me, i see numbers, ^Aa, spaces... but no have idea
to make something with that. (i see the iso but i do not know what find!!)

3) in reply to

> ------------------------------
> Message: 3
> Date: Fri, 27 Apr 2007 21:53:33 +1200
> From: xiao <xiao@cs.waikato.ac.nz>
> Subject: Re: Subject: [greenstone-users] CDS/ISIS into Greenstone To:
Rajan <rajankila@hotmail.com>
> Cc: greenstone-users@list.scms.waikato.ac.nz
> Message-ID:
> <c0a9e0f40704270253m559b9ad6jb8b4b4f4843edf0@mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
> Hi,
> You may want to take a look at the output message (GLI or command line)
> see why the rest of the records are rejected. If it's due to they are
> valid html, here is an article which gives step by step instructions on
publishing CDS/ISIS database records into GSDL:
> http://dlist.sir.arizona.edu/285/01/pisisc.pdf.
> Regards

i see the document and give me some ideas like try to convert in html, but
i think that need more time to eat this document.


thanks for your time. any idea about how can i solve it?

can i send to somebody the database to replicate the process?, i think
that i can not detect what kind of information is usefull to resolve the

(again...: sorry for my english)