[greenstone-users] Incomplete import of Winisis data to Greenstone - solved

From Ramon Sampang
Date Fri May 22 12:47:53 2009
Subject [greenstone-users] Incomplete import of Winisis data to Greenstone - solved
Thanks!

After exporting the data to ISO files in manageable batches and reimporting
them, we were able to isolate the records with errors. We are still looking
at the data to fix the errors, but I am happy to say that we were able to
import 4405 records out of 4700 into Greenstone.
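
The batch-and-reimport approach can be sketched as a repeated halving search. This is only an illustration, not the procedure actually used here: `imports_ok` is a hypothetical stand-in for "export this batch to an ISO file and check that Greenstone imports it", and the sketch assumes a batch fails exactly when it contains at least one bad record.

```python
from typing import Callable, List, Sequence


def find_bad_records(
    records: Sequence[str],
    imports_ok: Callable[[Sequence[str]], bool],
) -> List[int]:
    """Return the indices of records that make a batch fail to import.

    `imports_ok` is a hypothetical callback: it receives a batch of records
    and returns True if the whole batch imports cleanly.
    """
    bad: List[int] = []

    def search(lo: int, hi: int) -> None:
        if lo >= hi:
            return
        if imports_ok(records[lo:hi]):
            return  # the whole batch is clean, no need to look inside it
        if hi - lo == 1:
            bad.append(lo)  # a failing batch of one record: found a culprit
            return
        mid = (lo + hi) // 2
        search(lo, mid)   # check the first half of the failing batch
        search(mid, hi)   # then the second half

    search(0, len(records))
    return bad
```

With a few hundred bad records out of several thousand, this still needs many test imports, so in practice starting from fixed-size batches and halving only the failing ones may be more convenient.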

Strange characters in some of the records were causing the errors.
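
A quick way to look for such characters before importing is to scan each record for control characters. The sketch below is an assumption about what "strange" means (control characters other than tab and line breaks, plus the Unicode replacement character U+FFFD); the characters that actually caused the failures may have been different.

```python
import unicodedata
from typing import List, Tuple

# Whitespace control characters that are normal in text records.
ALLOWED_CONTROLS = {"\t", "\n", "\r"}


def suspicious_chars(text: str) -> List[Tuple[int, str]]:
    """Return (offset, character) pairs for characters worth inspecting."""
    found = []
    for offset, ch in enumerate(text):
        is_control = unicodedata.category(ch) == "Cc" and ch not in ALLOWED_CONTROLS
        if is_control or ch == "\ufffd":
            found.append((offset, ch))
    return found


def report(records: List[str]) -> List[int]:
    """Return the indices of records that contain suspicious characters."""
    return [i for i, rec in enumerate(records) if suspicious_chars(rec)]
```

Records flagged by `report` are candidates for manual cleanup in Winisis before the next export.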

On Mon, May 18, 2009 at 10:36 AM, Katherine Don <kjdon@cs.waikato.ac.nz> wrote:

> Hi
>
> Are the documents you get valid records? Or are they just chunks of
> multiple records?
> Do you need to set the split_exp option to ISISPlugin?
>
> -split_exp <regexp>   A perl regular expression to split input
>                       files into segments.
>                       Default: r? ----------r?
>
> Maybe it's not detecting the record boundaries properly?
>
> Have you tried setting GLI to expert mode to see if there are any errors
> there?
>
> Regards,
> Katherine
>
> Ramon Sampang wrote:
>
> We have been trying to import our Winisis data to Greenstone 2.81 running
> on Linux but we are having problems importing all the data. We have around
> 4,000+ records accessible in Winisis. But after creating a collection in
> Greenstone we only end up with 29 documents.
>
> We are already using the patched GLI.jar for 2.81, and we have already tried
> exploding the metadata, but we still get the same number of documents. There
> is no explicit error during the process.
>
> Could anyone help us figure out how to solve this problem?
>
> Thanks!
>
>
> ------------------------------
>
> _______________________________________________
> greenstone-users mailing list
> greenstone-users@list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>
>