RE: [greenstone-users] import pl failure

From Elizabeth Cass
Date Mon, 7 Aug 2006 14:50:45 +1200
Subject RE: [greenstone-users] import pl failure
In-Reply-To (44D69FEB-9040806-dlconsulting-co-nz)

Richard, I removed the -faillog option and the import was still not successful. I then specified a fail log file and it is still not working.




From: Richard Managh
Sent: Monday, August 07, 2006 2:06 PM
To: Elizabeth Cass
Cc: 'greenstone-users'
Subject: Re: [greenstone-users] import pl failure


Hi Libby,

The command has the -faillog option, but no fail log file specified. This could cause the command to be invalid. Try removing the -faillog option or specifying a file for the faillog to be written to.
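
A minimal sketch of how this could happen, assuming the option parser takes whatever token follows -faillog as its value. This is a toy parser written in Python for illustration only; it is not Greenstone's actual Perl code, and COLLECT_DIR is a placeholder for the collect directory path.

```python
# Toy option parser (an assumption for illustration, not Greenstone's code).
# Options listed in TAKES_VALUE consume the next token as their value.
TAKES_VALUE = {"-language", "-collectdir", "-OIDtype", "-faillog"}


def parse(tokens):
    """Split command-line tokens into (options, positional arguments)."""
    options, positionals = {}, []
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if tok in TAKES_VALUE:
            # The next token becomes the value, even if it was meant
            # to be the collection name.
            options[tok] = tokens[i + 1] if i + 1 < len(tokens) else None
            i += 2
        elif tok.startswith("-"):
            options[tok] = True  # simple flag such as -gli
            i += 1
        else:
            positionals.append(tok)
            i += 1
    return options, positionals


# Arguments in the same order as the failing command.
failing = ["-gli", "-language", "en", "-collectdir", "COLLECT_DIR",
           "-OIDtype", "hash", "-faillog", "paddle"]
opts, rest = parse(failing)
print(opts["-faillog"], rest)  # paddle []

# Same arguments with -faillog removed.
fixed = [t for t in failing if t != "-faillog"]
opts, rest = parse(fixed)
print(rest)  # ['paddle']
```

In the first case "paddle" is read as the fail log filename, so no collection name is left over. Under this assumption, that would be consistent with the error naming only the collect directory.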

Hope this helps,


DL Consulting
Greenstone Digital Library and Digitisation Specialists

Elizabeth Cass wrote:

I have experienced my first Greenstone error when rebuilding the collection and am at a loss as to what

The error message is below. The collection was last rebuilt successfully 21 July.

It states that the collection is invalid but I can see it?

Would appreciate any assistance.


Libby Cass





Command: E:Greenstone2.70wbinwindowsperlbinPerl.exe -S -gli -language en -collectdir E:Greenstone2.70wcollect -OIDtype hash -faillog paddle

Usage: [options] collection-name
options:
  -archivedir <string>    Where the converted material ends up.
  -collectdir <string>    Collection directory.
                          Default:
  -debug                  Print imported text to STDOUT.
  -faillog <string>       Fail log filename. This log receives the filenames of
                          any files which fail to be processed.
                          Default:
  -groupsize <int>        Number of import documents to group into one XML file.
                          Default: 1
  -gzip                   Use gzip to compress resulting xml documents (don't
                          forget to include ZIPPlug in your plugin list when
                          building from compressed documents).
  -importdir <string>     Where the original material lives.
  -keepold                Will not destroy the current contents of the archives
                          directory.
  -removeold              Will remove the old contents of the archives
                          directory.
  -language <string>      Language to display option descriptions in (eg.
                          'en_US' specifies American English). Requires
                          translations of the option descriptions to exist in
                          the perllib/strings_language-code.rb file.
  -maxdocs <int>          Maximum number of documents to import.
  -OIDtype <enum>         The method to use when generating unique identifiers
                          for each document.
                          hash: Hashes the contents of the file. Document
                            identifier will be the same every time the
                            collection is imported.
                          incremental: A simple document count that is
                            significantly faster than "hash". It is not
                            guaranteed to always assign the same identifier to a
                            given document though and does not allow further
                            documents to be added to existing xml archives.
                          assigned: Uses 'D' plus the value of dc.Identifier as
                            the document identifier. dc.Identifiers should be
                            unique. If no dc.Identifier is assigned to the
                            document, a hash id will be used instead.
                          dirname: Uses 'J' plus the parent directory name as the
                            identifier. This relies on there being only one
                            document per directory, and all directory names
                            being unique. E.g. import/b13as/h15ef/page.html will
                            get an identifier of Jh15ef.
  -out <string>           Filename or handle to print output status to.
                          Default: STDERR
  -saveas <enum>          This is to decide the archives format to be generated.
                          The default setting is to GA.
                          Default: GA
                          GA: Will generate Greenstone Archive format.
                          METS: Will generate METS format.
  -sortmeta <string>      Sort documents alphabetically by metadata for
                          building. Search results for boolean queries will be
                          displayed in this order. This will be disabled if
                          groupsize > 1.
  -removeprefix <regexp>  A prefix to ignore in metadata values when sorting.
                          Default:
  -removesuffix <regexp>  A suffix to ignore in metadata values when sorting.
                          Default:
  -statsfile <string>     Filename or handle to print import statistics to.
                          Default: STDERR
  -verbosity <int>        Controls the quantity of output. 0=none, 3=lots.
  -gli
  -xml                    Produces the information in an XML form, without
                          'pretty' comments but with much more detail.
Invalid collection (E:Greenstone2.70wcollect).
Command failed.


Libby Cass
Information Specialist
Pacific Regional Initiatives for the Delivery of Basic Education (PRIDE)
The University of the South Pacific, Suva, Fiji
Telephone: (679) 323 2787
Fax: (679) 3231532