Re: [greenstone-users] Problems building collections with server on Windows

From Michael Dewsnip
Date Fri, 09 Jun 2006 17:47:41 +1200
Subject Re: [greenstone-users] Problems building collections with server on Windows
In-Reply-To (4486B0FD-7080902-caboose-org-uk)
Hi Kevin,

It looks like the Greenstone archive file
(archives\HASH015b.dir\doc.xml) that was generated from the import
document is not well-formed XML -- probably an encoding issue. This
usually means there is a bug in the plugin -- what version of Greenstone
are you using?
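
A rough way to check the archive file locally, offered as a sketch only:
the Python below reads the file as raw bytes, reports the first byte that
is not valid UTF-8, shows the bytes around a given offset (the build log
reports byte 21149), and re-parses the file with Python's expat binding.
Perl's XML::Parser is also built on expat, so the reported position should
usually be similar, though that is an assumption rather than a guarantee.
The function names are made up for illustration and the path depends on
your install.

```python
from xml.parsers import expat


def first_invalid_utf8(data: bytes):
    """Return the byte offset of the first invalid UTF-8 sequence, or None."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        return err.start
    return None


def show_context(data: bytes, offset: int, width: int = 40) -> str:
    """Return a printable (escaped) view of the bytes surrounding `offset`."""
    start = max(0, offset - width)
    end = min(len(data), offset + width)
    return repr(data[start:end])


def xml_error_position(data: bytes):
    """Return (line, column) of the first expat parse error, or None if well-formed."""
    parser = expat.ParserCreate()
    try:
        parser.Parse(data, True)
    except expat.ExpatError as err:
        return err.lineno, err.offset
    return None


def report(path: str, offset: int = 21149) -> None:
    """Print a short diagnosis for the file at `path` (hypothetical helper)."""
    with open(path, "rb") as fh:
        data = fh.read()
    print("first invalid UTF-8 byte:", first_invalid_utf8(data))
    print("expat error position (line, column):", xml_error_position(data))
    print("bytes near offset", offset, ":", show_context(data, offset))
```

Calling report() with the full path to doc.xml prints the three results.
If the first invalid UTF-8 byte is at or just before the offset in the
log, an encoding problem in the converted document is the likely cause.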

Send me the import document off list and I'll try it here, if you like.

Regards,

Michael

Kevin O'Rourke wrote:

>I recently installed a local server on a Windows machine so that our
>library staff could work on collections without publishing them
>immediately to the internet.
>
>The install was very easy and I copied the collections across from our
>Linux server without any problems.
>
>However we can't build any collections on the Windows systems. I've
>included the GLI build log below, it doesn't make much sense to me.
>
>Does anyone have any ideas what might be going wrong? Is there any
>other information I need to provide?
>
>Kevin
>
>-----------------------------------------------------------------
>
>GLI build log:
>Command: C:\Program Files\Greenstone\bin\windows\perl\bin\Perl.exe -S
>C:\Program Files\Greenstone\bin\script\import.pl -gli -language en
>-collectdir C:\Program Files\Greenstone\collect -removeold -verbosity
>3 testcoll
>import.pl> Removing current contents of the archives directory...
>import.pl> Removing contents of the collection "tmp" directory...
>import.pl> RecPlug: getting directory C:\Program
>Files\Greenstone\collect\testcoll\import
>import.pl> RecPlug metadata recurring: curriculum
>import.pl> RecPlug: preparing metadata for curriculum
>import.pl> RecPlug recurring: curriculum
>import.pl> RecPlug: getting directory C:\Program
>Files\Greenstone\collect\testcoll\import\curriculum
>import.pl> RecPlug: found metadata in C:\Program
>Files\Greenstone\collect\testcoll\import\curriculum\metadata.xml
>import.pl> RecPlug metadata recurring: curriculum deverlopment.doc
>import.pl> RecPlug metadata recurring: metadata.xml
>import.pl> RecPlug: preparing metadata for curriculum deverlopment.doc
>import.pl> File "curriculum deverlopment.doc" matches filespec
>"curriculum deverlopment.doc"
>import.pl> RecPlug recurring: curriculum deverlopment.doc
>import.pl> Converting curriculumdeverlopment.doc to StructuredHTML format
>import.pl> BasPlug: WARNING: language could not be extracted from
>C:\Program Files\Greenstone\collect\testcoll\tmp\curriculumdeverlopment.html
>- defaulting to en
>import.pl> BasPlug: reading C:\Program
>Files\Greenstone\collect\testcoll\tmp\curriculumdeverlopment.html as
>(utf8,en)
>import.pl> StructuredHTMLPlug: processing C:\Program
>Files\Greenstone\collect\testcoll\tmp\curriculumdeverlopment.html
>import.pl> Passing on the HTMLPlug
>import.pl> HTMLPlug: processing C:\Program
>Files\Greenstone\collect\testcoll\tmp\curriculumdeverlopment.html
>import.pl> Adding associated Word document
>import.pl> *********************************************
>import.pl> Import complete
>import.pl> *********************************************
>import.pl> * 1 document was considered for processing
>import.pl> * 1 was processed and included in the collection
>import.pl> Command complete.
>import.pl> Extracting new metadata from archive files.
>import.pl> Archived metadata extraction complete.
>Command: C:\Program Files\Greenstone\bin\windows\perl\bin\Perl.exe -S
>C:\Program Files\Greenstone\bin\script\buildcol.pl -gli -language en
>-collectdir C:\Program Files\Greenstone\collect -removeold -verbosity
>3 testcoll
>buildcol.pl> *** creating the compressed text
>buildcol.pl> collecting text statistics
>buildcol.pl> ArcPlug: processing C:\Program
>Files\Greenstone\collect\testcoll\archives\archives.inf
>buildcol.pl> GAPlug: processing HASH015b.dir\doc.xml
>buildcol.pl> **** Error is:
>buildcol.pl> not well-formed at line 575, column 45, byte 21149 at
>C:/Program Files/Greenstone/bin/windows/perl/lib/XML/Parser.pm line
>168
>buildcol.pl> WARNING: No plugin could process HASH015b.dir\doc.xml
>buildcol.pl> Stats (Compressing text from section:text)
>buildcol.pl> Total bytes in collection: 0
>buildcol.pl> Total bytes in section:text: 0
>buildcol.pl> ***************
>buildcol.pl> WARNING: There is very little or no text to compress
>buildcol.pl> Was this your intention?
>buildcol.pl> ***************
>buildcol.pl> creating the compression dictionary
>buildcol.pl> compressing the text
>buildcol.pl> ArcPlug: processing C:\Program
>Files\Greenstone\collect\testcoll\archives\archives.inf
>buildcol.pl> GAPlug: processing HASH015b.dir\doc.xml
>buildcol.pl> **** Error is:
>buildcol.pl> not well-formed at line 575, column 45, byte 21149 at
>C:/Program Files/Greenstone/bin/windows/perl/lib/XML/Parser.pm line
>168
>buildcol.pl> WARNING: No plugin could process HASH015b.dir\doc.xml
>buildcol.pl> Stats (Compressing text from section:text)
>buildcol.pl> Total bytes in collection: 0
>buildcol.pl> Total bytes in section:text: 0
>buildcol.pl> ***************
>buildcol.pl> WARNING: There is very little or no text to compress
>buildcol.pl> Was this your intention?
>buildcol.pl> ***************
>buildcol.pl> *** building index document:text in subdirectory dte
>buildcol.pl> creating index dictionary
>buildcol.pl> ArcPlug: processing C:\Program
>Files\Greenstone\collect\testcoll\archives\archives.inf
>buildcol.pl> GAPlug: processing HASH015b.dir\doc.xml
>buildcol.pl> **** Error is:
>buildcol.pl> not well-formed at line 575, column 45, byte 21149 at
>C:/Program Files/Greenstone/bin/windows/perl/lib/XML/Parser.pm line
>168
>buildcol.pl> WARNING: No plugin could process HASH015b.dir\doc.xml
>buildcol.pl> ivf.pass1 : Error during done of "ivf.pass1"
>buildcol.pl> Stats (Creating index document:text)
>buildcol.pl> Total bytes in collection: 0
>buildcol.pl> Total bytes in document:text: 0
>buildcol.pl> ***************
>buildcol.pl> WARNING: There is very little or no text to process for
>document:text
>buildcol.pl> Was this your intention?
>buildcol.pl> ***************
>buildcol.pl> mgbuilder::build_index - Couldn't create index document:text
>buildcol.pl> *** building index document:Title in subdirectory dti
>buildcol.pl> creating index dictionary
>buildcol.pl> ArcPlug: processing C:\Program
>Files\Greenstone\collect\testcoll\archives\archives.inf
>buildcol.pl> GAPlug: processing HASH015b.dir\doc.xml
>buildcol.pl> **** Error is:
>buildcol.pl> not well-formed at line 575, column 45, byte 21149 at
>C:/Program Files/Greenstone/bin/windows/perl/lib/XML/Parser.pm line
>168
>buildcol.pl> WARNING: No plugin could process HASH015b.dir\doc.xml
>buildcol.pl> ivf.pass1 : Error during done of "ivf.pass1"
>buildcol.pl> Stats (Creating index document:Title)
>buildcol.pl> Total bytes in collection: 0
>buildcol.pl> Total bytes in document:Title: 0
>buildcol.pl> ***************
>buildcol.pl> WARNING: There is very little or no text to process for
>document:Title
>buildcol.pl> Was this your intention?
>buildcol.pl> ***************
>buildcol.pl> mgbuilder::build_index - Couldn't create index document:Title
>buildcol.pl> *** building index document:Source in subdirectory dso
>buildcol.pl> creating index dictionary
>buildcol.pl> ArcPlug: processing C:\Program
>Files\Greenstone\collect\testcoll\archives\archives.inf
>buildcol.pl> GAPlug: processing HASH015b.dir\doc.xml
>buildcol.pl> **** Error is:
>buildcol.pl> not well-formed at line 575, column 45, byte 21149 at
>C:/Program Files/Greenstone/bin/windows/perl/lib/XML/Parser.pm line
>168
>buildcol.pl> WARNING: No plugin could process HASH015b.dir\doc.xml
>buildcol.pl> ivf.pass1 : Error during done of "ivf.pass1"
>buildcol.pl> Stats (Creating index document:Source)
>buildcol.pl> Total bytes in collection: 0
>buildcol.pl> Total bytes in document:Source: 0
>buildcol.pl> ***************
>buildcol.pl> WARNING: There is very little or no text to process for
>document:Source
>buildcol.pl> Was this your intention?
>buildcol.pl> ***************
>buildcol.pl> mgbuilder::build_index - Couldn't create index document:Source
>buildcol.pl> *** creating the info database and processing associated files
>buildcol.pl> mgbuilder: warning bad collectionmeta option 'Source' - ignored
>buildcol.pl> mgbuilder: warning bad collectionmeta option 'text' - ignored
>buildcol.pl> mgbuilder: warning bad collectionmeta option 'document' -
>ignored
>buildcol.pl> mgbuilder: warning bad collectionmeta option 'Title' - ignored
>buildcol.pl> ArcPlug: processing C:\Program
>Files\Greenstone\collect\testcoll\archives\archives.inf
>buildcol.pl> GAPlug: processing HASH015b.dir\doc.xml
>buildcol.pl> **** Error is:
>buildcol.pl> not well-formed at line 575, column 45, byte 21149 at
>C:/Program Files/Greenstone/bin/windows/perl/lib/XML/Parser.pm line
>168
>buildcol.pl> WARNING: No plugin could process HASH015b.dir\doc.xml
>buildcol.pl> *** creating auxiliary files
>buildcol.pl> arcinfo::save_info couldn't write C:\Program
>Files\Greenstone\collect\testcoll\archives\HASH015b.dir\doc.xml\archives.inf
>buildcol.pl> Command failed.