Re: [greenstone-users] GS 2.74 : OAIplug + ImagePlug

From GB Polydoc
DateFri, 10 Aug 2007 17:45:29 +0200
Subject Re: [greenstone-users] GS 2.74 : OAIplug + ImagePlug
In-Reply-To (c0a9e0f40708091331y1a94cb72n9ef80831db684e0-mail-gmail-com)
Hi Xiao,

the complete message log is attached in this mail, and now the link to the two documents : Link

Regards,
Georges Braoudakis

xiao a écrit :
Hi Georges Braoudakis,

Could you please send me the complete message log for the import and build process in the GLI instead of just excerpt? My guess (before seeing the message) is that one image was imported directly and the same file was also brought in through the source url in the oai record. I don't think there is anything wrong with the combination of OAIplug + ImagePlug.

Cheers,
xiao

On 8/9/07, GB Polydoc <gbraouda@polydoc.net> wrote:
Hi xiao,

Yes, i've two HASHxxxx.dir directories in either the /collect/test1210/index/assoc and the /collect/test1210/archives directory. The archive.inf file contains also the two directorys.

For only one document this is not correct, OAIplug + ImagePlug are not working very well....!
Any suggestion ?

Regards,
Georges Braoudakis

xiao a écrit :
 

Hi Georges Braoudakis,

Go to your /collect/test1210/index/assoc directory, you must have two HASHxxxx.dir directories with each containing the same image file B010536201_P01_ILLA0172_P.jpg. That is the problem!

Cheers
xiao

--
Greenstone Digital Library
New Zealand




--
Greenstone Digital Library
New Zealand


<<attachment>>
Type: text/plain
Filename: build_log.1186760415095.txt

s
Commande: C:Program FilesGreenstonebinwindowsperlbinPerl.exe -S C:Program FilesGreenstonebinscriptimport.pl -gli -language fr -collectdir C:Program FilesGreenstonecollect -removeold test1212
import.pl> Suppression des contenus du répertoire d'archives…
import.pl> RecPlug: getting directory C:Program FilesGreenstonecollect est1212import
import.pl> ImagePlug processing B010536201_P01_ILLA0172_P.jpg
import.pl> OAIPlug: passing metadata on to B010536201_P01_ILLA0172_P.jpg
import.pl> ImagePlug processing B010536201_P01_ILLA0172_P.jpg
import.pl> *********************************************
import.pl> Fin de l'importation.
import.pl> *********************************************
import.pl> * 2 documents ont été pris en compte pour traitement
import.pl> * 2 ont été traités et intégrés dans la collection
import.pl> Commande exécutée.
import.pl> Extraction de nouvelles méta-données à partir des fichiers archives.
import.pl> Extraction de méta-données archivées terminée.
Commande: C:Program FilesGreenstonebinwindowsperlbinPerl.exe -S C:Program FilesGreenstonebinscriptbuildcol.pl -gli -language fr -collectdir C:Program FilesGreenstonecollect -removeold test1212
buildcol.pl> *** creating the compressed text
buildcol.pl> collecting text statistics (mgpp_passes -T1)
buildcol.pl> ArcPlug: traitement C:Program FilesGreenstonecollect est1212archivesarchives.inf
buildcol.pl> GAPlug: processing HASH7aab.dirdoc.xml
buildcol.pl> GAPlug: processing HASH019c.dirdoc.xml
buildcol.pl> Stats (Compressing text from text)
buildcol.pl> Total bytes in collection: 58
buildcol.pl> Total bytes in text: 58
buildcol.pl> creating the compression dictionary
buildcol.pl> mgpp_compression_dict.exe : Dictionary limit of 5120.00 Kb
buildcol.pl> mgpp_compression_dict.exe : Num words : 10 -> 11
buildcol.pl> mgpp_compression_dict.exe : Num non-words : 9 -> 10
buildcol.pl> mgpp_compression_dict.exe : Chars of words : 28 -> 28
buildcol.pl> mgpp_compression_dict.exe : Chars of non-words : 12 -> 12
buildcol.pl> mgpp_compression_dict.exe : Mem usage : 116 -> 124
buildcol.pl> mgpp_compression_dict.exe : Actual mem required : 99
buildcol.pl> compressing the text (mgpp_passes -T2)
buildcol.pl> ArcPlug: traitement C:Program FilesGreenstonecollect est1212archivesarchives.inf
buildcol.pl> GAPlug: processing HASH7aab.dirdoc.xml
buildcol.pl> GAPlug: processing HASH019c.dirdoc.xml
buildcol.pl> Stats (Compressing text from text)
buildcol.pl> Total bytes in collection: 58
buildcol.pl> Total bytes in text: 58
buildcol.pl> *** building index Title; in subdirectory idx
buildcol.pl> creating index dictionary (mgpp_passes -I1)
buildcol.pl> ArcPlug: traitement C:Program FilesGreenstonecollect est1212archivesarchives.inf
buildcol.pl> GAPlug: processing HASH7aab.dirdoc.xml
buildcol.pl> GAPlug: processing HASH019c.dirdoc.xml
buildcol.pl> ivf.pass1 : Inverted buffer size: 5242880 bytes
buildcol.pl> ivf.pass1 : Max memory needed for 1 chunk: 19 bytes
buildcol.pl> ivf.pass1 : Number of chunks written: 1
buildcol.pl> ivf.pass1 : Number of documents: 2
buildcol.pl> ivf.pass1 : Number of fragments: 17
buildcol.pl> ivf.pass1 : Number of words: 17
buildcol.pl> ivf.pass1 : Size of word dictionary: 16
buildcol.pl> ivf.pass1 : Size of tag dictionary: 3
buildcol.pl> Stats (Creating index Title;)
buildcol.pl> Total bytes in collection: 58
buildcol.pl> Total bytes in Title;: 103
buildcol.pl> inverting the text (mgpp_passes -I2)
buildcol.pl> ArcPlug: traitement C:Program FilesGreenstonecollect est1212archivesarchives.inf
buildcol.pl> GAPlug: processing HASH7aab.dirdoc.xml
buildcol.pl> GAPlug: processing HASH019c.dirdoc.xml
buildcol.pl> Stats (Creating index Title;)
buildcol.pl> Total bytes in collection: 58
buildcol.pl> Total bytes in Title;: 103
buildcol.pl> create the weights file
buildcol.pl> .L = 1.697857
buildcol.pl> U = 2.499178
buildcol.pl> B = 1.001511
buildcol.pl> .L = 1.697857
buildcol.pl> U = 2.499178
buildcol.pl> B = 1.001511
buildcol.pl> creating 'on-disk' stemmed dictionary
buildcol.pl> mgpp_invf_dict.exe : Max word block size = 0
buildcol.pl> mgpp_invf_dict.exe : Max tag block size = 0
buildcol.pl> mgpp_invf_dict.exe : Number of word blocks written = 1
buildcol.pl> mgpp_invf_dict.exe : Number of tag blocks written = 1
buildcol.pl> creating stem indexes
buildcol.pl> mgpp_stem_idx.exe : Num word stems = 16
buildcol.pl> mgpp_stem_idx.exe : Max stem block size = 0
buildcol.pl> mgpp_stem_idx.exe : Number of stem blocks written = 1
buildcol.pl> mgpp_stem_idx.exe : Num word stems = 16
buildcol.pl> mgpp_stem_idx.exe : Max stem block size = 0
buildcol.pl> mgpp_stem_idx.exe : Number of stem blocks written = 1
buildcol.pl> mgpp_stem_idx.exe : Num word stems = 16
buildcol.pl> mgpp_stem_idx.exe : Max stem block size = 0
buildcol.pl> mgpp_stem_idx.exe : Number of stem blocks written = 1
buildcol.pl> BuildDir: C:/Program Files/Greenstone/collect/test1212/building
buildcol.pl> *** creating the info database and processing associated files
buildcol.pl> ArcPlug: traitement C:Program FilesGreenstonecollect est1212archivesarchives.inf
buildcol.pl> GAPlug: processing HASH7aab.dirdoc.xml
buildcol.pl> GAPlug: processing HASH019c.dirdoc.xml
buildcol.pl> *** outputting information for classifier: CL1
buildcol.pl> *** outputting information for classifier: oai
buildcol.pl> *** creating auxiliary files
buildcol.pl> Commande exécutée.