[greenstone-users] Paged image collection problem

From Renate Morgenstern
DateFri Jan 28 21:00:39 2011
Subject [greenstone-users] Paged image collection problem
I have a collection of paged images with associated text for every page.
The collections show the pages and the text and it seems ok, however, I
get the following error message:

buildcol.pl> GreenstoneXMLPlugin: processing HASH01e3.dir/doc.xml
buildcol.pl> no first character found for "" - ""
buildcol.pl> AZCompactList: WARNING CLASSIFY.a has badly formatted title
()
buildcol.pl> *** outputting information for classifier: CL1
buildcol.pl> *** outputting information for classifier: CL2
buildcol.pl> *** outputting information for classifier: oai
buildcol.pl> *** creating auxiliary filesâ–ˇ
buildcol.pl> Command complete.


What does this mean?
Below is the collect.cfg.file.


Regards
Renate
===============================

buildtype mgpp


#indexes section:text
indexes
text,dc.Coverage,dc.Creator,dc.Date,dc.Description,dc.Subject,dc.Title,ex.Date,ex.FileFormat,ex.Image,ex.Number,ex.Plugin,ex.ScreenType,ex.Series,ex.Title,ex.Volume
defaultindex
text,dc.Description,dc.Title,Date,Encoding,FileFormat,FileSize,Identifier,Image,ImageHeight,ImageSize,ImageType,ImageWidth,Language,MaxImageHeight,MaxImageWidth,NoText,NumPages,Number,Plugin,Screen,ScreenHeight,ScreenType,ScreenWidth,Series,Source,SourceFile,Thumb,ThumbHeight,ThumbType,ThumbWidth,Title,Volume,assocfilepath,lastmodified,screenicon,srcicon,srclink,thumbicon


levels document


indexoptions casefold stem accentfold


defaultlevel document


subcollection years1940to1946 "dc.Title/1940-1946/I"


subcollection years1947to1950 "dc.Title/1947-1950/I"


indexsubcollections years1940to1946,years1947to1950 years1940to1946
years1947to1950


defaultsubcollection years1940to1946,years1947to1950


# We want the two types of paged documents to be treated differently:
paged
# and hierarchical. So include two PagedImgPlug plugins and modify the
process_exp.
plugin GreenstoneXMLPlugin -filename_encoding auto
plugin PagedImagePlugin -enable_cache -filename_encoding dos_850
-create_thumbnail true -process_exp xml.*.item$ -documenttype hierarchy
-create_screenview true -minimumsize 100
plugin PagedImagePlugin -enable_cache -create_thumbnail true
-screenviewtype png -thumbnailsize 100 -converttotype png -process_exp
.item$ -documenttype paged -create_screenview true -screenviewsize 400
-thumbnailtype gif -input_encoding dos_850 -default_encoding dos_850
plugin MetadataXMLPlugin
plugin ArchivesInfPlugin
plugin DirectoryPlugin


classify AZCompactList -metadata ex.Number -sort ex.Volume
-firstvalueonly -buttonname "Alphabetic Index Letter" -verbosity 2


classify AZCompactList -metadata ex.Volume -sort ex.Number
-firstvalueonly -buttonname Volume


# Format statements to display Series, Volume, Number and Date
information


format DocumentVList "<td valign=top>[link][icon][/link]</td>
<td valign=top>{If}{[Series],[Series] {If}{[Volume],Vol. [Volume]}
{If}{[Number],No. [Number]} {If}{[ex.Date],Date:
[format:ex.Date]},[highlight]{Or}{[Title],[PageNum]}[/highlight]}</td>"


format CL1VList "<td valign=top>[link][icon][/link]</td>
<td valign=top>{If}{[numleafdocs],[Title],{If}{[ex.Volume],Book
[ex.Volume]} - {If}{[ex.Number], Letter: [ex.Number]}
{If}{[ex.Date],([format:ex.Date])}</td>"


format SearchVList "<td valign=top>[link][icon][/link]</td>
<td valign=top>[parent(Top):Series] {If}{[parent(Top):Volume],Vol.
[parent(Top):Volume]} {If}{[parent(Top):Number],No.
[parent(Top):Number]} Page [Title]</td>"


format DateList "<td valign=top>[link][icon][/link]</td>
<td valign=top>[Series] {If}{[Volume],Vol. [Volume]} {If}{[Number],No.
[Number]}{If}{[Date},Date:[Date]}</td>"


format HList "[link][highlight][ex.Title][/highlight][/link]"


# We customise the document display, so use the extended options
format AllowExtendedOptions true


# We want to add in fullsize/preview/text buttons to switch between the
different versions of each page


format DocumentHeading "<center><table width=537>
<tr
valign=top><td>{Or}{[parent(Top):Series],[Series],[Volume]}</td></tr>
<tr valign=top><td><table><tr><td>
[DocumentButtonDetach][DocumentButtonHighlight]
{If}{preferences eq 'fullsize',{If}{[screenicon],_document:viewpreview_}
{If}{[Text] ne 'This document has no text. ',_document:viewtext_},
{If}{preferences eq 'preview',{If}{[srcicon],_document:viewfullsize_}
{If}{[Text] ne 'This document has no text. ',_document:viewtext_},
{If}{[srcicon],_document:viewfullsize_}
{If}{[screenicon],_document:viewpreview_}}}
</td></tr></table></td>
<td>[DocTOC]</td></tr></table></center>"


# Document text display changes based on the p argument - this is not
used
#normally for document display, so we can use it here to switch between
#fullsize/preview/text versions.
format DocumentText "<center><table width=537><tr><td>
{If}{preferences eq 'fullsize',[srcicon],
{If}{preferences eq 'preview',[screenicon],{If}{[Text] ne 'This
document has no text. ',[Text]}}}
</td></tr></table></center>"


format VList "<td valign="top">[link][icon][/link]</td>
<td
valign="top">[ex.srclink]{Or}{[ex.thumbicon],[ex.srcicon]}[ex./srclink]</td><td
valign="top">[highlight]
{Or}{[ex.Number],[ex.Title],Untitled}
[/highlight]{If}{[ex.Title]<i>[ex.Title]</i>}</td>"


format DocumentButtons "Detach|Highlight"


format SearchTypes "plain,form"


format CL0VList "<td valign="top">[link][icon][/link]</td>
<td
valign="top">[ex.srclink]{Or}{[ex.thumbicon],[ex.srcicon]}[ex./srclink]</td>
<td valign="top">[highlight] {Or} {Vol. [ex.Title],Vol.
[ex.Volume],Untitled}[/highlight]<br>{If}{[ex.Date],<br><i>([format:ex.Date])</i>}</td>"


# -- English strings --------------------
collectionmeta .section:text [l=en] "newspaper pages"


# -- English text -----------------------


collectionmeta .document [l=en] "document"
collectionmeta collectionname [l=en] "Nitzsche-Reiter Log book"
collectionmeta collectionextra [l=en] "Log book of photographs taken by
Ottilie Nitzsche-Reiter.<br>
Ottilie Nitzsche-Reiter kept track of the pictures she took by keeping
an index book in alphabetical order.
For example if she took a picture of Mr. Smuts, it was entered under S
in the log book, and then under the year it was taken.<br>
There are 2 volumes: Volume 1 from 1940 to 1946, and Volume 2 from 1947
to 1950.
One can browse by Volume or by the Index letter (alphabetical), or one
can search for any word.
"
collectionmeta .years1940to1946,years1947to1950 [l=en] "Both volumes"
collectionmeta .years1940to1946 [l=en] "1940-1946"
collectionmeta .years1947to1950 [l=en] "1947-1950"
collectionmeta
.text,dc.Coverage,dc.Creator,dc.Date,dc.Description,dc.Subject,dc.Title,Date,FileFormat,Image,Number,Plugin,ScreenType,Series,Title,Volume
[l=en]
"text,dc.Coverage,dc.Creator,dc.Date,dc.Description,dc.Subject,dc.Title,Date,FileFormat,Image,Number,Plugin,ScreenType,Series,Title,Volume"
collectionmeta depositormetadata [l=en]
"{"name":"dc.Title","label":"Title","tooltip":"dc.Title: A
name given to the resource.","type":"text"},
{"name":"dc.Creator","label":"Creator","tooltip":"dc.Creator:
An entity primarily responsible for making the content of the
resource.","type":"text"},
{"name":"dc.Description","label":"Description","tooltip":"dc.Description:
An account of the content of the resource.","type":"text"}"