[greenstone-users] one trouble pdf files

From Israel Abraham Flores Cruz
DateFri, 19 Jan 2007 15:02:42 +0000
Subject [greenstone-users] one trouble pdf files

Hi,my nameīs Israel Iīve just to come back, Before that nothing  thank you very much for your  help, a least, I can obtain an advanced search, when I translate  a cds/isis to GSDL , remove the bracketsī[  ]? from the fiels of the database i.e  AUTOR[10] , I change to AUTOR_10 , And I havenīt  any problem.

I have another problem when I want to make  a new collection with pdf files, Iīve used the tutorial ? Enhanced PDF handling? , there someone mention that :

If conversion to HTML doesn't produce the result you like, PDF documents can be converted to a series of images, one per page. This requires ImageMagick and Ghostscript to be installed.

At this time,  Iīve worked since the local library And I donīt  know  where I should  intall  Ghostscript, by default the path is : C:Archivos de programags, Iīve installed Greenstone in C:Archivos de programaGSDL, thereby I changed the path of Ghostscript  only  gs by GSDL, and no more, and I install ImageMagick-6.3.1-7-Q16-windows-dll.exe( my OS is Windows XP)  in the default path , I donīt know  if , Iīm OK , or not  because , when  I pass the mouse over  complex (option in the PDFPlug)

An advice  that I don?t  understand , if I use it or not  I  obtain the same result in the file of greenstone(Iīve use only , PDFPlug-convert_to html-complex) , A simple column with almost all information , any image  clear away of the files, and when  I  changed the option of PDFPlug  now to convert_to  option to one of the image types, e.g. pagedimg_jpg. & Use that advice : Switch off the use_sections option, as it is not used with image conversion.I get 6 document processed but in the file of Greenstone , it appear whithout  information except itīs title ,it pass whith all 6 documents,include  pdf05-notext .I add the collect.cfg on this message, thank you for your help again.




public               true


buildtype          mgpp


#indexes          document:text document:Title document:Source

indexes            text Title Source

defaultindex      text


levels    document


indexoptions     accentfold casefold stem


defaultlevel       document


plugin               GAPlug

plugin               PDFPlug -convert_to html -complex

plugin               ZIPPlug

plugin               TEXTPlug

plugin               HTMLPlug -smart_block

plugin               EMAILPlug

plugin               RTFPlug

plugin               WordPlug

plugin               PSPlug

plugin               ImagePlug

plugin               ISISPlug

plugin               NULPlug

plugin               MetadataXMLPlug

plugin               ArcPlug

plugin               RecPlug


classify AZList -metadata Title

classify AZList -metadata Source


format VList "<td valign="top">[link][icon][/link]</td>

<td valign="top">[ex.srclink]{Or}{[ex.thumbicon],[ex.srcicon]}[ex./srclink]</td>

<td valign="top">[highlight]




format HList "[link][highlight][ex.Title][/highlight][/link]"


format DocumentHeading "{Or}{[parent(Top):Title],[Title],untitled}<br>"


format DocumentText "[Text]"


format DocumentButtons "Detach|Highlight"


format SearchTypes "plain,form"


collectionmeta  collectionname [l=es] "Coleccion de PDFīs"

collectionmeta  .document:text [l=es] "text"

collectionmeta  .document:Title [l=es] "titles"

collectionmeta  .document:Source [l=es] "filenames"

collectionmeta  .text [l=es] "text"

collectionmeta  .Title [l=es] "titles"

collectionmeta  .Source [l=es] "filenames"

Be one of the first to try Windows Live Mail. Windows Live Mail.