From | Israel Abraham Flores Cruz |
Date | Fri, 19 Jan 2007 15:02:42 +0000 |
Subject | [greenstone-users] one trouble pdf files |
Hi,my nameīs Israel Iīve just to come back, Before that nothing thank you very much for your help, a least, I can obtain an advanced search, when I translate a cds/isis to GSDL , remove the bracketsī[ ]? from the fiels of the database i.e AUTOR[10] , I change to AUTOR_10 , And I havenīt any problem. I have another problem when I want to make a new collection with pdf files, Iīve used the tutorial ? Enhanced PDF handling? , there someone mention that : If conversion to HTML doesn't produce the result you like, PDF documents can be converted to a series of images, one per page. This requires ImageMagick and Ghostscript to be installed. At this time, Iīve worked since the local library And I donīt know where I should intall Ghostscript, by default the path is : C:\Archivos de programa\gs, Iīve installed Greenstone in C:\Archivos de programa\GSDL, thereby I changed the path of Ghostscript only gs by GSDL, and no more, and I install ImageMagick-6.3.1-7-Q16-windows-dll.exe( my OS is Windows XP) in the default path , I donīt know if , Iīm OK , or not because , when I pass the mouse over complex (option in the PDFPlug) An advice that I don?t understand , if I use it or not I obtain the same result in the file of greenstone(Iīve use only , PDFPlug-convert_to html-complex) , A simple column with almost all information , any image clear away of the files, and when I changed the option of PDFPlug now to convert_to option to one of the image types, e.g. pagedimg_jpg. & Use that advice : Switch off the use_sections option, as it is not used with image conversion.I get 6 document processed but in the file of Greenstone , it appear whithout information except itīs title ,it pass whith all 6 documents,include pdf05-notext .I add the collect.cfg on this message, thank you for your help again. /********************************************************************//// creator maintainer public true buildtype mgpp #indexes document:text document:Title document:Source indexes text Title Source defaultindex text levels document indexoptions accentfold casefold stem defaultlevel document plugin GAPlug plugin PDFPlug -convert_to html -complex plugin ZIPPlug plugin TEXTPlug plugin HTMLPlug -smart_block plugin EMAILPlug plugin RTFPlug plugin WordPlug plugin PSPlug plugin ImagePlug plugin ISISPlug plugin NULPlug plugin MetadataXMLPlug plugin ArcPlug plugin RecPlug classify AZList -metadata Title classify AZList -metadata Source format VList "<td valign=\"top\">[link][icon][/link]</td> <td valign=\"top\">[ex.srclink]{Or}{[ex.thumbicon],[ex.srcicon]}[ex./srclink]</td> <td valign=\"top\">[highlight] {Or}{[dc.Title],[exp.Title],[ex.Title],Untitled} [/highlight]{If}{[ex.Source],<br><i>([ex.Source])</i>}</td>" format HList "[link][highlight][ex.Title][/highlight][/link]" format DocumentHeading "{Or}{[parent(Top):Title],[Title],untitled}<br>" format DocumentText "[Text]" format DocumentButtons "Detach|Highlight" format SearchTypes "plain,form" collectionmeta collectionname [l=es] "Coleccion de PDFīs" collectionmeta .document:text [l=es] "text" collectionmeta .document:Title [l=es] "titles" collectionmeta .document:Source [l=es] "filenames" collectionmeta .text [l=es] "text" collectionmeta .Title [l=es] "titles" collectionmeta .Source [l=es] "filenames" Be one of the first to try Windows Live Mail. Windows Live Mail. |