From: John R. McPherson
Date: Tue, 21 Sep 2004 12:39:25 +1200
Subject: Re: [greenstone-users] Some PDF files are imported but not shown in the listings
On Mon, 2004-09-20 at 23:26, Eduardo Trápani wrote:
> I'm trying to set up my own library after having successfully followed most of the three-day course.
> I added three PDF files. I got no errors during creation and all three files are taken into account. But I only get two files when browsing by files or by titles.
> ex.Source is set all right, so how can it be that the file does not show up when listing by filename? (I created the library without any fancy options, just the Dublin Core metadata set and nothing else.)
> I then tried to "enrich" the file by adding a title, dc.Title. Then I added an index and the possibility to browse by dc.Title and the file still does not show!
> Is there any way to debug that? I'm checking it now and it happens with many PDF files.
The most common reason for a file to be imported but not included in the
This should show up in a build log - it will say something about not
> In case you care to take a look at the PDF file, it is at: http://www.unesco.org.uy/phi/libros/analisisMaule.pdf
For this file, pdftohtml correctly extracted and encoded the text from
We've recently added some code that tries to detect when any document