[greenstone-users] Re: Problem with PDF files

From Anupama of Greenstone Team
DateWed Mar 18 16:55:58 2009
Subject [greenstone-users] Re: Problem with PDF files
In-Reply-To (49BEBB2E-50808-iway-na)
Hi Renate,

Katherine will be back on Monday, but I noticed your question just now
and thought maybe the following might help in the meantime:

> How could one increase the Java heap stack?
There is the heap and the stack (which is two different purposes for
which memory is allocated).

As I understand, you are using the client-gli application.

To increase the available stack memory:
1. open up your client-gli script (*.bat or *.sh)
2. find the one or two occurrences of "-Xmx128M" in the file and change
the number to a larger value such as "-Xmx256M".

Increasing the available heap memory is not something I've tried but the
page http://www.codeguru.com/forum/showthread.php?threadid=352756
seems to indicate that this should be possible in Java. It states that
"Heap memory can be increased using -XX:PermSize=32m and
-XX:MaxPermSize=96m, again for minimum and maximum."

We can try that as follows:
1. Open up your client-gli script (*.bat or *.sh) again
2. Find the line(s) containing -Xmx128M again.
3. Instead of changing that value, insert the following -XX:PermSize=32m
-XX:MaxPermSize=96m (with at least one spage between each item).
4. You may want to try adjusting the numbers to something larger or smaller.

Regards,
Anupama


Renate Morgenstern wrote:
>
> Hi Katherine,
> I am resending, it seems that my message did not reach you.
> Regards
> Renate
> ========
>
> We are building the digital archives of a newspaper as from 1985. The
> old issues were scanned in and save into PDF, and then made searchable.
> As these are large files we wanted to for example drag over the complete
> folder to the server using the GLI client on a MAC client to a Linux
> server. It will then just hand and one has to kill the process. In the
> GLI java client error message about the Java hep stack which is not
> large enough. How could one increase the Java heap stack?
>
> We are using these to create the collection, but it seems that the
> pdftohtml can't handle these PDFs. See error message below. I assume
> that this is a PDFtoHTML issue?.
> Any advice you could give us?
> Thanks
> Renate
>> import.pl> WARNING: No plugin could process 19850906_searchable.pdf
>> import.pl> Converting 19850830_searchableImage.pdf to HTML format
>> import.pl> Error executing pdftohtml.pl
>> import.pl> pdftohtml error log:
>> import.pl> Error: PDF version 1.6 -- xpdf supports version 1.4
>> (continuing anyway)
>> import.pl> Error (0): PDF file is damaged - attempting to reconstruct
>> xref table...
>> import.pl> Error: Couldn't find trailer dictionary
>> import.pl> Error: Couldn't read xref table
>> import.pl> Could not convert 19850830_searchableImage.pdf to HTML format
>> import.pl> Error: PDF version 1.6 -- xpdf supports version 1.4
>> (continuing anyway)
>> import.pl> Error (0): PDF file is damaged - attempting to reconstruct
>> xref table...
>> import.pl> Error: Couldn't find trailer dictionary
>> import.pl> Error: Couldn't read xref table
>> import.pl> []
>> RESOURCE CENTRE
>> Sally Harper
>> sally@namibian.com.na <mailto:sally@namibian.com.na>; T. 061 279 633
>> *
>>
>> *
>>
>>
>>
>>
>>
> Mrs Renate Morgenstern
> University of Namibia Library
> Windhoek, Namibia
> Tel.: +264 61 2063869
> rmorgenstern@unam.na
>
>
> --
> Renate Morgenstern
> P O Box 30664, WIndhoek, Namibia
> Tel/Fax: 242124
> Email: rmorgenstern@iway.na
>
>
> ------------------------------------------------------------------------
>
>
> Internal Virus Database is out of date.
> Checked by AVG - http://www.avg.com
> Version: 8.0.176 / Virus Database: 270.10.7/1892 - Release Date: 2009-01-13 20:04
>
>
>
>
>
> ------------------------------------------------------------------------
>
>
> Internal Virus Database is out of date.
> Checked by AVG - http://www.avg.com
> Version: 8.0.176 / Virus Database: 270.10.7/1892 - Release Date: 2009-01-13 20:04
>