RE: [greenstone-devel] greenstone with wvware on Cygwin

From Emanuel Dejanu / Simple Words
DateMon, 27 Sep 2004 09:16:29 +0300
Subject RE: [greenstone-devel] greenstone with wvware on Cygwin
In-Reply-To (87E364511228FB4185BC4E7856693D90C63931-aspcbr-mail-cbr-aspect-com-au)
We use an old rtf parser (open source, but I do not have the source :-) ).
This give use good results. If you will like it I can send it to
you by e-mail (~2mb). The only problem is with ident and languages.
This is an old version of the current "Logictran" (this is not free, nor
open source).

Another solution is OpenOffice (look in list achives, somebody send a macro
for it).

Best regards,

Emanuel Dejanu

-----Original Message-----
From: greenstone-devel-bounces@list.scms.waikato.ac.nz
[mailto:greenstone-devel-bounces@list.scms.waikato.ac.nz] On Behalf Of Bram
Van_Oosterhout
Sent: Friday, September 24, 2004 2:47 AM
To: 'greenstone-devel@list.scms.waikato.ac.nz.'
Subject: [greenstone-devel] greenstone with wvware on Cygwin

> Dear All,
> I installed greenstone 2.51 on a DELL Inspiron 7000 (Pentium II, 333
> Mhz,
> 127 Mbyte of ram).
>
> As my collections have a lot of Word documents, I rely on wvware to do
> the parsing. The standard installation (with wv 0.7.4) gave me a
> success rate of 40%. 7 out of a sample of 11 documents failed conversion.
>
> I recently found that there are updates for wvware on
> http://sourceforge.net/projects/wvware/
> I installed wv 1.0.2. It required some fiddling, but nothing too hard.
>
> The good news is that I now parse 90%. Only 1 out of the sample of 11
> fails (with Signal 11)
>
> The bad news is that the conversion is very slow. The 10 documents
> took 9 hours to convert to HTML. They are all a similar format.
> Tables, graphics and text. 3 to 5 pages.
>
> I intend to move to a faster processor and a Linux implementation
> later in the year. I expect that to be much faster.
>
> In the mean time, is there anyone with better experience with respect
> to the cygwin greenstone/wvware implementation?
>
> regards....
>
> Bram van Oosterhout | KAZ Technology Services (incorporating Aspect
> Computing) | Ph: +612 6247 7677 | Fx: +612 6249 1620 |
>


"Legal Notice - confidentiality, viruses and opinions - This email and any
files transmitted with it are confidential and solely for the intended
recipient. If you are not the intended recipient please notify the sender
immediately; delete this email from your system; and do not read,
distribute, print, store or copy this email or take any action in reliance
on its contents. KAZ Group Limited ACN 002 124 405 and its related bodies
corporate KAZ Technology Services accept no liability for any damage caused
by any virus transmitted with this email. Any views or opinions contained
in this email are solely those of the author and do not represent those of
KAZ unless expressly otherwise stated."


_______________________________________________
greenstone-devel mailing list
greenstone-devel@list.scms.waikato.ac.nz
https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-devel