| Hi Doug,
Often this wouldn't be possible, because if the converter can't get any
HTML out of a file, no text can be extracted -- and there won't be a
Greenstone document created. However, all plugins that inherit from
ConvertToPlug have the ability to extract simple text strings from a
document -- these may not be high quality, but they are usually better
than nothing. You can turn this on with the "-use_strings" option to
WordPlug, whose description is "If set, a simple strings function will be
called to extract text if the conversion utility fails."
Hope this helps,
Michael
Doug Carter wrote:
> Hi all,
>
> Occasionally, we get complex documents that cause vwWare to crash,
> which in turn fails to process the given Word document. Is there a way
> to configure the WordPlug so that even if it can't convert the document
> into HTML, it will still process (and include) the source document?
>
> TIA,
>
> Doug Carter
> Mercy Corps
>
> _______________________________________________
> greenstone-devel mailing list
> greenstone-devel@list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-devel |