[greenstone-users] Re: Windows Scripting

From John Rowe
DateThu, 3 May 2007 09:19:47 +1200
Subject [greenstone-users] Re: Windows Scripting
In-Reply-To (20070502T084724Z-D309000B0000-nelligan-ca)
If you turn off the windows scripting option Greenstone will use the
package wvware which comes with it. This *should* extract all the
text from word files. However, since the Microsoft Word format is a
reverse engineered binary format so Microsoft Word should usually do
a more faithful reproduction of the document.
In my experience wvware does a very good job of extracting the text
however. I suggest you test it on a few documents and use the
extraction mechanism that does the best job for you.


Cheers,
John Rowe
Business Development Manager

DL Consulting
Greenstone Digital Library and Digitisation Specialists
johnr@dlconsulting.com
www.dlconsulting.com


On 3/05/2007, at 12:47 AM, Devon Cinnamon wrote:

> Does that mean that if I turn off windows scripting I may have some > problems with the word documents not matching the greenstone
> documents? Does the Windows Scripting option affect the full-text
> search?
>
> The reason I want to turn it off is that when a document contains
> the Microsoft Word style of "Heading 1" Greenstone takes the first
> text to have "Heading 1" in it and makes it the Title of the
> document in Greenstone. I have given all my documents Titles
> already using the Word properties Title section. If I turn off the > windows scripting then Greenstone ignores the Heading 1 style from
> the Word documents but I was wondering what I lost by turning off
> the windows scripting option in Greenstone.
>
> I'm currently using Windows XP, Microsoft Office 2000 and
> Greenstone 2.71 if that helps.
>
> Thank you for the help
> Devon
>
> ---------------------------------------------------------------------- > ----------------------------
> Hey Devon,
>
> The scripting option only works on a Microsoft Windows machine with
> Microsoft Word installed. It uses Visual Basic scripting to extract
> the text instead of using the normal command line software. The end
> result for you is that it's probably more compatible with newer
> Microsoft Word documents and probably does a more faithful
> reproduction of the document.
>
>
> Cheers,
> John Rowe
>
> DL Consulting
> Greenstone Digital Library and Digitisation Specialists
> contact@dlconsulting.com
> www.dlconsulting.com
>
>
> On 2/05/2007, at 7:01 AM, Devon Cinnamon wrote:
>
> > What does the windows scripting option do exactly under the Word
> > Plug-in?
> >
> > Thank you
>
> Confidentiality Note
>
> This message is intended only for the use of the individual or entity
> to which it is addressed, and may contain information that is
> privileged, confidential and exempt from disclosure under applicable
> law. If the reader of this message is not the intended recipient, or
> the employee or agent responsible for delivering the message to the
> intended recipient, you are hereby notified that any dissemination,
> distribution or copying of this communication is strictly prohibited.
> If you have received this communication in error, please notify us
> immediately. Thank you.
>
>
> AVIS - Courriel confidentiel Ce courriel est transmis au destinataire
> pour ses propres fins. Il pourrait contenir des renseignements
> confidentiels ou soumis au secret professionnel de l'avocat. Si vous
> n'êtes pas le véritable destinataire, ou son/sa mandataire, il est
> strictement interdit de diffuser ce courriel, les renseignements qu'il
> contient ou les documents qui lui sont joints. Si vous avez reçu ce
> courriel par erreur, veuillez en aviser l'expéditeur immédiatement et
> veuillez le supprimer sans le lire, l'imprimer, le sauvegarder ou le
> diffuser. Merci de votre aimable collaboration.