Re: [greenstone-users] Document Management with Greenstone

From Michael Dewsnip
DateThu, 16 Sep 2004 16:01:53 +1200
Subject Re: [greenstone-users] Document Management with Greenstone
In-Reply-To (85F883EB8B41D61181C80002A541DB9602847250-valcartierex-drdc-rddc-gc-ca)
Dear Marc,

Greenstone has two main parts: collection building, and collection serving.
Collection building is done with a combination of Perl scripts (to control the
building processes and deal with the different document formats) and C/C++
programs (MG/MGPP, for indexing the documents). Collection serving is done by a
C++ CGI program.

I can only suggest you download Greenstone and have a play around with it, and
look at the source code. It's all open source, so take any bits you find
useful.

Apart from that, I don't think I can help you much, except to say you might
want to look in particular at "The Collector", a web-based tool for building
Greenstone collections that uses the Perl building code from within the C++
CGI program.

Good luck!

Michael

"Morin, Marc-Andrâ–¡" wrote:

> Micheal,
>
> Your answer is really appreciated!
>
> Do you know if I could make a web service in C++ over the greenstone
> library?
> Is there some functionalities that can be just called by using Perl scripts,
> or everything can be accessible from the C++ libraries?
>
> Thanks in advance for your short but helpful comments!
>
> Regards,
>
> Marc
>
> > -----Original Message-----
> > From: Michael Dewsnip [mailto:mdewsnip@cs.waikato.ac.nz]
> > Sent: Friday, September 10, 2004 12:53 AM
> > To: Morin@drenet.dnd.ca; Marc-Andrâ–¡
> > Cc: 'greenstone-users@list.scms.waikato.ac.nz'
> > Subject: Re: [greenstone-users] Document Management with Greenstone
> >
> >
> > Hi Marc,
> >
> > Greenstone can do most of this, but is mainly C++ and Perl.
> > We don't really
> > have good libraries either -- a lot is done with command-line
> > Perl scripts
> > which call other executables. I think all the functionality
> > is there, but it
> > isn't very accessible, and certainly not as a Java library.
> >
> > Sorry I couldn't be more helpful,
> >
> > All the best,
> >
> > Michael
> >
> >
> >
> >
> > > I'm searching for a java library that could help me to:
> > >
> > > 1. Alert me when a document into a collection is removed,
> > added or modifed.
> > >
> > > 2. Convert any kind of documents (PDF, HTML, mails, XML,
> > Microsoft Office
> > > documents such as Word, etc.) in plain text format.
> > >
> > > 3. Index the plain text format in output
> > >
> > > Can Greenstone do one or more of these tasks by using one
> > of its libraries?
> > >
> > > Thanks in advance for your help.
> > >
> > > regards,
> > >
> > > -- Marc
> > >
> > > _______________________________________________
> > > greenstone-users mailing list
> > > greenstone-users@list.scms.waikato.ac.nz
> > > https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
> >