Sorry about the slow reply to this.
I think you've nailed it with your last sentence. Probably the easiest way of
doing what you want would be to convert your bibtex records into metadata.xml
files, and make sure the metadata is associated with the correct PDF file (ie.
the <FileName> element specifies the name of the PDF file). If you already have
metadata manually assigned to your PDF files, you would need to merge it with
the bibtex data. Once you've done this, you just build the collection as normal,
but only on the PDF files (you no longer need the bibtex files).
It should be fairly simple to write a small program to convert bibtex into
metadata.xml. Let me know if you need any help with this.
Ben Liesfeld wrote:
> Hi there,
> I found this in the mailinglist archives from January:
> "Dollberg, Donald D." <firstname.lastname@example.org> wrote:
> > [...] I want to create a collection of
> > journal articles written in my group and use Greenstone simply to create
> > a bibliography that is linked to the actual article in PDF format. I
> > have the bibliography in bibtex format and the articles will be scanned
> > and converted to PDF. I am not interested in indexing the actual
> > article just in the display of the article.
> In my ongoing project I'd like to go a bit further than Donald
> Dollberg: I'd like to create a collection that makes fulltext search in those
> pdf files possible and, of course, form search in the bibtex entries
> as well.
> I read the available docs about greenstone and I know how to build a
> collection from bibtex bibliographies and how to build a pdf library.
> I suppose I have to write a new import plugin to make a collection of
> both? What would be the best (and probably shortest) approach? I'd
> probably start converting the bibtex entries to xml metadata which
> RecPlug can handle.
> Thx for any hints.
> Ben Liesfeld
> greenstone-users mailing list