[greenstone-users] associate_tail_re

From Katherine Don
DateMon Apr 12 10:12:43 2010
Subject [greenstone-users] associate_tail_re
In-Reply-To (4BC0B47F-1080803-farpan-net)
Hi Michael

I am not sure if we have used this to associate files with the same
extension. It was designed for eg processing a doc file and associating
a pdf/image etc.

I'll have to try it out and see if its easy to get it to work with files
with same extension. One tricky point will be telling greenstone which
file to process. In your case, it will probably process both doc files
as individual files. will probably need to use process_exp too.

One other thing to note, associate_ext was designed to associate
different file types of the same content. The main file gets processed
(text and metadata extracted) while the other files are not processed in
any way other than to link them to the main one.
If you have text in two of the files that you want extracted (and then
indexed) then associate_ext is not the way to go.
I guess with musical scores you are not looking to extract text?

Regards,
Katherine

Michael Silver wrote:
> Hello,
>
> I am trying to get associate_tail_re to work with a collection I'm
> building using Greenstone 2.83. I'm hoping to be able to associate
> sets of files into something like the compound item used in CONTENTdm.
> For ease of testing, I'm using four files,
>
> Title.doc
> Title-part2.doc
> Title-part2.txt
>
> If I use associate_ext, I can successfully associate Title.doc with
> Title.pdf and Title-part2.txt. What I want is to associate Title.doc
> with Title-part2.doc.
>
> If I enter -part2.doc into associate_tail_re, no association is
> created. If I enter -part2\..* the PDF and TXT files are associated.
> I've tried various combinations of regular expressions and filenames,
> but I have not been able to get the result I'm looking for.
>
> According to what I've read in tickets in trac.greenstone.org and in
> the mailing list archives, associate_tail_re should be able to do
> this, but I've been unable to locate any details or examples. Can
> anyone point me in the right direction, or provide some examples?
>
> Any suggestions or pointers are welcome. I've tried RTFM, but I'm
> obviously looking in the wrong manuals!:-)Thank you!
>
> Michael
>
> P.S. If you're interested, the actual case (instead of testing) is to
> combine musical scores with instrumental parts, e.g.,
>
> SongTitle.pdf (score)
> SongTitleTrpts.pdf (trumpet parts)
> SongTitleTbones.pdf (trombones)
> SongTitleSax.pdf (saxes)
> SongTitleRhythm.pdf (rhythm section)
>
> If I could get past the above problem, I should be able to create a
> regex to match (Trpts|Tbones|Sax|Rhythm).pdf. I hope.
>
> _______________________________________________
> greenstone-users mailing list
> greenstone-users@list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>