[greenstone-users] Newbie doubts

From Emiliano Marmonti
DateTue, 08 Jul 2003 02:08:56 -0300
Subject [greenstone-users] Newbie doubts
Hello All

I'm newbie with GreenStone, I'm using gsdl 2.4 and I'm testing with a
little collection of Word Documents under Win'32-Apache web platform. I have
some doubts using the classifiers, indexes, macros and so on and will be
very gratefully if anybody could answer me...

I've 4 doc's grouped 2 in separte folders. I've written a metadata.xml file
like this:

<DirectoryMetadata>
<FileSet>
<FileName>informes gestion</FileName>
<Description>
<Metadata name="Area">Informatica</Metadata>
<Metadata name="Creator">Marmonti, Emiliano</Metadata>
<Metadata name="Subject" mode="accumulate">UNLZ</Metadata>
<Metadata name="Subject" mode="accumulate">Informes</Metadata>
<Metadata name="Subject" mode="accumulate">Reuniones</Metadata>
<Metadata name="Subject" mode="accumulate">Procesos</Metadata>
<Metadata name="Title">Informes Tecnicos realizados</Metadata>
<Metadata name="Date">2001</Metadata>
</Description>
</FileSet>
...
</DirectoryMetadata>

Also I've changed collect.cfg to this:
-------------------------------------------------------
creator emarmonti@siu.edu.ar
maintainer emarmonti@siu.edu.ar
public true

indexes document:text document:Title document:Creator
defaultindex document:text

plugin ZIPPlug
plugin GAPlug
plugin TEXTPlug
plugin HTMLPlug
plugin EMAILPlug
plugin PDFPlug
plugin RTFPlug
plugin WordPlug
plugin PSPlug
plugin ArcPlug
plugin RecPlug -use_metadata_files


classify AZList -metadata Title -buttonname Titulo
classify AZCompactList -metadata Creator -buttonname Autores
classify Hierarchy -hfile area.txt -metadata Area -sort Title
-buttonname Area
classify Hierarchy -hfile sub.txt -metadata Subject -sort Title
-buttonname Descriptores
classify DateList -buttonname Fecha
classify AZCompactList -metadata Subject -buttonname Materias
-mingroup 1 -mincompact 5 -minnesting 7 -maxcompact 10

format CL2Vlist "<td valign=top>[link][icon][/link]</td>
<td
valign=top>{If}{[Creator],<br>Autores:[Creator]}{If}{[Source],<br>[srclink][srcicon][/srclink][Source]}</small></i>
</td>"

format DateList "<td valign=top>[link][icon][/link]</td>
<td valign=top><b>[Title]</b> - <i>{Or}{[Creator],[Editor]}</i></td>"

format DocumentText "<p><b>Creador:</b>[Creator]</p>
<p><b>Archivo
Original:</b>[srclink][srcicon][/srclink]<small>&nbsp;&nbsp;[Source]</small></p>
<table width=90% align=center>
<tr bgcolor=#DDDDEE>
<td align=center><font face=Arial size=2>[Text]</font></td></tr></table>"
-------------------------------------------------

My first problem is about AZList. It doesn't present any HList containing
A,B,C,...Z links. It simply presents CL2VList but not any hyperlink guide.
I've read the docs and looked for the examples and it seems to be
single...I've seen that using AZCompactList the documents are presented
grouped by the -metadata field but either could see A,B,C or any letter for
guiding the navigation. Also, I've used but no understand the meaning of (
-mingroup 1 -mincompact 5 -minnesting 7 -maxcompact 10 ) AZCompactList's
options.

My second question is about DateList, perhaps my xml has no the entire date,
but is presented like a VList classified by year, and the DateList format
but I want to present it in a HList with year's hyperlinks like some
examples that I've seen.

Third, Hierarchy classifiers, I wish to present the number of items that I
have in the collection for each category, for example for Subject classifier
I obtain a first node that perhaps has 4 documents or perhaps has 3 internal
nodes that each contain 2 documents. I wish to tell to the user the number
of documents that each category has. I've tried using [numleafdocs] but it
seems to be a metadata field, I need a calculated field, is there a macro or
a way to do so?

Four, I will not have to treat with death collections, two or tree times for
week I will have to add documents to the collection. Simply I must to add
keepold option for retain the old documents when I import and build
collections? Perhaps will change the way of access to the collection many
times, allways I've to re-build the collection or only re-build indexes?
I've seen that If I follow the recommended method of moving the content of
building and carrying into index I could not re-build again.

Five, I've seen that Word plug-in sometimes work well, sometimes not, it
depends of the document. I will recommend to convert to a "light" HTML and
after some touches (and Section metadata adding), adding to Greenstone, I
mean the [Source] 'll be HTML. But the finall user perhaps will request the
doc file (or pdf, or ps). How could I add an "attached" file or how could I
instruct Greenstone to add it automatically?

Six (and last), I wish to export the metadata of all the documents inside
the collection and posibly to add a separate OAI server for enabling
searching (I'm trying to put NTLD thesis using Greenstone for internal
institutional use and another free OAI server for external searching). Is
there an automated way to export all the docs.xml? I've seen the OAI plug-in
but is for accessing from Greenstone to an OAI repository, I will need the
inverse...

Thanks a lot in advance for your time and sorry by the length of this mail.
Emiliano Marmonti

_________________________________________________________________
Charla con tus amigos en l□□nea mediante MSN Messenger:
http://messenger.yupimsn.com/