From | Phan Vo Minh Thang |
Date | Tue, 1 Oct 2002 16:32:48 +0700 |
Subject | my 8 questions about GreenStone system |
Dear All,
The following questions came up while I was using the GreenStone 2.38
system in a Windows environment. Please kindly help me solve them. I
always appreciate your help.
1. About searching methods.
Does GreenStone version 2.38 support ranked searching? If yes, how can I
switch between ranked searching and boolean searching?
2. About remote handling.
Is it possible to add new documents to the collection remotely? Once, I tried
to specify the input files from the collector wizard via the "file:/" option at a
remote computer (let's say computer B), but it always looks for the files on the
computer where the collection server is installed (let's say computer A)! Can I
manage a collection and specify input files located on a remote computer
(different from A)? Or do I have to set up an FTP server? Is that necessary?
4. About metadata.
In the GreenStone Developer Guide document, pages 19-20, it is said that
Dublin Core metadata is used for defining metadata types and that the metadata
is stored with the document. But I can't see any metadata fields except the
Title one. How should I modify my config file to get the full metadata
information? Right now, I declare my plugins without any arguments.
6. About splitting documents into sections and searching on those
section indices.
I have some MSWord documents. As we know, we can use wvWare to convert them
into HTML format, and then HTMLPlug will convert the output into GML format.
How can I split each document into sections based on a hierarchical structure
that is common to all the documents? I want to browse the documents in a
hierarchical style, like some of the demo collections.
Let's suppose that the source documents are edited based on a given MSWord
template. That means we can mark up the content of the document, is that
right? The output of wvWare will then be something like
..<p> <div name="MSWordTemplateNameComeoutHere" .....><p> Marked text by applied MSWord template tag<p></p></div>..
So I think we should modify HTMLPlug or write another plugin to split that
output into the desired hierarchical structure. Is this idea feasible? If yes,
where should I start? Do you have any better solution?
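To make the splitting idea in question 6 concrete, here is a minimal sketch of it. It is written in Python purely for illustration; it is not Greenstone code and does not use the HTMLPlug API. It assumes that every section in the wvWare output is wrapped in a <div name="..."> element as in the sample line above. The names DivSectionSplitter and split_sections, and the style names in the usage example, are hypothetical.

```python
from html.parser import HTMLParser


class DivSectionSplitter(HTMLParser):
    """Collect the text inside each <div name="..."> as one section.

    Assumption: sections are marked by <div> elements carrying a "name"
    attribute. Divs nested inside an open section are folded into it.
    """

    def __init__(self):
        super().__init__()
        self.sections = []   # finished (name, text) pairs, in document order
        self._name = None    # name of the section currently open, if any
        self._depth = 0      # div nesting depth inside the open section
        self._chunks = []    # text pieces collected for the open section

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        if self._name is None:
            name = dict(attrs).get("name")
            if name is not None:
                # A named div opens a new section.
                self._name, self._depth, self._chunks = name, 1, []
        else:
            # A div nested inside the open section.
            self._depth += 1

    def handle_endtag(self, tag):
        if tag != "div" or self._name is None:
            return
        self._depth -= 1
        if self._depth == 0:
            # Normalise whitespace and close the section.
            text = " ".join("".join(self._chunks).split())
            self.sections.append((self._name, text))
            self._name = None

    def handle_data(self, data):
        if self._name is not None:
            self._chunks.append(data)


def split_sections(html):
    """Return a list of (div name, section text) pairs found in html."""
    parser = DivSectionSplitter()
    parser.feed(html)
    parser.close()
    return parser.sections
```

For example, calling split_sections('<div name="Heading1"><p>Introduction</p></div>') gives one pair whose name is "Heading1" and whose text is "Introduction". A real plugin would still have to map the template names onto hierarchy levels, which this sketch does not attempt.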
5. About assigning values to metadata.
The Dublin Core information for each MSWord document is available in the
document's own properties (MSWord supports that), so how can we reuse that
information? I looked at the wvWare output file. Only the Title field is
available. So what can the solution be?
7. About Date format.
I am mainly interested in the Date information and want to classify documents
by Date. How can I add DateClassifier to my collection? Can you show me step by
step how to do it? I'm sure that it will be useful for other people. I tried to
add a format string to the collection config, but the Date tag doesn't appear
in the navigation bar.
8. About import directories
Is the import directory a temporary directory? After building the collection,
is it deleted? The reason I ask is that I couldn't see the import directory in
my collection. Instead, the system stores my input files in the tmp build*
directory! So if I remove the tbuild* in the tmp directory, does the system
still work properly? Sometimes when I build the collection again, document
content disappears. Is this the result of removing files in the tmp directory?
Thank you very much.
Best regards,
Minh Thang. |