|From||Phan Vo Minh Thang|
|Date||Tue, 1 Oct 2002 16:32:48 +0700|
|Subject||my 8 questions about GreenStone system|
These following questions are raised while I'm using GreenStone 2.38 system on Window environment. Please kindly help me to solve them. I always appreciate your help.
1.About searching methods.
Does GreenStone version 2.38 support ranked searching ? If yes, how could I do to switch between ranked searching and boolean searching ?
2.About remote handling.
Is it possible to add new documents to the collection remotely?. Once, I tried to specify the input file from collector wizard via "file:/" option at a remote computer (let say computer B), but it always looks for the files from the computer that installed collection server (let say computer A)! Can I handle collection, specify the input files located in the remote computer (different from A). Or I have to create a ftp server ? Is it necessary ?
4 About metadata.
In the GreenStone Developer Guide document page 19-20, it is said that the Dublin Core metadata was used for defining metadata types and the meta data was store with document. But I can't see any other metadata fields accept the Title one. How should I modify my config file to get the full metadata information. Right now,I declare my plugins without any arguments.
6. About the splitting documents into sections and searching on those sections indices.
I have some MSWord documents. As we know that we can use wvWare to convert them into HTML format and then HTMLPlug will convert the output to be GML format. How can I split each document into sections base on a common hierarchical structure for all the documents. Because I want to browse the documents in hierarchical style like some demo collections.
Let suppose that the source documents are edited base on a given MS template, it means that we can mark the content of the document, is it right ? and the out put of wvWare will be some thing like
..<p> <div name="MSWordTemplateNameComeoutHere" .....><p> Marked text by applied MSWord template tag<p></p></div>..
So I think , we should modify the HTMLPlug or write another one to split those output into the wished hierarchical structure, is my thoughts possible ? If yes what should I start form ? Do you have any better solution ?
5. About assigning the value for metadata.
You can see that the Dublin Core information for each document in MSWord format is available inside the properties information of themselves (MSWord supports that) so how can we reuse that information. I looked at wvWare output file. Only Title field is available. So what can be the solution !
7. About Date format.
I major about the Date information, and want to classify document on Date. How can I add DateClassifier into my collection. Can you show me step by step to do it. I'm sure that it will be useful for other people. I tried to add format string to the collection config but the Date tag doesn't appear in the navigation bar.
8. About import directories
Is import directory is a temporary directory ? After building the collection, is it deleted ? The reason I ask so because I couldn't see the import directory in my collection. But the system store my input file in the tmp build* directory ! So if I remove the tbuild* in the tmp directory, does the system work properly ? Some time when I build the collection again, document content disappears. Is is the result of remove files in tmp directory.
Thank you very much.