[greenstone-users] HTML Tags

From Diego Spano
DateWed, 9 Mar 2005 17:17:00 -0300
Subject [greenstone-users] HTML Tags
Hi List, I want to add my own tags to html files, so Greenstone can index them. I create a sample html file like this:
 
 
************************* HTML File ************************
<html>
<head>
<title>Archivo Digital</title>
<META name="Subject" content="Jorge Lopez">
</head>
<body>
    Esta es una prueba de importacion de html
</body>
</html>
************************* HTML File ************************
 
and the collect.cfg is:
 
 
************************* Collect.cfg ************************
indexes        document:text document:Subject
defaultindex   document:text
 
plugin         ZIPPlug
plugin         GAPlug
plugin         TEXTPlug
plugin         HTMLPlug
plugin         EMAILPlug
plugin         PDFPlug
plugin         RTFPlug
plugin         WordPlug -metadata_fields Subject
plugin         PSPlug
plugin         ArcPlug
plugin         RecPlug
 

classify       AZList -metadata Subject
 
collectionmeta collectionname    "prueba"
collectionmeta iconcollection    ""
collectionmeta collectionextra   ""
collectionmeta .document:text    "texto"
collectionmeta .document:Subject  "autor"
************************* Collect.cfg ************************
 
There is no way to process this document. Import process doesn´t extract Subject tag from the file. Is there any mistake in html file format?
 
Thanks in advance.
 
Diego Spano