| From | Diego Spano |
| Date | Wed, 9 Mar 2005 17:17:00 -0300 |
| Subject | [greenstone-users] HTML Tags |

Hi List,

I want to add my own tags to HTML files so that Greenstone can index them. I created a sample HTML file like this:
************************* HTML File ************************
<html>
<head>
<title>Archivo Digital</title>
<META name="Subject" content="Jorge Lopez">
</head>
<body>
Esta es una prueba de importacion de html
</body>
</html>
************************* HTML File ************************

And the collect.cfg is:
************************* Collect.cfg ************************
indexes document:text document:Subject
defaultindex document:text
plugin ZIPPlug
plugin GAPlug
plugin TEXTPlug
plugin HTMLPlug
plugin EMAILPlug
plugin PDFPlug
plugin RTFPlug
plugin WordPlug -metadata_fields Subject
plugin PSPlug
plugin ArcPlug
plugin RecPlug
classify AZList -metadata Subject
collectionmeta collectionname "prueba"
collectionmeta iconcollection ""
collectionmeta collectionextra ""
collectionmeta .document:text "texto"
collectionmeta .document:Subject "autor"
************************* Collect.cfg ************************

There seems to be no way to process this document. The import process doesn't extract the Subject tag from the file. Is there any mistake in the HTML file format?
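
To rule out the HTML file itself, the META tag can be checked outside Greenstone. The following is only a minimal sketch using Python's standard `html.parser` module; it is not part of Greenstone and says nothing about how HTMLPlug reads the file. The names `MetaCollector`, `extract_meta` and `SAMPLE` are made up for this illustration.

```python
from html.parser import HTMLParser


class MetaCollector(HTMLParser):
    """Collect name/content pairs from <meta> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names,
        # so <META name=...> arrives here as tag "meta".
        if tag == "meta":
            attr = dict(attrs)
            if "name" in attr and "content" in attr:
                self.meta[attr["name"]] = attr["content"]


def extract_meta(html_text):
    """Return a dict of all <meta name=... content=...> pairs in the text."""
    parser = MetaCollector()
    parser.feed(html_text)
    parser.close()
    return parser.meta


# The sample file from this message, pasted in as a string.
SAMPLE = """<html>
<head>
<title>Archivo Digital</title>
<META name="Subject" content="Jorge Lopez">
</head>
<body>
Esta es una prueba de importacion de html
</body>
</html>"""

print(extract_meta(SAMPLE))  # prints {'Subject': 'Jorge Lopez'}
```

If a generic parser finds the Subject pair like this, the file format is probably not the problem, and the next place to look would be the plugin options in collect.cfg.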
Thanks in advance.
Diego Spano