[greenstone-users] Lucene indexer

From Diego Spano
DateWed Dec 12 09:18:04 2007
Subject [greenstone-users] Lucene indexer
H i List, when I want to build indexes, I get the following error:


Starting to index <xml doc on stdin>
[ Doc: 9parse error:
org.xml.sax.SAXParseException: not a name start character: "U+26"
at gnu.xml.stream.SAXParser.parse(libgcj.so.7rh)
at javax.xml.parsers.SAXParser.parse(libgcj.so.7rh)
at org.greenstone.LuceneWrapper.Indexer.index(Indexer.java:117)
at org.greenstone.LuceneWrapper.IndexXML.indexFile(IndexXML.java:65)
at
org.greenstone.LuceneWrapper.GS2LuceneIndexer.main(GS2LuceneIndexer.java:110
)
Caused by: javax.xml.stream.XMLStreamException: not a name start character:
"U+26"
at gnu.xml.stream.XMLParser.error(libgcj.so.7rh)
at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.7rh)
at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.7rh)
at gnu.xml.stream.XMLParser.readCharData(libgcj.so.7rh)
at gnu.xml.stream.XMLParser.next(libgcj.so.7rh)
at gnu.xml.stream.XMLParser.hasNext(libgcj.so.7rh)
at gnu.xml.stream.SAXParser.parse(libgcj.so.7rh)
...4 more

This happens for many documents. Any help?. GS version is 2.74 running on
Centos5.

TIA

Diego Spano
-------------- next part --------------
An HTML attachment was scrubbed...
URL: https://list.scms.waikato.ac.nz/mailman/private/greenstone-users/attachments/20071211/582ac488/attachment.html