[greenstone-users] Lucene indexer

From Qiu
DateWed Dec 12 11:24:35 2007
Subject [greenstone-users] Lucene indexer
In-Reply-To (00c201c83c33$71dff340$7c3401c8-diegos)
Hi Diego

It seems the XML parser went wrong. The link below will help you fix the
problem:
http://wiki.greenstone.org/wiki/index.php/Building_Greenstone_collections#How_do_I_fix_XML::Parser_errors

Regards
Quan
Diego Spano wrote:
> H i List, when I want to build indexes, I get the following error:
>
> Starting to index <xml doc on stdin>
> [ Doc: 9parse error:
> org.xml.sax.SAXParseException: not a name start character: "U+26"
> at gnu.xml.stream.SAXParser.parse(libgcj.so.7rh)
> at javax.xml.parsers.SAXParser.parse(libgcj.so.7rh)
> at org.greenstone.LuceneWrapper.Indexer.index(Indexer.java:117)
> at org.greenstone.LuceneWrapper.IndexXML.indexFile(IndexXML.java:65)
> at
> org.greenstone.LuceneWrapper.GS2LuceneIndexer.main(GS2LuceneIndexer.java:110)
> Caused by: javax.xml.stream.XMLStreamException: not a name start
> character: "U+26"
> at gnu.xml.stream.XMLParser.error(libgcj.so.7rh)
> at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.7rh)
> at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.7rh)
> at gnu.xml.stream.XMLParser.readCharData(libgcj.so.7rh)
> at gnu.xml.stream.XMLParser.next(libgcj.so.7rh)
> at gnu.xml.stream.XMLParser.hasNext(libgcj.so.7rh)
> at gnu.xml.stream.SAXParser.parse(libgcj.so.7rh)
> ...4 more
>
> This happens for many documents. Any help?. GS version is 2.74 running
> on Centos5.
>
> TIA
>
> Diego Spano
> ------------------------------------------------------------------------
>
> _______________________________________________
> greenstone-users mailing list
> greenstone-users@list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>