[greenstone-users] Exception building with Lucene

From Sullivan,Mark
DateWed Feb 25 22:18:04 2009
Subject [greenstone-users] Exception building with Lucene
All,

Maybe I should make this a bit more general. Are there any additional installs or path configurations that I need to do with Greenstone 2.81 to be able to build with Lucene? The MGPP builds are failing due to the size of two of our collections. We are hoping that switching to Lucene will allow us to continue using Greenstone as our base DLMS.

Many thanks for anyone's experiences with this.

Mark Sullivan
Systems Programmer
University of Florida Libraries
http://www.uflib.ufl.edu/ufdc/


________________________________
From: Sullivan,Mark
Sent: Wednesday, January 14, 2009 10:07 AM
To: 'Diego Spano'; greenstone-users@list.scms.waikato.ac.nz
Subject: RE: [greenstone-users] Exception building with Lucene

Okay, we switched to Sun's version of Java, but are getting roughly the same thing. A bit more details on line numbers, etc.. now though. (Error message below).

Were there changes to the doc.xml GSA file format or anything? The current ones are compliant with Greenstone 2.60.

GreenstoneXMLPlugin: processing UF/07/04/04/82/00002/doc.xml
GreenstoneXMLPlugin: processing UF/07/04/04/82/00003/doc.xml
GreenstoneXMLPlugin: processing UF/07/04/04/82/00004/doc.xml
GreenstoneXMLPlugin: processing UF/07/03/04/88/00001/doc.xml
GreenstoneXMLPlugin: processing UF/07/03/04/89/00001/doc.xml
GreenstoneXMLPlugin: processing UF/07/03/04/89/00002/doc.xml
GreenstoneXMLPlugin: processing UF/07/06/04/81/00001/doc.xml
Stats ( Compressing text from text)
Total bytes in collection: 1664131658
Total bytes in text: 1687795670

*** building index text;internal.Author;internal.Title;internal.Citation;allfields;dc.Subject;ufdc.Spatial;ufdc.BibID;ufdc.VID;ufdc.SubCollection;ufdc.SourceCode;ufdc.HoldingCode....; at level Doc in subdirectory didx

Creating index dictionary (lucene_passes -I1)
Cmd: perl -S "/opt/gsdl281/bin/script/lucene_passes.pl" -removeold index Doc "/opt/gsdl281/collect/dloclucene/building" "didx"
ArchivesInfPlugin: processing /opt/gsdl281/collect/dloclucene/archives/archives.inf

-removeold set
Monitoring for input!
GreenstoneXMLPlugin: processing CA/00/00/00/05/00001/doc.xml
Exception in threadn "main" java.lang.NoClassDefFoundError: org/greenstone/LuceneWrapper/GS2LuceneIndexer
Caused by: java.lang.ClassNotFoundException: org.greenstone.LuceneWrapper.GS2LuceneIndexer not at java.net.URLCLassLoader.findClass(libgcj.so.7rh)
at java.net.URLClassLoader$1.run(URLClassLoader.java:217)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:205)
at java.lang.ClassLoader.loadClass(ClassLoader.java:323)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:294)
at java.lang.ClassLoader.loadClass(ClassLoader.java:268)
at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:336

Could not find the main class: org.greenstone.LuceneWrapper.GS2LuceneIndexer. Program will exit.

GreenstoneXMLPlugin: processing CA/00/00/00/05/00002/doc.xml
GreenstoneXMLPlugin: processing CA/00/00/00/05/00003/doc.xml
GreenstoneXMLPlugin: processing CA/00/00/00/05/00004/doc.xml


Many thanks again for any help!

Mark

________________________________
From: Diego Spano [mailto:dspano@orsna.gov.ar]
Sent: Tuesday, January 13, 2009 12:31 PM
To: Sullivan,Mark; greenstone-users@list.scms.waikato.ac.nz
Subject: RE: [greenstone-users] Exception building with Lucene

Mark,

It looks like you're using the GNU version of Java . Please try downloading the Sun version of Java
and see if this works better.

Diego Spano

De: greenstone-users-bounces@list.scms.waikato.ac.nz [mailto:greenstone-users-bounces@list.scms.waikato.ac.nz] En nombre de Sullivan,Mark
Enviado el: martes, 13 de enero de 2009 14:37
Para: greenstone-users@list.scms.waikato.ac.nz
Asunto: [greenstone-users] Exception building with Lucene

I recently upgraded to Greenstone 2.81 and I am attempting build collections with Lucene for the first time. I am currently getting the error below and wonder if anyone can help me figure out what is going on. It appears to go through the first pass just fine, but then hits a problem.

GreenstoneXMLPlugin: processing UF/07/04/04/82/00002/doc.xml
GreenstoneXMLPlugin: processing UF/07/04/04/82/00003/doc.xml
GreenstoneXMLPlugin: processing UF/07/04/04/82/00004/doc.xml
GreenstoneXMLPlugin: processing UF/07/03/04/88/00001/doc.xml
GreenstoneXMLPlugin: processing UF/07/03/04/89/00001/doc.xml
GreenstoneXMLPlugin: processing UF/07/03/04/89/00002/doc.xml
GreenstoneXMLPlugin: processing UF/07/06/04/81/00001/doc.xml
Stats ( Compressing text from text)
Total bytes in collection: 1664131658
Total bytes in text: 1687795670

*** building index text;internal.Author;internal.Title;internal.Citation;allfields;dc.Subject;ufdc.Spatial;ufdc.BibID;ufdc.VID;ufdc.SubCollection;ufdc.SourceCode;ufdc.HoldingCode....; at level Doc in subdirectory didx

Creating index dictionary (lucene_passes -I1)
Cmd: perl -S "/opt/gsdl281/bin/script/lucene_passes.pl" -removeold index Doc "/opt/gsdl281/collect/dloclucene/building" "didx"
ArchivesInfPlugin: processing /opt/gsdl281/collect/dloclucene/archives/archives.inf

-removeold set
Monitoring for input!
Exception in thread "main" GreenstoneXMLPlugin: processing CA/00/00/00/05/00001/doc.xml
Java.lang.NoClassDefFoundError: org.greenstone.LuceneWrapper.GS2LuceneIndedexer
at gnu.java.lang.MainThread.run (libgcj.so.7rh)
Caused by: java.lang.ClassNotFoundException: org.greenstone.LuceneWrapper.GS2LuceneIndexer not found in gnu.gcj.runtime.SystemClassLoader{ urls=[], parent=gnu.gcj.runtime.ExtensionClassLoader{urls=null[], parent=null }}
at java.net.URLCLassLoader.findClass(libgcj.so.7rh)
at gnu.gcj.runtime.SystemClassLoader.findClass(libgcj.so.7rh)
at java.lang.ClassLoader.loadClass(libgcj.so.7rh)
at java.lang.ClassLoader.loadClass(libgcj.so.7rh)
at gnu.java.lang.MainThread.run(libgcj.so.7rh)

GreenstoneXMLPlugin: processing CA/00/00/00/05/00002/doc.xml
GreenstoneXMLPlugin: processing CA/00/00/00/05/00003/doc.xml
GreenstoneXMLPlugin: processing CA/00/00/00/05/00004/doc.xml


And then the process aborts.

In addition, I am using Greenstone just as the searching mechanism, so the users never really see the items in Greenstone. I have configured the collection to return the item as XML imbedded in the HTML, but no page images. Do I really need the following plug ins?

plugin GAPlug
plugin NULPlug
plugin PagedImgPlug
plugin ArcPlug
plugin MetadataXMLPlug

http://www.uflib.ufl.edu/ufdc/

Many thanks for any help you can provide.

Mark Sullivan
Systems Programmer
Digital Library Center
University of Florida Libraries
(352) 273-2900 (w)
(352) 682-9692 (c)

-------------- next part --------------
An HTML attachment was scrubbed...
URL: https://list.scms.waikato.ac.nz/mailman/private/greenstone-users/attachments/20090115/aba62407/attachment-0001.html