[greenstone-users] RE: I´ve a trouble with the advanced search in v 2.72 ,when I use the spanish language

From Israel Abraham Flores Cruz
DateSat, 6 Jan 2007 01:18:39 +0000
Subject [greenstone-users] RE: I´ve a trouble with the advanced search in v 2.72 ,when I use the spanish language
 

Hi! ,my name´s Israel,  I´ve been working with gsdl v2.72, I need to  exchange a  both of databases  from CDS/ISIS to gsdl, the language that I use is the Spanish, when I make   a new collection , with a cds/isis  database that no  contain brackets”[ ]” in the labels of field , for instance FONDO: , SECCION: ,LEGAJO:  etc. , I can´t get  an advanced search in fact , I obtain this on  the GLI  :

 

orden  : /opt/greenstone/bin/script/import.pl -gli -language es -collectdir /opt/greenstone/collect/ -removeold ahssa

import.pl> Borrando el contenido actual del directorio archives...

import.pl> RecPlug: getting directory /opt/greenstone/collect/ahssa/import

import.pl> SplitPlug found 100 documents in /opt/greenstone/collect/ahssa/import/AHSSA.mst

import.pl> segment 1 -

import.pl> IsisPlug: processing AHSSA.mst

import.pl> segment 2 –

.

.  (here more lines , like above)

 

import.pl> IsisPlug: processing AHSSA.mst

import.pl> segment 100 -

import.pl> IsisPlug: processing AHSSA.mst

import.pl> *********************************************

import.pl> La importación ha sido completada

import.pl> *********************************************

import.pl> * 100 de los documentos fueron considerados a efecto de ser procesados

import.pl> * 100 se procesaron e incluidos en la colección

import.pl> Orden completada. 

import.pl> Extrayendo nuevos metadatos de los ficheros. 

import.pl> Extracción completa de metadatos archivados.

orden  : /opt/greenstone/bin/script/buildcol.pl -gli -language es -collectdir /opt/greenstone/collect/ -removeold ahssa

buildcol.pl> *** creating the compressed text

buildcol.pl>     collecting text statistics (mgpp_passes -T1)

buildcol.pl> ArcPlug: procesando /opt/greenstone/collect/ahssa/archives/archives.inf

buildcol.pl> GAPlug: processing HASH495a.dir/doc.xml

.

.  (here more lines , like above)

.

buildcol.pl> GAPlug: processing HASH495a/3fea08b6/a89ac637/40s100.dir/doc.xml

buildcol.pl> Stats (Compressing text from text)

buildcol.pl> Total bytes in collection: 90329

buildcol.pl> Total bytes in text: 90329

buildcol.pl>     creating the compression dictionary

buildcol.pl>     compressing the text (mgpp_passes -T2)

buildcol.pl> ArcPlug: procesando /opt/greenstone/collect/ahssa/archives/archives.inf

buildcol.pl> GAPlug: processing HASH495a.dir/doc.xml

buildcol.pl> GAPlug: processing HASH495a/3fea08b6.di

buildcol.pl> GAPlug: processing HASH495a/3fea08b6/a89ac637/40s100.dir/doc.xml

.

.  (here more lines , like above)

.

buildcol.pl> Stats (Compressing text from text)

buildcol.pl> Total bytes in collection: 90329

buildcol.pl> Total bytes in text: 90329

buildcol.pl> *** building index text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all; in subdirectory idxatat

buildcol.pl>     creating index dictionary (mgpp_passes -I1)

buildcol.pl> ArcPlug: procesando /opt/greenstone/collect/ahssa/archives/archives.inf

buildcol.pl> GAPlug: processing HASH495a.dir/doc.xml

buildcol.pl> GAPlug: processing HASH495a/3fea08b6.dir/doc.xml

.

.

.

buildcol.pl> GAPlug: processing HASH495a/3fea08b6/a89ac637/40s100.dir/doc.xml

buildcol.pl> Stats (Creating index text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;)

buildcol.pl> Total bytes in collection: 0

buildcol.pl> Total bytes in text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;: 0

buildcol.pl> ***************

buildcol.pl> WARNING: There is very little or no text to process for text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;

buildcol.pl>          Was this your intention?

buildcol.pl> ***************

buildcol.pl>     inverting the text (mgpp_passes -I2)

buildcol.pl> ArcPlug: procesando /opt/greenstone/collect/ahssa/archives/archives.inf

buildcol.pl> GAPlug: processing HASH495a.dir/doc.xml

buildcol.pl> GAPlug: processing HASH495a/3fea08b6.dir/doc.xml

.

. (here more lines , like above)

.

buildcol.pl> GAPlug: processing HASH495a/3fea08b6/a89ac637/40s100.dir/doc.xml

buildcol.pl> Stats (Creating index text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;)

buildcol.pl> Total bytes in collection: 0

buildcol.pl> Total bytes in text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;: 0

buildcol.pl> ***************

buildcol.pl> WARNING: There is very little or no text to process for text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;

buildcol.pl>          Was this your intention?

buildcol.pl> ***************

buildcol.pl>     create the weights file

buildcol.pl>     creating 'on-disk' stemmed dictionary

buildcol.pl>     creating stem indexes

buildcol.pl> BuildDir: /opt/greenstone/collect/ahssa/building

buildcol.pl> *** creating the info database and processing associated files

buildcol.pl> ArcPlug: procesando /opt/greenstone/collect/ahssa/archives/archives.inf

buildcol.pl> GAPlug: processing HASH495a.dir/doc.xml

buildcol.pl> GAPlug: processing HASH495a/3fea08b6.dir/doc.xml

buildcol.pl> GAPlug: processing HASH495a/3fea08b6/a89ac637.dir/doc.xml

.

.  (here more lines , like above)

.

buildcol.pl> GAPlug: processing HASH495a/3fea08b6/a89ac637/40s100.dir/doc.xml

buildcol.pl> *** outputting information for classifier: CL1

buildcol.pl> *** outputting information for classifier: CL2

buildcol.pl> *** outputting information for classifier: oai

buildcol.pl> *** creating auxiliary files

buildcol.pl> Orden completada. 

 

I only obtain in the advanced search, this:

 

Buscar ^all,text;DESCRIPTORES in ^all,text;DESCRIPTORES language que

 

contengan   de ,

 

and although , I´ve used  a lot of search index ,  for instance

 

text “contenido”[omit index]

ex.ASUNTO:^all “asunto”

ex.DESCRIPTORES:^all “descriptores” & etc,

I only obtain :

 

Palabra o frase

  ... en el campo

 

 

contenido

 

 

contenido

 

However If  I use another database  CDS/ISIS with , fields like :

Código del Centro [01],where “[ any number]” it don’t mean an array ,

Identificación [02] etc, in gsdl (greenstone digital libray) I  get Código do Centro [01], Identificaçäo [02],  and so on with  some labels of field, here I get a good  advanced search , but at the time of build these new collection in the GLI  I obtain something

Like these :

 

s

orden  : /opt/greenstone/bin/script/import.pl -gli -language es -collectdir /opt/greenstone/collect/ docsal

import.pl> ATENCIÓN: Si no se especificaron -removeold o -keepold, se establece -removeold. Se borrarán todos los contenidos del directorio archives.

import.pl> Borrando el contenido actual del directorio archives...

import.pl> RecPlug: getting directory /opt/greenstone/collect/docsal/import

import.pl> SplitPlug found 100 documents in /opt/greenstone/collect/docsal/import/DOCSAL.mst

import.pl> segment 1 –

.

.  (here more lines , like above)

.

 

import.pl> IsisPlug: processing DOCSAL.mst

import.pl> segment 100 -

import.pl> IsisPlug: processing DOCSAL.mst

import.pl> *********************************************

import.pl> La importación ha sido completada

import.pl> *********************************************

import.pl> * 100 de los documentos fueron considerados a efecto de ser procesados

import.pl> * 100 se procesaron e incluidos en la colección

import.pl> Orden completada. 

import.pl> Extrayendo nuevos metadatos de los ficheros. 

import.pl> Extracción completa de metadatos archivados.

orden  : /opt/greenstone/bin/script/buildcol.pl -gli -language es -collectdir /opt/greenstone/collect/ -removeold docsal

buildcol.pl> *** creating the compressed text

buildcol.pl>     collecting text statistics (mgpp_passes -T1)

buildcol.pl> ArcPlug: procesando /opt/greenstone/collect/docsal/archives/archives.inf

buildcol.pl> GAPlug: processing HASHbe2c.dir/doc.xml

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd.dir/doc.xml

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c.dir/doc.xml

.

.  (here more lines , like above)

.

 

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c/ads100.dir/doc.xml

buildcol.pl> Stats (Compressing text from text)

buildcol.pl> Total bytes in collection: 320048

buildcol.pl> Total bytes in text: 320048

buildcol.pl>     creating the compression dictionary

buildcol.pl>     compressing the text (mgpp_passes -T2)

buildcol.pl> ArcPlug: procesando /opt/greenstone/collect/docsal/archives/archives.inf

buildcol.pl> GAPlug: processing HASHbe2c.dir/doc.xml

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd.dir/doc.xml

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c.dir/doc.xml

.

.  (here more lines , like above)

.

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c/ads100.dir/doc.xml

buildcol.pl> Stats (Compressing text from text)

buildcol.pl> Total bytes in collection: 320048

buildcol.pl> Total bytes in text: 320048

buildcol.pl> *** building index text;AutorInst[17]^all;Cidade[56]^all;Data[64]^all; in subdirectory idx

buildcol.pl>     creating index dictionary (mgpp_passes -I1)

buildcol.pl> ArcPlug: procesando /opt/greenstone/collect/docsal/archives/archives.inf

buildcol.pl> GAPlug: processing HASHbe2c.dir/doc.xml

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd.dir/doc.xml

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c.dir/doc.xml

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c/ads4.dir/doc.xml

.

.  (here more lines , like above)

.

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c/ads100.dir/doc.xml

buildcol.pl> Stats (Creating index text;AutorInst[17]^all;Cidade[56]^all;Data[64]^all;)

buildcol.pl> Total bytes in collection: 320048

buildcol.pl> Total bytes in text;AutorInst[17]^all;Cidade[56]^all;Data[64]^all;: 155848

buildcol.pl>     inverting the text (mgpp_passes -I2)

buildcol.pl> ArcPlug: procesando /opt/greenstone/collect/docsal/archives/archives.inf

buildcol.pl> GAPlug: processing HASHbe2c.dir/doc.xml

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd.dir/doc.xml

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c.dir/doc.xml

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c/ads4.dir/doc.xml

.

.  (here more lines , like above)

.

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c/ads100.dir/doc.xml

buildcol.pl> Stats (Creating index text;AutorInst[17]^all;Cidade[56]^all;Data[64]^all;)

buildcol.pl> Total bytes in collection: 320048

buildcol.pl> Total bytes in text;AutorInst[17]^all;Cidade[56]^all;Data[64]^all;: 155848

buildcol.pl>     create the weights file

buildcol.pl>     creating 'on-disk' stemmed dictionary

buildcol.pl>     creating stem indexes

buildcol.pl> BuildDir: /opt/greenstone/collect/docsal/building

buildcol.pl> *** creating the info database and processing associated files

buildcol.pl> ArcPlug: procesando /opt/greenstone/collect/docsal/archives/archives.inf

buildcol.pl> GAPlug: processing HASHbe2c.dir/doc.xml

buildcol.pl> WARNING: AZList: HASHbe2c7ef374dd3e66808cads1 metadata is empty - not classifying

.

.  (and so on  100 times)

.

 

buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c/ads100.dir/doc.xml

buildcol.pl> WARNING: AZList: HASHbe2c7ef374dd3e66808cads100 metadata is empty - not classifying

buildcol.pl> *** outputting information for classifier: CL1

buildcol.pl> *** outputting information for classifier: CL2

buildcol.pl> *** outputting information for classifier: oai

buildcol.pl> *** creating auxiliary files

buildcol.pl> Orden completada. 

 

Here I can´t get any kink of information in AZlist, and in  another classifier like AZCompactList , I can´t  write down  , it´s title,

 

A-S 0-9


ago. 1988]

 


]

    ago. 1988] 

  ] 

 Please can you help me to solve this problem !, I´ve use the  tutorial,CDS/ISIS (http://greenstone.sourceforge.net/wiki/gsdoc/tutorial/en/cds_isis.htm), thank you for your help.

 

 

 

 


Be one of the first to try Windows Live Mail. Windows Live Mail.