Re: [greenstone-users] RE: I´ve a trouble with the advanced search in v 2.72 ,when I use the spanish language

From Katherine Don
DateMon, 08 Jan 2007 14:31:50 +1300
Subject Re: [greenstone-users] RE: I´ve a trouble with the advanced search in v 2.72 ,when I use the spanish language
In-Reply-To (BAY124-W38CEFEE9AACD8FD44966C2C9BE0-phx-gbl)
Hi

I think that your problem is that you have : in the metadata names -
this is not allowed. We use a : to separate index and subindex
specifications, so having a : in an index name will make things go wrong.

Having [xx] in metadata names will not affect indexing, but will affect
format statements, as we use [xx] to specify a position in an array of
values.

Is it possible to remove [] and : from your field names?

Regards,
Katherine

Israel Abraham Flores Cruz wrote:
>
>
> Hi! ,my name´s Israel, I´ve been working with gsdl v2.72, I need to
> exchange a both of databases from CDS/ISIS to gsdl, the language that
> I use is the Spanish, when I make a new collection , with a cds/isis
> database that no contain brackets”[ ]” in the labels of field , for
> instance *FONDO: , SECCION: ,LEGAJO*: etc. , I can´t get an advanced
> search in fact , I obtain this on the *GLI :*
>
>
>
> orden : /opt/greenstone/bin/script/import.pl -gli -language es
> -collectdir /opt/greenstone/collect/ -removeold ahssa
>
> import.pl> Borrando el contenido actual del directorio archives...
>
> import.pl> RecPlug: getting directory /opt/greenstone/collect/ahssa/import
>
> import.pl> SplitPlug found 100 documents in
> /opt/greenstone/collect/ahssa/import/AHSSA.mst
>
> import.pl> segment 1 -
>
> import.pl> IsisPlug: processing AHSSA.mst
>
> import.pl> segment 2 –
>
> *.*
>
> *. (here more lines , like above)*
>
> * *
>
> import.pl> IsisPlug: processing AHSSA.mst
>
> import.pl> segment 100 -
>
> import.pl> IsisPlug: processing AHSSA.mst
>
> import.pl> *********************************************
>
> import.pl> La importación ha sido completada
>
> import.pl> *********************************************
>
> import.pl> * 100 de los documentos fueron considerados a efecto de ser
> procesados
>
> import.pl> * 100 se procesaron e incluidos en la colección
>
> import.pl> Orden completada.
>
> import.pl> Extrayendo nuevos metadatos de los ficheros.
>
> import.pl> Extracción completa de metadatos archivados.
>
> orden : /opt/greenstone/bin/script/buildcol.pl -gli -language es
> -collectdir /opt/greenstone/collect/ -removeold ahssa
>
> buildcol.pl> *** creating the compressed text
>
> buildcol.pl> collecting text statistics (mgpp_passes -T1)
>
> buildcol.pl> ArcPlug: procesando
> /opt/greenstone/collect/ahssa/archives/archives.inf
>
> buildcol.pl> GAPlug: processing HASH495a.dir/doc.xml
>
> *.*
>
> *. (here more lines , like above)*
>
> *.*
>
> buildcol.pl> GAPlug: processing
> HASH495a/3fea08b6/a89ac637/40s100.dir/doc.xml
>
> buildcol.pl> Stats (Compressing text from text)
>
> buildcol.pl> Total bytes in collection: 90329
>
> buildcol.pl> Total bytes in text: 90329
>
> buildcol.pl> creating the compression dictionary
>
> buildcol.pl> compressing the text (mgpp_passes -T2)
>
> buildcol.pl> ArcPlug: procesando
> /opt/greenstone/collect/ahssa/archives/archives.inf
>
> buildcol.pl> GAPlug: processing HASH495a.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASH495a/3fea08b6.di
>
> buildcol.pl> GAPlug: processing
> HASH495a/3fea08b6/a89ac637/40s100.dir/doc.xml
>
> *.*
>
> *. (here more lines , like above)*
>
> *.*
>
> buildcol.pl> Stats (Compressing text from text)
>
> buildcol.pl> Total bytes in collection: 90329
>
> buildcol.pl> Total bytes in text: 90329
>
> buildcol.pl> *** building index
> text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;
> in subdirectory idxatat
>
> buildcol.pl> creating index dictionary (mgpp_passes -I1)
>
> buildcol.pl> ArcPlug: procesando
> /opt/greenstone/collect/ahssa/archives/archives.inf
>
> buildcol.pl> GAPlug: processing HASH495a.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASH495a/3fea08b6.dir/doc.xml
>
> *.*
>
> *.*
>
> *.*
>
> buildcol.pl> GAPlug: processing
> HASH495a/3fea08b6/a89ac637/40s100.dir/doc.xml
>
> buildcol.pl> Stats (Creating index
> text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;)
>
> buildcol.pl> Total bytes in collection: 0
>
> buildcol.pl> Total bytes in
> text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;:
> 0
>
> buildcol.pl> ***************
>
> buildcol.pl> WARNING: There is very little or no text to process for
> text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;
>
> buildcol.pl> Was this your intention?
>
> buildcol.pl> ***************
>
> buildcol.pl> inverting the text (mgpp_passes -I2)
>
> buildcol.pl> ArcPlug: procesando
> /opt/greenstone/collect/ahssa/archives/archives.inf
>
> buildcol.pl> GAPlug: processing HASH495a.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASH495a/3fea08b6.dir/doc.xml
>
> *.*
>
> *. (here more lines , like above)*
>
> *.*
>
> buildcol.pl> GAPlug: processing
> HASH495a/3fea08b6/a89ac637/40s100.dir/doc.xml
>
> buildcol.pl> Stats (Creating index
> text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;)
>
> buildcol.pl> Total bytes in collection: 0
>
> buildcol.pl> Total bytes in
> text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;:
> 0
>
> buildcol.pl> ***************
>
> buildcol.pl> WARNING: There is very little or no text to process for
> text;ASUNTO:^all,text;DESCRIPTORES:^all,text;EXPEDIENTE:^all,text;FECHA:^all,text;FOJAS:^all,text;FONDO:^all,text;LEGAJO:^all,text;SECCION:^all,text;text;ASUNTO:^all;DESCRIPTORES:^all;EXPEDIENTE:^all;FECHA:^all;FOJAS:^all;FONDO:^all;LEGAJO:^all;SECCION:^all;
>
> buildcol.pl> Was this your intention?
>
> buildcol.pl> ***************
>
> buildcol.pl> create the weights file
>
> buildcol.pl> creating 'on-disk' stemmed dictionary
>
> buildcol.pl> creating stem indexes
>
> buildcol.pl> BuildDir: /opt/greenstone/collect/ahssa/building
>
> buildcol.pl> *** creating the info database and processing associated files
>
> buildcol.pl> ArcPlug: procesando
> /opt/greenstone/collect/ahssa/archives/archives.inf
>
> buildcol.pl> GAPlug: processing HASH495a.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASH495a/3fea08b6.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASH495a/3fea08b6/a89ac637.dir/doc.xml
>
> *.*
>
> *. (here more lines , like above)*
>
> *.*
>
> buildcol.pl> GAPlug: processing
> HASH495a/3fea08b6/a89ac637/40s100.dir/doc.xml
>
> buildcol.pl> *** outputting information for classifier: CL1
>
> buildcol.pl> *** outputting information for classifier: CL2
>
> buildcol.pl> *** outputting information for classifier: oai
>
> buildcol.pl> *** creating auxiliary files
>
> buildcol.pl> Orden completada.
>
>
>
> I only obtain in the advanced search, this:
>
>
>
> Buscar ^all,text;DESCRIPTORES in ^all,text;DESCRIPTORES language que
>
>
>
> contengan de ,
>
>
>
> and although , I´ve used a lot of search index , for instance
>
>
>
> text “contenido”[omit index]
>
> ex.ASUNTO:^all “asunto”
>
> ex.DESCRIPTORES:^all “descriptores” & etc,
>
> I only obtain :
>
>
>
> *Palabra o frase*
>
>
>
> * ... en el campo*
>
>
>
>
>
>
>
>
>
> contenido
>
>
>
>
>
>
>
>
>
> contenido
>
>
>
> However If I use another database CDS/ISIS with , fields like :
>
> Código del Centro [01],where “[ any number]” *it don’t mean an array* ,
>
> Identificación [02] etc, in gsdl (greenstone digital libray) I get
> *Código do Centro [01], Identificaçäo [02], *and so on with some
> labels of field, here I get a good advanced search , but at the time of
> build these new collection in the *GLI * I obtain something
>
> Like these :
>
>
>
> s
>
> orden : /opt/greenstone/bin/script/import.pl -gli -language es
> -collectdir /opt/greenstone/collect/ docsal
>
> import.pl> ATENCIÓN: Si no se especificaron -removeold o -keepold, se
> establece -removeold. Se borrarán todos los contenidos del directorio
> archives.
>
> import.pl> Borrando el contenido actual del directorio archives...
>
> import.pl> RecPlug: getting directory /opt/greenstone/collect/docsal/import
>
> import.pl> SplitPlug found 100 documents in
> /opt/greenstone/collect/docsal/import/DOCSAL.mst
>
> import.pl> segment 1 –
>
> *.*
>
> *. (here more lines , like above)*
>
> *.*
>
>
>
> import.pl> IsisPlug: processing DOCSAL.mst
>
> import.pl> segment 100 -
>
> import.pl> IsisPlug: processing DOCSAL.mst
>
> import.pl> *********************************************
>
> import.pl> La importación ha sido completada
>
> import.pl> *********************************************
>
> import.pl> * 100 de los documentos fueron considerados a efecto de ser
> procesados
>
> import.pl> * 100 se procesaron e incluidos en la colección
>
> import.pl> Orden completada.
>
> import.pl> Extrayendo nuevos metadatos de los ficheros.
>
> import.pl> Extracción completa de metadatos archivados.
>
> orden : /opt/greenstone/bin/script/buildcol.pl -gli -language es
> -collectdir /opt/greenstone/collect/ -removeold docsal
>
> buildcol.pl> *** creating the compressed text
>
> buildcol.pl> collecting text statistics (mgpp_passes -T1)
>
> buildcol.pl> ArcPlug: procesando
> /opt/greenstone/collect/docsal/archives/archives.inf
>
> buildcol.pl> GAPlug: processing HASHbe2c.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c.dir/doc.xml
>
> *.*
>
> *. (here more lines , like above)*
>
> *.*
>
>
>
> buildcol.pl> GAPlug: processing
> HASHbe2c/7ef374dd/3e66808c/ads100.dir/doc.xml
>
> buildcol.pl> Stats (Compressing text from text)
>
> buildcol.pl> Total bytes in collection: 320048
>
> buildcol.pl> Total bytes in text: 320048
>
> buildcol.pl> creating the compression dictionary
>
> buildcol.pl> compressing the text (mgpp_passes -T2)
>
> buildcol.pl> ArcPlug: procesando
> /opt/greenstone/collect/docsal/archives/archives.inf
>
> buildcol.pl> GAPlug: processing HASHbe2c.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c.dir/doc.xml
>
> *.*
>
> *. (here more lines , like above)*
>
> *.*
>
> buildcol.pl> GAPlug: processing
> HASHbe2c/7ef374dd/3e66808c/ads100.dir/doc.xml
>
> buildcol.pl> Stats (Compressing text from text)
>
> buildcol.pl> Total bytes in collection: 320048
>
> buildcol.pl> Total bytes in text: 320048
>
> buildcol.pl> *** building index
> text;AutorInst[17]^all;Cidade[56]^all;Data[64]^all; in subdirectory idx
>
> buildcol.pl> creating index dictionary (mgpp_passes -I1)
>
> buildcol.pl> ArcPlug: procesando
> /opt/greenstone/collect/docsal/archives/archives.inf
>
> buildcol.pl> GAPlug: processing HASHbe2c.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c/ads4.dir/doc.xml
>
> *.*
>
> *. (here more lines , like above)*
>
> *.*
>
> buildcol.pl> GAPlug: processing
> HASHbe2c/7ef374dd/3e66808c/ads100.dir/doc.xml
>
> buildcol.pl> Stats (Creating index
> text;AutorInst[17]^all;Cidade[56]^all;Data[64]^all;)
>
> buildcol.pl> Total bytes in collection: 320048
>
> buildcol.pl> Total bytes in
> text;AutorInst[17]^all;Cidade[56]^all;Data[64]^all;: 155848
>
> buildcol.pl> inverting the text (mgpp_passes -I2)
>
> buildcol.pl> ArcPlug: procesando
> /opt/greenstone/collect/docsal/archives/archives.inf
>
> buildcol.pl> GAPlug: processing HASHbe2c.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c.dir/doc.xml
>
> buildcol.pl> GAPlug: processing HASHbe2c/7ef374dd/3e66808c/ads4.dir/doc.xml
>
> *.*
>
> *. (here more lines , like above)*
>
> *.*
>
> buildcol.pl> GAPlug: processing
> HASHbe2c/7ef374dd/3e66808c/ads100.dir/doc.xml
>
> buildcol.pl> Stats (Creating index
> text;AutorInst[17]^all;Cidade[56]^all;Data[64]^all;)
>
> buildcol.pl> Total bytes in collection: 320048
>
> buildcol.pl> Total bytes in
> text;AutorInst[17]^all;Cidade[56]^all;Data[64]^all;: 155848
>
> buildcol.pl> create the weights file
>
> buildcol.pl> creating 'on-disk' stemmed dictionary
>
> buildcol.pl> creating stem indexes
>
> buildcol.pl> BuildDir: /opt/greenstone/collect/docsal/building
>
> buildcol.pl> *** creating the info database and processing associated files
>
> buildcol.pl> ArcPlug: procesando
> /opt/greenstone/collect/docsal/archives/archives.inf
>
> buildcol.pl> GAPlug: processing HASHbe2c.dir/doc.xml
>
> buildcol.pl> WARNING: AZList: HASHbe2c7ef374dd3e66808cads1 metadata is
> empty - not classifying
>
> *.*
>
> *. (and so on 100 times)*
>
> *.*
>
>
>
> buildcol.pl> GAPlug: processing
> HASHbe2c/7ef374dd/3e66808c/ads100.dir/doc.xml
>
> buildcol.pl> WARNING: AZList: HASHbe2c7ef374dd3e66808cads100 metadata is
> empty - not classifying
>
> buildcol.pl> *** outputting information for classifier: CL1
>
> buildcol.pl> *** outputting information for classifier: CL2
>
> buildcol.pl> *** outputting information for classifier: oai
>
> buildcol.pl> *** creating auxiliary files
>
> buildcol.pl> Orden completada.
>
>
>
> Here I can´t get any kink of information in AZlist, and in another
> classifier like AZCompactList , I can´t write down , it´s title,
>
>
>
> *A-S* _0-9_
> <http://desarrollo.salud.gob.mx/cgi-bin/library?e=d-000-00---0docsal--00-1-0--0prompt-10---4------0-1l--1-es-50---20-about---00131-001-1-0utfZz-8-00&a=d&cl=CL2.2>
>
>
> <http://desarrollo.salud.gob.mx/cgi-bin/library?e=d-000-00---0docsal--00-1-0--0prompt-10---4------0-1l--1-es-50---20-about---00131-001-1-0utfZz-8-00&a=d&cl=CL2.1.1.pr>
>
>
>
> ago. 1988]
>
>
>
>
>
>
> <http://desarrollo.salud.gob.mx/cgi-bin/library?e=d-000-00---0docsal--00-1-0--0prompt-10---4------0-1l--1-es-50---20-about---00131-001-1-0utfZz-8-00&a=d&c=docsal&cl=CL2.1.1&d=HASHbe2c7ef374dd3e66808cads64>
>
>
>
> ]
>
> ago. 1988]
>
> ]
>
> Please can you help me to solve this problem !, I´ve use the
> tutorial,CDS/ISIS
> (_http://greenstone.sourceforge.net/wiki/gsdoc/tutorial/en/cds_isis.htm_),
> thank you for your help.
>
>
>
>
>
>
>
>
>
> ------------------------------------------------------------------------
> Be one of the first to try Windows Live Mail. Windows Live Mail.
> <http://ideas.live.com/programpage.aspx?versionId=5d21c51a-b161-4314-9b0e-4911fb2b2e6d>
>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> greenstone-users mailing list
> greenstone-users@list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users