Re: [greenstone-devel] Version 2.72 and CDS/ISIS - order of subfileds]

From ruben pandolfi
DateFri, 15 Dec 2006 08:08:18 +0100
Subject Re: [greenstone-devel] Version 2.72 and CDS/ISIS - order of subfileds]
Hello John,

Thank you very much for your work and for your message,

I have a litte question about CDS/ISIS, do you think is possible to
maintain subfield order and relation?

Example:

extract from doc.xml

......................................................................-
<Description>
<Metadata
name=3D"gsdlsourcefilename">import/BABEL3.00000301/00000400.nul</Metadata=
>
<Metadata name=3D"gsdldoctype">indexed_doc</Metadata>
<Metadata name=3D"Plugin">NULPlug</Metadata>
<Metadata name=3D"Source">00000400.nul</Metadata>
<Metadata name=3D"FileSize">0</Metadata>
<Metadata name=3D"null_file">00000400.nul</Metadata>
<Metadata name=3D"dc.DcContributor^a">Morin, Edgar</Metadata>
<Metadata name=3D"dc.DcContributor^a">All=E8gre, Claude</Metadata>
<Metadata name=3D"dc.DcContributor^*">Morin, Edgar</Metadata>
<Metadata name=3D"dc.DcContributor^*">All=E8gre, Claude</Metadata>
<Metadata name=3D"dc.DcDate">1994</Metadata>
<Metadata name=3D"dc.DcType">entretien</Metadata>
<Metadata name=3D"dc.ModsRecordInfoRecordIdentifi">400</Metadata>
<Metadata name=3D"dc.DcContributor^q">interviewer</Metadata>
<Metadata name=3D"dc.DcTitle">Edgar Allegr=E8ment</Metadata>
<Metadata name=3D"dc.ModsOriginInfoPlace">AR</Metadata>
<Metadata name=3D"dc.DcContributor">Morin, Edgar</Metadata>
<Metadata name=3D"dc.DcContributor">All=E8gre, Claude, interviewer</Metad=
ata>
<Metadata name=3D"dc.DcLanguage">fre</Metadata>
<Metadata name=3D"dc.DcIdentifier">babel-id-400</Metadata>
-
<Metadata name=3D"dc.ISISRawRecord">
tag=3D10 data=3DEdgar Allegr=E8ment
tag=3D20 data=3Dentretien
tag=3D30 data=3D^aMorin, Edgar%^aAll=E8gre, Claude^qinterviewer
tag=3D40 data=3Dfre
tag=3D65 data=3DAR
tag=3D85 data=3D1994
tag=3D90 data=3Dbabel-id-400
tag=3D190 data=3D400
</Metadata>
<Metadata name=3D"Title">00000400</Metadata>
<Metadata name=3D"Identifier">HASH01f630c90dd3b1b10d811117</Metadata>
<Metadata name=3D"assocfilepath">HASH01f6.dir</Metadata>
</Description>

......................................................................

In this case I would like to display the isis record:

tag=3D30 data=3D^aMorin, Edgar%^aAll=E8gre, Claude^qinterviewer


as

Morin, Edgar
All=E8gre, Claude - interviewer


the closer I can get at the moment is with:
......................................................................

{If}{[dc.DcContributor], <tr class=3D"metadata"><td
valign=3Dtop><b>_Co_:</b></td><td valign=3Dtop>[sibling(all\' <br />
'):dc.DcContributor]</td></tr>}
......................................................................

this displays:

......................................................................
Author:Morin, Edgar
All=E8gre, Claude, interviewer
......................................................................

but I do not have a way to format dc.DcContributor^q

On the other hand, if I take every signle subfield I always get
......................................................................
Author:Morin, Edgar, interviewer
All=E8gre, Claude
......................................................................

because GSDL does not keep the order or association of subfileds,

Do you think there is a way to achive that in importing/exploding
cds/isis db ?

Thank you

Ruben

John Rose ha scritto:
> Dear Greenstone users/developers,
>=20
> I have been working with the Greenstone team to ensure liaison with=20
> CDS/ISIS users, and am taking this opportunity to list (in more detail=20
> than in the release announcements) the improvements for CDS/ISIS=20
> database conversion in Greenstone version 2.72 relative to version 2.70=
=20
> (most of these functions were available in 2.71 but some had bugs or=20
> have been further improved, so CDS/ISIS users wishing to benefit from=20
> them are advised to upgrade to 2.72):
>=20
> 1. The ^* metadata element is available to access the first subfield of=
=20
> a field with subfields (even if it is the main field without a=20
> delimiting prefix).
>=20
> 2. Backslashes in a CDS/ISIS field (e.g. Windows file paths) will=20
> display correctly with Greenstone formatting language.
>=20
> 3. Support for DOS 852 coding (needed for DOS-based CDS/ISIS databases=20
> in Eastern European languages).
>=20
> 4. Logically deleted records will not be imported (with prior versions,=
=20
> you had to export to an ISO file and re-import into CDS/ISIS before=20
> converting to Greenstone)
>=20
> 5. A "-records_per_folder" option has been added to the explode=20
> function. This puts the records from exploding a metadata database into=
=20
> multiple subdirectories, which means that the GLI should use less memor=
y=20
> and edit the metadata more quickly. This option has not yet been tested=
=20
> for its usefulness in real conversion situations; it may be tried for=20
> large databases in which the time for explosion seems inordinately long=
.=20
> The default value is 100, so you can try a lower value, say 10.
>=20
> 6. A bug under Linux, by which the CDS/ISIS files with filenames in=20
> capital letters were not handled correctly, has been fixed (previously=20
> the filenames had to be changed manually to small letters before=20
> dragging them into GLI).
>=20
> 7. '&' characters and spaces in filenames now work in the=20
> "document_field" parameter of the explode function (previously, the=20
> corresponding documents were not imported).
>=20
> 8. When the "document_field" CDS/ISIS field is repeatable, each=20
> occurrence will yield a separate Greenstone document, each with the sam=
e=20
> metadata (previously only the first occurrence was imported).
>=20
> 9. Building a CDS/ISIS collection (either "as is", i.e. metadata only,=20
> or by exploding) should be significantly faster in Greenstone v2.71, as=
=20
> it no longer tries to determine the encoding of the CDS/ISIS file.
>=20
> All reported problems with the "as is" conversion of large CDS/ISIS=20
> databases with GLI seem to have been resolved with v2.72 - one user has=
=20
> successfully converted a database of 38,000 records. On the other hand,=
=20
> GLI may fail at the explode step because it wasn't designed to handle=20
> huge amounts of metadata (typically when approaching 15,000 CDS/ISIS=20
> records, but possibly less or greater depending on the size of the=20
> records); in this case, the command line may be used, and I will shortl=
y=20
> be posting to the Wiki a summary of this process for basic Greenstone=20
> users. Please do report to the discussion lists any problems encountere=
d=20
> in CDS/ISIS conversions.
>=20
> With best regards, John=
=20
> Rose
>=20
>=20
>=20
>=20
> John B. Rose
> Honorary Research Associate, University of Waikato
> S=E8vres, France
> Email: <johnrose@alumni.caltech.edu>
>=20
> _______________________________________________
> greenstone-devel mailing list
> greenstone-devel@list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-devel
>=20


--=20

..................
..................

Ruben Pandolfi

-------------------------------------------------------------
"...I Think This is the Beginning of a Beautiful Friendship."
-------------------------------------------------------------


--=20

..................
..................

Ruben Pandolfi

-------------------------------------------------------------
"...I Think This is the Beginning of a Beautiful Friendship."
-------------------------------------------------------------