RE: [greenstone-users] Ignoring leading articles

From Yachnes, Paul
DateMon, 27 Jun 2005 19:52:54 -0400
Subject RE: [greenstone-users] Ignoring leading articles
Katherine,

Yes the documents are English and the Language metadata in the doc.xml files
is "en" which I assume is English. But Greenstone is definitely sorting on
leading articles.

Paul

-----Original Message-----
From: Katherine Don
To: Yachnes, Paul
Cc: 'greenstone-users@list.scms.waikato.ac.nz '
Sent: 6/27/2005 7:22 PM
Subject: Re: [greenstone-users] Ignoring leading articles

Hi Paul

Are your documents in English, and does Greenstone think they are?
(check Language metadata in the archives doc.xml files). We only do this
for english docs.

The 'the a an' will remain in the metadata that gets displayed, even if
it has been removed for sorting.

Perhaps you could send me your config file, and some examples of
metadata that is not being sorted properly.

Regards,
Katherine


Yachnes, Paul wrote:
> Katherine,
>
> Greenstone is definitley NOT ignoring or removing the, a, an when
sorting.
> Can you think of any reason for this and how to correct it?
>
> Paul
>
> -----Original Message-----
> From: Katherine Don
> To: Yachnes, Paul
> Cc: greenstone-users@list.scms.waikato.ac.nz
> Sent: 6/27/2005 5:38 PM
> Subject: Re: [greenstone-users] Ignoring leading articles
>
> Hi Paul
>
> Greenstone already does this in a very limited way (removes the, a, an
> for english strings). You can configure the classifiers to do
something
> different by using the -removeprefix option - the value should be a
> regular expression, and whatever it matches will be removed for
sorting
> purposes.
>
> Regards,
> Katherine
>
> Yachnes, Paul wrote:
>
>>Is there any way to configure Greenstone to ignore leading articles in
>>metadata for sorting purposes in VLists?
>>
>>
>>
>>Paul A. Yachnes, MLS
>>
>>Senior Manager
>>
>>Information Resource Center
>>
>>Newspaper Association of America
>>
>>(703) 902-1694
>>
>>fax: (703) 902-1691
>>
>>yachp@naa.org
>>
>>
>>
>>
>>
>
>
------------------------------------------------------------------------
>
>>_______________________________________________
>>greenstone-users mailing list
>>greenstone-users@list.scms.waikato.ac.nz
>>https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>
>