Re: Re: [greenstone-users] Missing documents

From sw64@cs.waikato.ac.nz
DateFri, 5 Jan 2007 10:35:04 +1300 (NZDT)
Subject Re: Re: [greenstone-users] Missing documents
In-Reply-To (7-0-1-0-2-20070104093613-02326008-free-fr)
Hello John,

You are right, there is a bug in Greenstone. The classifier is ok on the
extracted metadata (i.e ex.Title), which was what I tested, but not on the
dc.Title. I will trace the bug next week. Again, thanks for pointing it
out.

Regards
Shaoqun

> Dear Shaoqun,
>
> All of the long filenames had also accented
> French characters (à, é, è, ù, ç), and when I
> shortened the names, some ended up without
> accented characters and these were displayed.
> Thus I think it is a problem with accented
> characters rather than with long filenames. I am
> sending separately one of the files with accented
> characters in the filename, as well as the
> collect.cfg file. When I browse on title, for
> example, only the documents without accented
> characters in the filename appear in the document
> list. I am wondering whether it is a problem of
> using Greenstone in English on a French version
> of Windows (there were similar problems with an
> earlier version of Greenstone - I believe with
> the installation procedure, but I don't remember
> exactly - which the Greenstone team fixed. Best regards, John
>
> At 03:30 04/01/2007, you wrote:
>>------------------------------
>>
>>Message: 8
>>Date: Thu, 4 Jan 2007 15:28:18 +1300 (NZDT)
>>From: sw64@cs.waikato.ac.nz
>>Subject: Re: [greenstone-users] Missing documents
>>To: sw64@cs.waikato.ac.nz
>>Cc: greenstone-users@list.scms.waikato.ac.nz, John Rose
>> <johnrose@alumni.caltech.edu>
>>Message-ID:
>> <44656.130.217.244.2.1167877698.squirrel@webmail.scms.waikato.ac.nz>
>>Content-Type: text/plain;charset=iso-8859-1
>>
>>Hello,
>>
>> >> When I built the collection, I found that two of
>> >> the pdf documents were rejected (see below), and
>> >> the others seemed to be processed normally. I
>> >> believe that searching worked for the processed
>> >> documents, but when I tried to display them in
>> >> browsing classifiers, those with filenames of
>> >> more than 36 characters (but which were handled
>> >> without problems by Windows) would not display
>> >> (at least with the default VList). When I
>> >> shortened the filenames and tried again, I found
>> >> that the documents with filenames with French
>> >> accented characters would not display with the
>> >> browsing classifiers (although they apparently
>> >> did display when found by search). When I took
>> >> out the accents, all 14 are displayed normally.
>> >> Is this a bug or is there a way to get around it?
>>
>>I tried it on our windows machine using 2.72 version--making the filename
>>longer than 36 chars and with french accented chars, and it seemed work
>>fine, so could you send me one of your such files (if possible)?
>>
>>Regards
>>Shaoqun
>>
>>------------------------------
>>
>>_______________________________________________
>>greenstone-users mailing list
>>greenstone-users@list.scms.waikato.ac.nz
>>https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>>
>>
>>End of greenstone-users Digest, Vol 46, Issue 4
>>***********************************************
>
>
> John B. Rose
> 1 Bis, Rue des Châtre-Sacs
> 92310 Sèvres
> France
> Email: <johnrose@alumni.caltech.edu>
>
>
> _______________________________________________
> greenstone-users mailing list
> greenstone-users@list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>
>