Re: [greenstone-users] Sorting Russian PDF in Classifiers

From: Michael Dewsnip
Date: Thu, 15 Jun 2006 10:45:13 +1200
Subject: Re: [greenstone-users] Sorting Russian PDF in Classifiers
In-Reply-To: (000001c68dca$496c91c0$6401a8c0-WALLACE)

Hi Jonathan,

Please check you have specified the "-sort" and
"-no_metadata_formatting" options to the classifier. If this still
doesn't work then it is probably a bug: please let us know and we'll
look into it further.
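
For illustration only, a classifier line in the collection's collect.cfg
might look like the sketch below. The metadata names "Title" and
"SortKey" are placeholders (they are not taken from Jonathan's
collection), and the exact option set can differ between Greenstone
versions, so treat this as an assumption to check against your own
configuration rather than a definitive setting.

```
# Hypothetical example: classify on Title, sort leaf nodes on the
# dedicated sort field, and skip automatic metadata formatting.
classify AZCompactList -metadata Title -sort SortKey -no_metadata_formatting
```

The collection needs to be rebuilt after changing collect.cfg for the
new classifier options to take effect.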

All the best,

Michael

Jonathan Tremblay wrote:

> Hi,
>
> My project contains English, Spanish, French and Russian documents
> (all in PDF).
>
> Since the beginning, sorting has been a problem. So I created a
> metadata field specifically for sorting. At first I used numbers, but
> since I still had problems, I started using characters in that field
> (AAAA, AAAB, AAAC, etc.)
>
> It worked perfectly for the search results. But Russian documents are
> not sorted correctly in the classifiers (AZCompactList and Hierarchy):
> they always appear before other documents (e.g. RAAA, RAAB, AAAA, AAAB,
> AAAC, etc.)
>
> Why?
>
> I had a similar problem with an English PDF which contained no
> editable text (it contained only images from a scan). As soon as I
> replaced the document with a PDF version containing text, the document
> got sorted correctly. By the way, all my Russian documents contain
> editable text.
>
> Thanks,
>
> Jonathan Tremblay