|From||John R. McPherson|
|Date||Thu, 20 Feb 2003 10:16:19 +1300|
|Subject||Re: building mgpp collection from non-ascii|
|r c wrote:
> Hi all,
> I’m trying to build MGPP collection from pages that contain
> non-ascii characters (encoding Windows 1250). Import as well as
> building went smoothly but displayed results contain "cabalistic
> characters" (as soomeone pointed before).
> I’m sure that
> -import was OK (I made 2nd mg collection from the same archive
> files). -encoding preferences for receptionist were set (Windows
> 1250, UTF-8 later on ; mg collection is displayed right)
> The metadata is the only thing I can see well- ie. when I get
> search results there may be displayed First200 metadata which
> contains all non-ascii characters(I read that mgpp doesn’t
> compress metadata by default). I can’t search for strings
> that contain non-ascii characters at all(zero hits).
> Is there anyone who built *mgpp* collection from non-ascii
> encoded files? Please, let me know what I am doing wrong. Here
> are both collects.cfg and some building messages - (interesting
> is the different number of reported bytes for both collections -
> don't be stressed by the rest amount of text).
Hope this helps