From | Joel Sandeep |
Date | Tue, 25 Jul 2006 08:50:03 +0530 (IST) |
Subject | Re: [greenstone-users] Title page image |
In-Reply-To | (20060725010642-225CA22CEF-nidhi-vidyanidhi-org-in) |
> Send greenstone-users mailing list submissions to
> greenstone-users@list.scms.waikato.ac.nz
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
> or, via email, send a message with subject or body 'help' to
> greenstone-users-request@list.scms.waikato.ac.nz
>
> You can reach the person managing the list at
> greenstone-users-owner@list.scms.waikato.ac.nz
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of greenstone-users digest..."
>
>
> Today's Topics:
>
> 1. Re: Hebrew characters (John R. McPherson)
> 2. Highlighting Search Results (makekee@msu.ac.zw)
> 3. Title page image (Chandana Patra)
> 4. Re: Highlighting Search Results (sw64@cs.waikato.ac.nz)
> 5. ERROR: Cannot Load collection (Admin)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Mon, 24 Jul 2006 16:04:26 +1200
> From: "John R. McPherson" <jrm21@cs.waikato.ac.nz>
> Subject: Re: [greenstone-users] Hebrew characters
> To: Admin <admin@egoists.info>
> Cc: greenstone-users@list.scms.waikato.ac.nz
> Message-ID: <20060724040426.GR7361@matai.cs.waikato.ac.nz>
> Content-Type: text/plain; charset=utf-8
>
> On Sun, Jul 23, 2006 at 10:55:02PM -0500, Admin wrote:
>> Hello everybody,
>> We have few English books with paragphs in Hebrew. As you can see,
>> they don't show up corectly
>> http://lib.kabbalah.info/cgi-bin/library?e=d-000-00---0newtest--00-0-0--0prompt-10---4----dte--0-1l--1-en-50---20-about-%d7%a4--00031-001-1-0utfZz-8-00&a=d&c=newtest&cl=CL1.3.3&d=HASHb59144edba1cb57bf4a48d.5
>>
>> We are using UTF encoding in HTML files. All files are OK, if I test
>> them from my local machine.
>> Please, any advice how to fix this error.
>
> It looks like the input documents have been re-encoded from iso-8859 to
> utf-8 - probably because greenstone tries to guess the encoding if you
> don't specify it, and in this case it has guessed wrong.
> There are two ways to fix this: 1 quick fix, and 1 longer term fix.
>
> The quick fix is for you to specify the encoding in your collect.cfg
> file. Eg
> plugin HTMLPlug -input_encoding utf8
> for all your HTML source documents.
>
> The long-term fix is to improve our language detection - I suspect we
> don't have any language models for Hebrew. If you can email me
> (off-list) several documents in Hebrew then I can add model files (in
> perllib/textcat) for it.
>
> John McPherson
>
>
>
> ------------------------------
>
> Message: 2
> Date: Mon, 24 Jul 2006 08:20:19 +0200
> From: makekee@msu.ac.zw
> Subject: [greenstone-users] Highlighting Search Results
> To: greenstone-users@list.scms.waikato.ac.nz
> Message-ID: <20060724082019.wesu29py8googggk@216.104.194.219>
> Content-Type: text/plain;charset=ISO-8859-1
>
> Does anybody know how to highlight search terms in retrieved documents.
> For example if i search for "mad cow", i want these terms to be
> highlighted in the document body.
>
>
> Ephraim Makeke
>
> ----------------------------------------------------------------
> Message From Midlands State University P Bag 9055,Gweru,Zimbabwe
> website: www.msu.ac.zw
>
>
>
>
> ------------------------------
>

Dear Chandana,

Use an <img src="image path"> tag in the VList of the browsing classifier, and the image is displayed automatically. Use <tr> and <td> tags to place the image in the proper position.

With regards,
Joel Sandeep
Software Engineer
Vidyanidhi Digital Library
University of Mysore, Mysore, India

> Message: 3
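
As a rough sketch of the <img> advice in this reply, a collect.cfg format statement along the following lines would put the image in its own table cell beside the title. This is only an illustration under stated assumptions: 'image path' is the same placeholder used in the message, CL1 is assumed to be the classifier being browsed, and the [link][Title][/link] cell is an assumed layout rather than anything taken from Chandana's collection.

```
# collect.cfg (sketch only)
# 'image path' is a placeholder; CL1 is an assumed classifier number.
format CL1VList "<td><img src='image path'></td><td>[link][Title][/link]</td>"
```

Single quotes are used around the src value here so that they do not clash with the double quotes that delimit the format string. The collection normally has to be rebuilt or the page reloaded before a changed format statement shows up.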
|