[greenstone-users] Re: greenstone-users Digest, Vol 89, Issue 16

From Yoseph bizuneh
DateMon Aug 30 18:17:06 2010
Subject [greenstone-users] Re: greenstone-users Digest, Vol 89, Issue 16
Hi list,

When i tried to build collection of 8771.pdf files, 8509 was
processed and 199 pdf files was rejected everything ok!
TOTAL OF 34.3 GB pdf files. but i tried to build collection which content 8509
pdf files in grennstone 2.83 (linux platform).


After this, the Building progress bar shows 100% complete, and an error message
"Collection Preview State"pops up with the message:
**************************************************************
"An error has occurred which will prevent the collection being previewed".
***************************************************************

This collection does not show up on the user web interface.
Could you tell the reason of our problem?
Thank you in advance
yoseph bizuneh

adama university
ethiopia.

________________________________
From: "greenstone-users-request@list.scms.waikato.ac.nz"
<greenstone-users-request@list.scms.waikato.ac.nz>
To: greenstone-users@list.scms.waikato.ac.nz
Sent: Mon, August 30, 2010 3:57:01 AM
Subject: greenstone-users Digest, Vol 89, Issue 16

Send greenstone-users mailing list submissions to
greenstone-users@list.scms.waikato.ac.nz

To subscribe or unsubscribe via the World Wide Web, visit
https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
or, via email, send a message with subject or body 'help' to
greenstone-users-request@list.scms.waikato.ac.nz

You can reach the person managing the list at
greenstone-users-owner@list.scms.waikato.ac.nz

When replying, please edit your Subject line so it is more specific
than "Re: Contents of greenstone-users digest..."


Today's Topics:

1. fail log for xml metadata files? (Mariana Pichinini)
2. Greenstone CD (Semiti Ravatu)
3. Re: PagedImage Plugin - problem (Katherine Don)
4. Re: Problem in updating version from 2.72 to 2.83 (Katherine Don)
5. Re: backups - Greenstone 3 (Katherine Don)
6. Re: How to create a list of Subjects. (Katherine Don)
7. Re: indexing problem by search engines (Katherine Don)
8. Re: CD/DVD image (Katherine Don)


----------------------------------------------------------------------

Message: 1
Date: Fri, 27 Aug 2010 17:49:40 -0300 (ART)
From: "Mariana Pichinini" <mariana@fahce.unlp.edu.ar>
Subject: [greenstone-users] fail log for xml metadata files?
To: greenstone-users@list.scms.waikato.ac.nz
Message-ID:
<53dad501b83f678e2e6fa85b2f46c640.squirrel@webmail.fahce.unlp.edu.ar>
Content-Type: text/plain;charset=iso-8859-1

Hello everybody

The fail.log file, generated by import.pl (Greenstone v2.82), include
records from the data files that failed to be processed sucessfully.
But it does not log rejected metadata files, usually malformed xml.

There is any way to include them in the fail log file?

Thanks in advance

Lic. Mariana Pichinini
Area Tecnolog□as
_______________________________________________
BIBHUMA - Biblioteca Profesor Guillermo Obiols
Facultad de Humanidades y Ciencias de la Educaci□n
Universidad Nacional de La Plata
Calle 48 entre 6 y 7 - 1er subsuelo
B1900AMW LA PLATA, Argentina
Telefax: +54-221-4230125 interno 162 (l□neas rotativas)
WEB: www.bibhuma.fahce.unlp.edu.ar


------------------------------

Message: 2
Date: Fri, 27 Aug 2010 23:11:34 -0700 (PDT)
From: Semiti Ravatu <semitir@ymail.com>
Subject: [greenstone-users] Greenstone CD
To: greenstone-users@list.scms.waikato.ac.nz
Message-ID: <563537.8092.qm@web59610.mail.ac4.yahoo.com>
Content-Type: text/plain; charset="iso-8859-1"

Hi

Can you send a Greenstone CD please

Semiti Ravatu
51 San Miguel Way
Novato CA 94945

Thank you
Semiti

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
https://list.scms.waikato.ac.nz/mailman/private/greenstone-users/attachments/20100827/8bbd5575/attachment-0001.html


------------------------------

Message: 3
Date: Mon, 30 Aug 2010 11:33:26 +1200
From: Katherine Don <kjdon@cs.waikato.ac.nz>
Subject: Re: [greenstone-users] PagedImage Plugin - problem
To: wtmann@comune.belluno.it
Cc: greenstone-users@list.scms.waikato.ac.nz
Message-ID: <4C7AEE46.4080900@cs.waikato.ac.nz>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Hi William

This is a bug that is my fault. srclink is supposed to be generated at
runtime from the srclink_file value, and it is for classifiers and
search results, but I obviously missed out testing its use in the
document page. The fix involves source code changes. The next release
will be fixed.

The easiest thing for you to do would be to modify your format
statement. This would be easier than recompiling the code.

In DocumentText or DocumentHeading format statements, instead of
[srclink], use <a
href="/gsdl/collect/[collection]/index/assoc/{Or}{[parent(Top):assocfilepath],[assocfilepath]}/[srclink_file]">


A bit of a mouthful.

Cheers,
Katherine

William Mann wrote:
> After further testing, I've found that after my images are processed
> with the PagedImage plugin (I've removed the Text and Image plugins)
> the following formatstring items are empty: [link], [srclink] and
> [href] therefore Greenstone is defaulting the link to cgi-bin. I still
> haven't found the cause of this and any ideas are totally welcome!
> Thanks.
>
>
> Il 16/08/2010 15:02, William Mann ha scritto:
>> Hi,
>>
>> I'm having a problem with the srclink while using the PagedImage plugin:
>> that is, everything seems to work fine but when I click on the image to
>> go to the raw version I get a link like the following:
>> http://myhost:8282/greenstone/cgi-bin/image.jpg which obviously doesn't
>> exist. I'm using a hierarchical structure with the following structure:
>>
>> Collection->Area 1
>> Collection->Area 2
>> ...
>>
>> In Area 1 there are the original images with the .item file and in Area
>> 2 there are other images with their .item file.
>>
>> As stated, I can navigate the images but I can't get to the full-sized
>> image because it seems that the srclink tag isn't working correctly (or
>> I'm missing something which is more likely).
>>
>> Does anyone have any idea what I'm doing wrong? This is my first time
>> trying Greenstone and this has me stumped. I'm running Greenstone on an
>> Ubuntu 10.04 server. Any help would be greatly appreciated. Thanks.
>>
>>
>>
>
>

------------------------------

Message: 4
Date: Mon, 30 Aug 2010 11:37:49 +1200
From: Katherine Don <kjdon@cs.waikato.ac.nz>
Subject: Re: [greenstone-users] Problem in updating version from 2.72
to 2.83
To: Lavji Zala <zala@micamail.in>
Cc: greenstone-users@list.scms.waikato.ac.nz
Message-ID: <4C7AEF4D.1010407@cs.waikato.ac.nz>
Content-Type: text/plain; charset="iso-8859-1"

An HTML attachment was scrubbed...
URL:
https://list.scms.waikato.ac.nz/mailman/private/greenstone-users/attachments/20100830/c44ed211/attachment-0001.html


------------------------------

Message: 5
Date: Mon, 30 Aug 2010 11:56:23 +1200
From: Katherine Don <kjdon@cs.waikato.ac.nz>
Subject: Re: [greenstone-users] backups - Greenstone 3
To: Catherine Chambers <chambers.catherine@gmail.com>
Cc: greenstone-users@list.scms.waikato.ac.nz
Message-ID: <4C7AF3A7.2070205@cs.waikato.ac.nz>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Hi Catherine

What you'll need to backup will depend on what customisations have been
made.

For collection backups, all you need is the folder inside the collect
directory. For example, for the demo gs2mgppdemo collection, you'd need
to backup the web/sites/localsite/collect/gs2mgppdemo folder.

You may want to just back up the whole collect directory.

Or, if the collections are very large, then you could just back up the
source of the collection, not the whole thing. In this case, you'd need
probably all the folders inside the collection's folder (import, etc,
images..) but not archives, tmp or index. index contains the built
collection, while archives and tmp are used during building. If you
didn't save these, then you'd have to rebuild the collection from
scratch if it needed to be reinstated from the backup.

If there has been any interface customisations, then you'll need to back
up those too. This may be in web/interfaces. Have you got any record of
whether there has been software customisations or not?

Regards,
Katherine

Catherine Chambers wrote:
> Hi there
>
> I am new to Greenstone and have recently inherited responsibility for
> a Greenstone3 installation, running on Fedora 7.
> A recent hardware failure brought to my attention that there is no
> backup occurring. Unfortunately i have a dreadful internet connection
> that is making searching the list archives almost impossible.
> Can anyone tell me what exactly i need to backup in case of a
> hardware/software failure? or direct me to somewhere on the web that
> can tell me this information?
>
> TIA
> Catherine
> Systems Librarian : Mzuzu University (Malawi)
>
> _______________________________________________
> greenstone-users mailing list
> greenstone-users@list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>
>

------------------------------

Message: 6
Date: Mon, 30 Aug 2010 12:32:59 +1200
From: Katherine Don <kjdon@cs.waikato.ac.nz>
Subject: Re: [greenstone-users] How to create a list of Subjects.
To: Jay Clark <jclark@maf.org>
Cc: greenstone-users@list.scms.waikato.ac.nz
Message-ID: <4C7AFC3B.90604@cs.waikato.ac.nz>
Content-Type: text/plain; charset="iso-8859-1"

An HTML attachment was scrubbed...
URL:
https://list.scms.waikato.ac.nz/mailman/private/greenstone-users/attachments/20100830/27375709/attachment-0001.html


------------------------------

Message: 7
Date: Mon, 30 Aug 2010 12:50:24 +1200
From: Katherine Don <kjdon@cs.waikato.ac.nz>
Subject: Re: [greenstone-users] indexing problem by search engines
To: tigran@flib.sci.am
Cc: greenstone-users@list.scms.waikato.ac.nz
Message-ID: <4C7B0050.30707@cs.waikato.ac.nz>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Hi Tigran

If you have other sites that are already indexed that link to your
greenstone site, then Google should be able to find it and index it.

I think you can manually tell search engines about your site. Here is a
page from Google.
http://www.google.com/support/webmasters/bin/answer.py?answer=34397&cbid=-rid3n4nhpwr9&src=cb&lev=

index

Regards,
Katherine

Tigran Zargaryan wrote:
> Dear List,
> (Using Greenstone we are developing a very interesting repository of
> Armenian rare books (bibliographic description+images). How can I
> point serac engines (Google, Yahoo, etc) to index this repository?
> thanks for assistance,
> Tigran
>
> _______________________________________________
> greenstone-users mailing list
> greenstone-users@list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>

------------------------------

Message: 8
Date: Mon, 30 Aug 2010 12:55:59 +1200
From: Katherine Don <kjdon@cs.waikato.ac.nz>
Subject: Re: [greenstone-users] CD/DVD image
To: Paul Yachnes <pyachnes@nbtafoundation.org>
Cc: "greenstone-users@list.scms.waikato.ac.nz"
<greenstone-users@list.scms.waikato.ac.nz>
Message-ID: <4C7B019F.3070305@cs.waikato.ac.nz>
Content-Type: text/plain; charset="iso-8859-1"

An HTML attachment was scrubbed...
URL:
https://list.scms.waikato.ac.nz/mailman/private/greenstone-users/attachments/20100830/349247af/attachment.html


------------------------------

_______________________________________________
greenstone-users mailing list
greenstone-users@list.scms.waikato.ac.nz
https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users


End of greenstone-users Digest, Vol 89, Issue 16
************************************************


-------------- next part --------------
An HTML attachment was scrubbed...
URL: https://list.scms.waikato.ac.nz/mailman/private/greenstone-users/attachments/20100829/590b7703/attachment-0001.html