Re: Fwd: Re: [greenstone-users] performance with 150 collections

From Dominique Babini
DateFri, 10 Jun 2005 12:51:26 -0300
Subject Re: Fwd: Re: [greenstone-users] performance with 150 collections
In-Reply-To (5-1-1-6-0-20050610110812-00bb97f0-piluso-clacso-edu-ar)
Hi Michael,
Thank you for your reply. We have today 70 subcollections (which we
estimate will grow to 150 subcollection in two years) because we have 168
research institutes in 21 countries of Latin America and the Caribbean who
are members of our network and each member institute wants to have its own
collection besides being part of this cooperative portal of collections.
Today the performance when searching in 4.000 full-text documents in the
complete set of subcollections is good. We have configured Greenstone so
that it searches in the complete 4.000 full-text documents of these
subcollections, and results are shown in sets of 20 registers. Today we
have no problems.
In your last question (
>Could you

>achieve the functionality you need by combining all the subcollections

>into one large collection, and using metadata to maintain the splits?
) you are giving us a very good idea if performance is a problem when
our collections grow in the future.
Thank you for your help,
Dominique and Florencia from Buenos Aires
www.clacso.org.ar/biblioteca

>
>>X-Kaspersky: Checked
>>Envelope-to: vergara@clacso.edu.ar
>>Date: Fri, 10 Jun 2005 16:38:08 +1200
>>From: Michael Dewsnip <mdewsnip@cs.waikato.ac.nz>
>>User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.8)
>Gecko/20050516
>>X-Accept-Language: en-us, en
>>To: Dominique Babini <dbabini@campus.clacso.edu.ar>
>>Subject: Re: [greenstone-users] performance with 150 collections
>>Cc: greenstone-users@list.scms.waikato.ac.nz
>>X-BeenThere: greenstone-users@list.scms.waikato.ac.nz
>>X-Mailman-Version: 2.1.5
>>List-Id: Greenstone Digital Library Users List
>> <greenstone-users.list.scms.waikato.ac.nz>
>>List-Unsubscribe:
>><https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users>,
>>
>><mailto:greenstone-users-request@list.scms.waikato.ac.nz?subject=unsubscribe>
>>List-Archive:
>><https://list.scms.waikato.ac.nz/mailman/private/greenstone-users>
>>List-Post: <mailto:greenstone-users@list.scms.waikato.ac.nz>
>>List-Help:
>><mailto:greenstone-users-request@list.scms.waikato.ac.nz?subject=help>
>>List-Subscribe:
>><https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users>,
>>
>><mailto:greenstone-users-request@list.scms.waikato.ac.nz?subject=subscribe>
>>Sender: greenstone-users-bounces@list.scms.waikato.ac.nz
>>X-Spam-Checker-Version: SpamAssassin 3.0.3 (2005-04-27) on
>> piluso.clacso.edu.ar
>>X-Spam-Status: No, score=0.0 required=5.0 tests=none
>autolearn=unavailable
>> version=3.0.3
>>X-Spam-Level:
>>
>>Hi Dominique,
>>
>>I haven't heard of anyone with that many subcollections before, and I
>>wouldn't want to hazard a guess about the performance! We would be
>>interested in hearing about your experiences when your library is
>>complete, and adding it to this FAQ entry:
>>http://www.greenstone.org/cgi-bin/library?a=p&p=faqbuild#sizelimit. I
>>think the critical factor would be merging the results of searching each
>>collection, and how the performance scales would depend on how this is
>done.
>>
>>I'm a little curious why you need so many subcollections. Could you
>>achieve the functionality you need by combining all the subcollections
>>into one large collection, and using metadata to maintain the splits?
>>
>>Regards,
>>
>>Michael
>>
>>
>>
>>Dominique Babini wrote:
>>
>> >We would appreciate opinions/experiences about how efficient will
>> >Greenstone perform a search in the complete set of collections when we
>> >will have 150 collections with a total of 10.000 documents in 2007.
>> >Is having 150 individual collections a handicap for performance when
>> >searching a concept in the complete set of collections?
>> >Today we have 70 and 4.000 documents, we worry about our future
>performance
>> >Thank you for help on this issue
>> >Dominique and Florencia
>> >
>> >
>> >
>> >
>> >Red de Bibliotecas Virtuales de Ciencias Sociales
>> >de Am□rica Latina y el Caribe de la red CLACSO
>> >www.clacso.org.ar/biblioteca
>> >dbabini@campus.clacso.edu.ar
>> >
>> >Consejo Latinoamericano de Ciencias Sociales (CLACSO)
>> >Av. Callao 875, 3ro. E, (C1023 AAB) Buenos Aires, Argentina
>> >Tel.: (54-11) 4814-2301/4811-6588 Fax: (54-11) 4812-8459
>> >
>> >
>> >
>> >
>> >
>> >
>> >_______________________________________________
>> >greenstone-users mailing list
>> >greenstone-users@list.scms.waikato.ac.nz
>> >https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>> >
>> >
>> >
>>
>>
>>_______________________________________________
>>greenstone-users mailing list
>>greenstone-users@list.scms.waikato.ac.nz
>>https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>
>Florencia Vergara Rossi
>Bibilioteca - Clacso
>vergara@clacso.edu.ar
>http://www.clacso.org.ar/biblioteca
>
>

Dra. Dominique Babini
Coordinadora
Red de Bibliotecas Virtuales de Ciencias Sociales
de Am□rica Latina y el Caribe de la red CLACSO
www.clacso.org.ar/biblioteca
dbabini@campus.clacso.edu.ar

Consejo Latinoamericano de Ciencias Sociales (CLACSO)
Av. Callao 875, 3ro. E, (C1023 AAB) Buenos Aires, Argentina
Tel.: (54-11) 4814-2301/4811-6588 Fax: (54-11) 4812-8459