Coping with very large digital collections using Greenstone
Stefan Boddie, John Thompson, David Bainbridge, Ian H. Witten

References


1.  Witten, I.H. and Bainbridge, D. (2003) How to build a digital library. Morgan Kaufmann.

2.  http://www.greenstone.org

3.  http://www.greenstone.org/map

4.  The Google n-gram collection is available on six DVDs from http://www.ldc.upenn.edu/

5.  Witten, I.H., Moffat, A. and Bell, T. (1999) Managing gigabytes: compressing and indexing documents and images (second edition). Morgan Kaufmann.

6.  Littman, J. (2006) “A Technical Approach and Distributed Model for Validation of Digital Objects.” In D-Lib Magazine, 12(5).

7.  Reynaert, M. (2008) “Non-interactive OCR post-correction for Giga-scale digitization projects.” In Computational Linguistics and Intelligent Text Processing, LNCS Vol. 4919, Springer.

8.  http://www.greenstone.org/greenstone3-home

9.  http://www-306.ibm.com/software/data/db2/

10.  Ahuja, R. (2006) “DB2 9 unveiled: overview and new enhancements.” IBM White Paper.

11.  http://searchoracle.techtarget.com/sDefinition/0,,sid41_gci214145,00.html

12.  Josifovski, V., Schwarz, P., Haas, L. and Lin, E. (2002) “Garlic: a new flavor of federated query processing for DB2.” Proc. ACM SIGMOD Int. Conf. on Management of Data, Madison, Wisconsin, pp. 524-532.

13.  http://www-306.ibm.com/software/data/db2/express/

14.  http://www-306.ibm.com/software/data/db2/extenders/netsearch/

15.  http://search.cpan.org/~mferris/Class-DBI-DB2-0.16/

16.  http://dbconnect.sourceforge.net/