[greenstone-users] Collection size anomaly

From Jay Clark
Date Thu Aug 26 03:32:31 2010
Subject [greenstone-users] Collection size anomaly
In-Reply-To (002201cb445c$254303e0$6fc90ba0$-gov-ar)
Hi Diego,
So, if I understand you correctly, Greenstone stores 3 separate copies of each digital resource in its original form, plus another in plain text? Are my collections 3 times larger than they need to be?

I see this as a huge issue for hard drive space requirements when creating a library: there are 3 copies of a resource in the library, plus the original if it was on your hard drive in the first place. Is this issue going to be addressed in subsequent versions of Greenstone? Is there a way to safely pare this down to just one copy of the resource in the library (not including the plain text version)?

Thank you,
Jay Clark

MAF Learning Technologies

-----Original Message-----
From: greenstone-users-bounces@list.scms.waikato.ac.nz [mailto:greenstone-users-bounces@list.scms.waikato.ac.nz] On Behalf Of Diego Spano
Sent: Wednesday, August 25, 2010 7:48 AM
To: steve@arlis.org; greenstone-users@list.scms.waikato.ac.nz
Subject: RE: [greenstone-users] Collection size anomaly

Steve, the way Greenstone stores all objects and indexes is the same as in other applications. If you import a 2 MB PDF, then you will have 2 MB in the import folder, 2 MB in archives (after the import process), 2 MB in index/assoc (after the buildcol process), and a few more KB (the full text index).
Greenstone relies on the operating system. Sometimes Windows is not good at reporting file sizes and free space (or at other things too!). Perhaps you can install third-party software to get the correct values for disk space.
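
As a rough cross-check that does not depend on the Windows property sheet, the per-folder figures described above can be totalled with a short script. This is only a sketch: the subfolder names (import, archives, index) are assumptions based on the message above, and the function names are made up for illustration. Adjust both to match your installation.

```python
import os


def folder_size_bytes(path):
    """Sum the sizes of all regular files under path (symlinks are skipped)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            if os.path.isfile(full) and not os.path.islink(full):
                total += os.path.getsize(full)
    return total


def collection_breakdown(collection_dir,
                         subfolders=("import", "archives", "index")):
    """Return {subfolder name: size in bytes} for one collection folder.

    The default subfolder names are an assumption; a missing folder
    simply reports 0 bytes.
    """
    return {
        name: folder_size_bytes(os.path.join(collection_dir, name))
        for name in subfolders
    }
```

Calling collection_breakdown() with the path to the collection folder inside collect gives one byte count per subfolder, which makes it easy to see whether the total really is about three times the size of import.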

Try this: http://www.glenn.delahoy.com/software/files/DiskAnalyser201.zip

It is very easy to install.

Hope this helps.

Diego Spano
Prodigio Consultores
Bernardo de Irigoyen Nº 1114 2ºB
Capital Federal - Argentina
Tel: (54 11) 5093-5313

-----Original Message-----
From: greenstone-users-bounces@list.scms.waikato.ac.nz
[mailto:greenstone-users-bounces@list.scms.waikato.ac.nz] On Behalf Of Steve Johnson
Sent: Tuesday, August 24, 2010 8:45 PM
To: greenstone-users@list.scms.waikato.ac.nz
Subject: [greenstone-users] Collection size anomaly

Should one expect anomalies when checking the total size of a Greenstone collection, perhaps particularly in a Windows environment? I have a Greenstone 2.82 collection which, according to Windows file manager, is larger than the total space-in-use on the drive on which Greenstone resides.

I built the collection on a Windows XP desktop machine. The size of the collection displays as 145 gig (on the property sheet for the named collection folder, inside the collect folder.) The computer contains one 160 gig disk, which shows 90 gig as being free and 70 gig in use.

The 4982 import files, in 780 folders, in this Greenstone collection occupy 46 gig. I am getting ready to move this collection to a Linux development server for testing prior to deployment on a live Linux server.
An accurate take on collection size is obviously important. Am I missing something about the way Greenstone organizes files on Windows systems?
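
One possible reason a folder can appear larger than the space in use on its drive is that several directory entries point at the same data on disk (hard links), so a tool that simply adds up every entry counts that data more than once. Whether that applies to this collection is an assumption, not something established in this thread. The sketch below (the function name is made up for illustration) reports both the naive total and a total that counts each underlying file once.

```python
import os
import stat


def apparent_and_unique_size(path):
    """Return (apparent_bytes, unique_bytes) for all regular files under path.

    apparent_bytes adds up every directory entry; unique_bytes counts each
    (device, inode) pair once, so hard-linked copies are not double-counted.
    """
    apparent = 0
    unique = 0
    seen = set()
    for root, _dirs, files in os.walk(path):
        for name in files:
            st = os.lstat(os.path.join(root, name))
            if not stat.S_ISREG(st.st_mode):
                continue  # skip symlinks and special files
            apparent += st.st_size
            if st.st_ino != 0:
                key = (st.st_dev, st.st_ino)
                if key in seen:
                    continue  # same data already counted under another name
                seen.add(key)
            # An inode of 0 means no file identity is available,
            # so the file is treated as unique.
            unique += st.st_size
    return apparent, unique
```

If the two numbers differ substantially, the smaller one is the better estimate of the space the collection occupies on its current drive. Copying to another machine may still need the larger amount, depending on whether the copy tool preserves hard links.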

I would appreciate any pointers or comments, on or off list. I did not find this subject in the mailing list archives.

Steve Johnson
Systems Coordinator/Management Team
Alaska Resources Library & Information Services (ARLIS)
steve@arlis.org

greenstone-users mailing list
