|Date||Thu Aug 26 03:32:31 2010|
|Subject||[greenstone-users] Collection size anomaly|
So, if I understand you correctly, Greenstone stores 3 separate copies of each digital resource in its original form and another in plain text? Are my collections 3 times larger than they need to be?
I see this is a huge issue for hard drive space requirements when creating a library - to have 3 copies of a resource in the library, plus the original if it was on your hard drive in the first place. Is this issue going to be addressed in subsequent versions of Greenstone? Is there a way to safely pare this down to just one copy of the resource in the library (not including plain text version)?
MAF Learning Technologies
Steve, the way Greenstone stores all objects and indexes is the same as other applications do. If you import a 2 MB PDF, then you will have 2 MB in import folder, 2 MB in archive (after import process), 2 MB in indexassoc (after buildcol process), and a few more KB (the full text index).
It is very easy to install.
Hope this helps.
Should one expect anomalies when checking the total size of a Greenstone collection, perhaps particularly in a Windows environment? I have a Greenstone 2.82 collection which, according to Windows file manager, is larger than the total space-in-use on the drive on which Greenstone resides.
I built the collection on a Windows XP desktop machine. The size of the collection displays as 145 gig (on the property sheet for the named collection folder, inside the collect folder.) The computer contains one 160 gig disk, which shows 90 gig as being free and 70 gig in use.
The 4982 import files. in 780 folders, in this Greenstone collection occupy 46 gig. I am getting ready to move this collection to a Linux development server for testing prior to deployment on a live Linux server.
I would appreciate any pointers or comments, on or off list. I did not find this subject in the mailing list archives.