From | Mehrling, Martin |
Date | Mon Aug 31 22:52:11 2009 |
Subject | [greenstone-users] RE: [greenstone-devel] distributed storage for objects? |
In-Reply-To | (004d01ca1d09$4a1546e0$de3fd4a0$-gov-ar) |
Diego,
Thanks again for your help! I'm using version 2.81 so perhaps that's an issue. I also tried using the UNC path instead of the drive letter and that didn't work at all. The objects aren't being pulled into the GLI, so I think that must have something to do with it... I will keep trying things and will send the solution once I figure it out. Martin ************************************
From: Diego Spano [mailto:dspano@orsna.gov.ar]
Martin, What GS versions have you?. I□m using GS2.81 on linux. So I wrote a little script to do the incremental import/build on a daily basis. The script is the following: **************************************************************
The script do the following tasks:
2- Run import process for collection "auditoria" taking the files from /windows/auditoria/import and saving archive files on /archives/auditoria (/archive is a file system mounted on a separate disk) 3- Run build process reading archive files /archives/auditoria and saving indexes in /gsdl/collect/auditoria/index (this needed fo incremental indexing) 4- Because I don□t want to have a copy of archives files in /index/assoc, I modified the basebuildproc.pm, just to avoid the copy process from archives to assoc. To get access to source documents I simply make a link from /index/assoc to /archives/auditoria and thats all. I know that in Windows I had an error like you comment but I can□t remember on which GS version. In that moment the error was solved by Waikato and it should be ok in newer versions. After your message, I made a fast test in windows with 2.82 and it seems to be working OK, but I did not dig too much.... Regards! Diego
Diego, Very helpful! Thanks! It works well on the same drive but when I try different drives (Windows Server 2003) it kind of works but not as well. I'm getting errors like this: import.pl> util::hard_link: unable to create hard link. Attempting to copy file: C:Program FilesGreenstone2 mpF53.gif -> D:GstoneColls2archivesHASHfb50.dirmoonflag_thumb.gif Should I not use drive letters? I also want to read/write to different machines. Do you know why this is raising an error? Could you let me know what values you are using for 'importdir' and 'archivedir'? Thanks a lot,
************************************
From: Diego Spano [mailto:dspano@orsna.gov.ar]
Martin, In GS2 you have 3 folders that requieres disk space: import, archive and index. The index folder MUST be inside the collection□s root folder. You can□t separate it. But the other two yes. The import process has two important options: -importdir: indicates from where to take the source documents. It can be another disk or another machine.
The build process has the option -archivedir too, and shoul be set to the same path that you use in import process. Let me comment you about a digital library I set here: GS was on installed on a linux server and I mounted a Windows file system over a linux filesystem. The server has 2 disks (not two partitions but two separate disks). When I run the import process the source documents are read from the windows file system mounted locally (so, no space wasted on linux disk) and the archives are generated on the second disk. Then the buildprocess take documents from that disk and generates indexes on the maind hard disk. This way you vae separate the 3 parts of the process, and even more, having the archives and indexes in differents disks is a way to get a better performance. Hope this helps you. Diego Spano De: greenstone-devel-bounces@list.scms.waikato.ac.nz [mailto:greenstone-devel-bounces@list.scms.waikato.ac.nz] En nombre de Mehrling, Martin
Hi Folks, We are running out of space on our Greenstone server and were wondering how other Greenstone sites deal with this problem. I know Greenstone3 has a more distributed model, but we aren't ready to move to that version just yet. Have any of you figured out a way to store objects in multiple locations? Thanks a lot for any ideas! Martin ************************************
|