Re: [greenstone-users] Tool for writing item files - Paged Image collection - pagedImage plugin -

From Tomáš Fiala
DateTue, 09 Jan 2007 13:42:11 +0100
Subject Re: [greenstone-users] Tool for writing item files - Paged Image collection - pagedImage plugin -
In-Reply-To (!~!UENERkVCMDkAAQACAAAAAAAAAAAAAAAAABgAAAAAAAAAyqR8sqfeLUK9jdmD6lzpfOKAAAAQAAAArb1StSFewEyLDVOUihvINgEAAAAA-jus-gov-ar)
Hello Diego,

thank you for for your email. I received the .zip file, but inside it
was just empty folder.

I also received some email which says that the file was removed.

Please upload it on some free www upload service and send me the link.

Many thanks,

sincerely,

Tomas Fiala

Diego Spano wrote / napísal(a):
> Tomas, I´m sending you the program as a zip file.
>
> Let me explain how dows it works.
>
> 1- just open a cmd window and go to collect\your_collect\import folder (I
> assume you have documents there. If not go to where they are!)
> 2- Run the following command: "dir /b /s >item.dir". This will generate a
> txt file containing all folders with images and texts inside them. Close cmd
> window.
> 3- Copy indexador.exe in collect\your_collect\import folder.
> 4- Run indexador.exe.
> 5- The program lets you add 4 metadata to your documents. The first one is
> Title and I´t cannot be disabled. If you wish to add more metadata simply
> enable the check box and write the metadata name.
> 6- The program begins reading item.dir and will use last part of the path of
> each line as the document title. Suppose you have the following item.dir:
>
> \import\book name 1\image1.tif
> \import\book name 1\image1.txt
> \import\book name 1\image2.tif
> \import\book name 1\image2.txt
> \import\book name 1\image3.tif
> \import\book name 1\image3.txt
> \import\book name 2\image1.tif
> \import\book name 2\image1.txt
> \import\book name 2\image2.tif
> \import\book name 2\image2.txt
> \import\book name 3\image1.tif
> \import\book name 3\image1.txt
> \import\book name 3\image2.tif
> \import\book name 3\image2.txt
> \import\book name 3\image3.tif
> \import\book name 3\image3.txt
>
> The last part of the path for the first line is "book name 1" so the program
> assume that this is the document title. Obviously, you can edit metadata
> value. When you press start button the program read item.dir until it finds
> that the path has changed, this indicates that "book name 2" starts. So,
> title changes too. And this repeats until end of file.You will find inside
> each folder a file named hhmmssn.item (the name is the creation time of the
> file) that contains metadata and a list of pages descriptions.
>
> Well, hope this help you. Let me know if you have any problem.
>
> Regards
>
> Diego Spano
> Archivo Digital
> Secretaria de DD. HH.
> Ministerio de Justicia y DD. HH.
> Tel.: 5167-6550
>
>
>
> -----Mensaje original-----
> De: Tomáš Fiala [mailto:tomas.fiala@ulib.sk]
> Enviado el: Lunes, 08 de Enero de 2007 03:41 a.m.
> Para: Diego Spano
> CC: greenstone-users@list.scms.waikato.ac.nz
> Asunto: Re: [greenstone-users] Tool for writing item files - Paged Image
> collection - pagedImage plugin -
>
> Hello Diego,
>
> fantastic ! That is the solution I was looking for ! Please allow me to
> thank you for developing this cool program. Without it, using PagedImage
> plug would be much harder for me.
>
> Please send me the program to following emails: tomas.fiala@ulib.sk ;
> tom.fiala@gmail.com. (I hope that the filters will not delete the
> attachment) or just upload it to http://rapidshare.com/ a send me the link.
>
> Again, big thanks !
>
> Sincerely,
>
> Tomas Fiala
>
>
> Diego Spano wrote / napísal(a):
>
>> Hi Tomas, I developed a little program that creates .item files with
>> the metadata you want to assign. With this program we have procesed
>> thousands of images. How does this program work? Suppose you have this
>>
> folder structure:
>
>> \import\doc 1\image1.tif
>> \import\doc 1\image1.txt
>> \import\doc 1\image2.tif
>> \import\doc 1\image2.txt
>> \import\doc 1\image3.tif
>> \import\doc 1\image3.txt
>> \import\doc 2\image1.tif
>> \import\doc 2\image1.txt
>> \import\doc 2\image2.tif
>> \import\doc 2\image2.txt
>> \import\doc 3\image1.tif
>> \import\doc 3\image1.txt
>> \import\doc 3\image2.tif
>> \import\doc 3\image2.txt
>> \import\doc 3\image3.tif
>> \import\doc 3\image3.txt
>>
>> You have folders and inside them you have tiffs and txt files. Both
>> files have the same name and no matter how many files you have inside
>> them. In the example i named the files imagex.txt and imagex.tif but
>> you can put the name you want, you only have to take in account that
>> image and text files MUST have the same name.
>>
>> The program will create a.item file inside doc1, doc2 and doc3
>> folders. Is a .exe file, so you have to run it in Windows. If your
>> scenario is like mine, then I will send you the executable file. Let me
>>
> know!!!
>
>> Bye
>>
>> Diego Spano
>> Archivo Digital
>> Secretaria de DD. HH.
>> Ministerio de Justicia y DD. HH.
>> Tel.: 5167-6550
>>
>>
>>
>> -----Mensaje original-----
>> De: greenstone-users-bounces@list.scms.waikato.ac.nz
>> [mailto:greenstone-users-bounces@list.scms.waikato.ac.nz] En nombre de
>> Tomáš Fiala Enviado el: Jueves, 04 de Enero de 2007 07:32 a.m.
>> Para: greenstone-users@list.scms.waikato.ac.nz
>> Asunto: [greenstone-users] Tool for writing item files - Paged Image
>> collection - pagedImage plugin -
>>
>> Hello,
>>
>> I am creating pagedimage collection. I have a book with 500 pages
>> (Images+OCR) and its very uneasy to write .item files manually.
>>
>> Please, does anyone know a tool which generates the .item files
>> automatically ???
>>
>> What are the other ways of putting OCR+Image files together ?
>>
>> Many thanks for your help !
>>
>> Sincerely,
>>
>> Tomas Fiala
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>