Re: [greenstone-users] Tool for writing item files - Paged Image collection - pagedImage plugin -

From Tomáš Fiala
Date Wed, 10 Jan 2007 09:42:03 +0100
Subject Re: [greenstone-users] Tool for writing item files - Paged Image collection - pagedImage plugin -
In-Reply-To (!~!UENERkVCMDkAAQACAAAAAAAAAAAAAAAAABgAAAAAAAAAyqR8sqfeLUK9jdmD6lzpfOKAAAAQAAAArb1StSFewEyLDVOUihvINgEAAAAA-jus-gov-ar)
Hi Diego,

thank you for your help and for creating an easy and fast program for generating .item files!

It will be a big help for me! The only problem is that it processes only files with the .tif extension.
I tried to process .gif and .jpg files, but then no hhmmssn.item file was created. Would it be possible to make the
program work with other extensions? UTF-8 support would also be nice, but I think it is not really necessary.

Again, BIG THANKS FOR YOUR WORK! It will save many people time!

Sincerely,

Tomáš Fiala

Diego Spano wrote:
> Tomas, I'm sending you the program as a zip file.
>
> Let me explain how it works.
>
> 1- Open a cmd window and go to the collect\your_collect\import folder (I
> assume your documents are there. If not, go to where they are!)
> 2- Run the following command: "dir /b /s >item.dir". This generates a
> text file listing all the folders with the images and texts inside them.
> Close the cmd window.
> 3- Copy indexador.exe into the collect\your_collect\import folder.
> 4- Run indexador.exe.
> 5- The program lets you add 4 metadata fields to your documents. The first
> one is Title and it cannot be disabled. If you wish to add more metadata,
> simply enable the check box and write the metadata name.
> 6- The program begins reading item.dir and uses the last part of the path of
> each line as the document title. Suppose you have the following item.dir:
>
> import\book name 1\image1.tif
> import\book name 1\image1.txt
> import\book name 1\image2.tif
> import\book name 1\image2.txt
> import\book name 1\image3.tif
> import\book name 1\image3.txt
> import\book name 2\image1.tif
> import\book name 2\image1.txt
> import\book name 2\image2.tif
> import\book name 2\image2.txt
> import\book name 3\image1.tif
> import\book name 3\image1.txt
> import\book name 3\image2.tif
> import\book name 3\image2.txt
> import\book name 3\image3.tif
> import\book name 3\image3.txt
>
> The last part of the path for the first line is "book name 1", so the program
> assumes that this is the document title. You can, of course, edit the
> metadata value. When you press the Start button, the program reads item.dir
> until it finds that the path has changed, which indicates that "book name 2"
> starts, so the title changes too. This repeats until the end of the file.
> Inside each folder you will find a file named hhmmssn.item (the name is the
> creation time of the file) that contains the metadata and a list of page
> descriptions.
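
The grouping described above can be sketched in Python. This is not the code
of indexador.exe, whose source is not shown in this thread; it is a minimal
sketch under stated assumptions. The function names (group_pages, build_item)
and the IMAGE_EXTS set are hypothetical, the listing is assumed to use
backslash-separated paths as produced by "dir /b /s", and the item-file
layout ("<Name>value" metadata lines followed by "page:image:text" lines) is
the one I understand the Greenstone PagedImage plugin to read, so please
check it against the plugin documentation for your Greenstone version.

```python
from pathlib import PureWindowsPath

# Assumption: image extensions to accept. The original tool reportedly
# handles only .tif; the others are added here for illustration.
IMAGE_EXTS = {".tif", ".tiff", ".gif", ".jpg", ".jpeg"}


def group_pages(listing_lines):
    """Group image names from a `dir /b /s` style listing by parent folder.

    Returns a dict mapping each folder (PureWindowsPath) to the list of
    image file names found in it, in listing order.
    """
    groups = {}
    for raw in listing_lines:
        line = raw.strip()
        if not line:
            continue
        path = PureWindowsPath(line)
        if path.suffix.lower() not in IMAGE_EXTS:
            continue  # skip .txt files, folders and anything else
        groups.setdefault(path.parent, []).append(path.name)
    return groups


def build_item(folder, images, metadata=None):
    """Return the text of one item file for `folder`.

    The title defaults to the folder name and can be overridden through
    `metadata`. Each page line assumes a text file with the same base name
    as the image exists next to it.
    """
    meta = {"Title": folder.name}
    meta.update(metadata or {})
    lines = [f"<{name}>{value}" for name, value in meta.items()]
    for number, image in enumerate(images, start=1):
        text = PureWindowsPath(image).with_suffix(".txt").name
        lines.append(f"{number}:{image}:{text}")
    return "\n".join(lines) + "\n"
```

To use it, read item.dir into a list of lines, call group_pages on it, and
write the result of build_item for each folder to a file ending in .item
inside that folder. Pages are numbered in listing order, so names such as
image10.tif may need zero-padding to sort after image9.tif.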
>
> Well, I hope this helps you. Let me know if you have any problems.
>
> Regards
>
> Diego Spano
> Archivo Digital
> Secretaria de DD. HH.
> Ministerio de Justicia y DD. HH.
> Tel.: 5167-6550
>
>
>
> -----Original Message-----
> From: Tomáš Fiala [mailto:tomas.fiala@ulib.sk]
> Sent: Monday, 08 January 2007 03:41 a.m.
> To: Diego Spano
> CC: greenstone-users@list.scms.waikato.ac.nz
> Subject: Re: [greenstone-users] Tool for writing item files - Paged Image
> collection - pagedImage plugin -
>
> Hello Diego,
>
> fantastic! That is the solution I was looking for! Please allow me to
> thank you for developing this cool program. Without it, using the PagedImage
> plugin would be much harder for me.
>
> Please send the program to the following addresses: tomas.fiala@ulib.sk ;
> tom.fiala@gmail.com (I hope that the filters will not delete the
> attachment), or just upload it to http://rapidshare.com/ and send me the link.
>
> Again, big thanks !
>
> Sincerely,
>
> Tomas Fiala
>
>
> Diego Spano wrote:
>
>> Hi Tomas, I developed a little program that creates .item files with
>> the metadata you want to assign. With this program we have processed
>> thousands of images. How does this program work? Suppose you have this
>> folder structure:
>>
>> import\doc 1\image1.tif
>> import\doc 1\image1.txt
>> import\doc 1\image2.tif
>> import\doc 1\image2.txt
>> import\doc 1\image3.tif
>> import\doc 1\image3.txt
>> import\doc 2\image1.tif
>> import\doc 2\image1.txt
>> import\doc 2\image2.tif
>> import\doc 2\image2.txt
>> import\doc 3\image1.tif
>> import\doc 3\image1.txt
>> import\doc 3\image2.tif
>> import\doc 3\image2.txt
>> import\doc 3\image3.tif
>> import\doc 3\image3.txt
>>
>> You have folders, and inside them you have TIFF and text files. Each image
>> and its text file share the same name, and it does not matter how many
>> files a folder contains. In the example I named the files imagex.txt and
>> imagex.tif, but you can use any names you want; you only have to take into
>> account that image and text files MUST have the same name.
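
The same-name rule can be checked before building a collection. The Python
sketch below pairs each image with the text file that shares its base name
and reports files that have no partner. The function name pair_pages and the
IMAGE_EXTS set are hypothetical, and accepting .gif and .jpg alongside .tif
is an assumption made for illustration, not a description of the program
discussed in this thread.

```python
from pathlib import PureWindowsPath

# Assumption: which extensions count as page images.
IMAGE_EXTS = {".tif", ".gif", ".jpg"}


def pair_pages(filenames):
    """Pair each image with the text file sharing its base name.

    Returns (pairs, unmatched): `pairs` is a list of (image, text) tuples
    ordered by base name; `unmatched` is a sorted list of files for which
    no partner was found.
    """
    images, texts = {}, {}
    for name in filenames:
        path = PureWindowsPath(name)
        suffix = path.suffix.lower()
        if suffix in IMAGE_EXTS:
            images[path.stem] = name
        elif suffix == ".txt":
            texts[path.stem] = name
    pairs = [(images[stem], texts[stem]) for stem in sorted(images) if stem in texts]
    unmatched = sorted(
        [images[stem] for stem in images if stem not in texts]
        + [texts[stem] for stem in texts if stem not in images]
    )
    return pairs, unmatched
```

Base names are compared exactly as written, so Image1.tif and image1.txt
would not be paired; lower-casing the stem first is one way to relax that.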
>>
>> The program will create a .item file inside the doc 1, doc 2 and doc 3
>> folders. It is an .exe file, so you have to run it on Windows. If your
>> scenario is like mine, then I will send you the executable file. Let me
>> know!!!
>>
>> Bye
>>
>> Diego Spano
>> Archivo Digital
>> Secretaria de DD. HH.
>> Ministerio de Justicia y DD. HH.
>> Tel.: 5167-6550
>>
>>
>>
>> -----Original Message-----
>> From: greenstone-users-bounces@list.scms.waikato.ac.nz
>> [mailto:greenstone-users-bounces@list.scms.waikato.ac.nz] On behalf of
>> Tomáš Fiala
>> Sent: Thursday, 04 January 2007 07:32 a.m.
>> To: greenstone-users@list.scms.waikato.ac.nz
>> Subject: [greenstone-users] Tool for writing item files - Paged Image
>> collection - pagedImage plugin -
>>
>> Hello,
>>
>> I am creating a paged image collection. I have a book with 500 pages
>> (images + OCR) and it is very tedious to write the .item files manually.
>>
>> Does anyone know of a tool that generates the .item files
>> automatically?
>>
>> What other ways are there of putting the OCR and image files together?
>>
>> Many thanks for your help !
>>
>> Sincerely,
>>
>> Tomas Fiala

--
Department of digitization
University library Bratislava
http://www.ulib.sk