Re: [greenstone-users] Tool for writing item files - Paged Image collection - pagedImage plugin -

From Tomáš Fiala
DateWed, 10 Jan 2007 15:48:13 +0100
Subject Re: [greenstone-users] Tool for writing item files - Paged Image collection - pagedImage plugin -
In-Reply-To (200701101429-l0AETFm3002114-gamma-ulib-sk)
Hi Diego,

to do some renaming of extensions and some editing in .item files is for
me fast and easy. The program is still very valuable for me, because it
saves my time. I hope to see your new version soon.

Again, thanks for developing and sharing it !

Sincerely,

Tomáš Fiala

--
Department of digitization
University library Bratislava
http://www.ulib.sk

Diego Spano wrote / napísal(a):
> Tomas, you are right, it only works with tiffs files. Modify the program to
> support other formats is in my todo list, I only need time!!! In the
> meanwhile, if you rename jpg files as tif files you can show it the same way
> and you can run the program without problems.
>
> Diego
>
> -----Mensaje original-----
> De: Tomáš Fiala [mailto:tomas.fiala@ulib.sk]
> Enviado el: Miércoles, 10 de Enero de 2007 05:42 a.m.
> Para: Diego Spano
> CC: greenstone-users@list.scms.waikato.ac.nz
> Asunto: Re: [greenstone-users] Tool for writing item files - Paged Image
> collection - pagedImage plugin -
>
> Hi Diego,
>
> thank you for your help and for creating an easy and fast program for
> generating .item files !
>
> It will be a big help for me ! The only problem about it is that it process
> just files with .tif extension.
> I tried to process .gif and .jpg, but then hhmmssn.item wasn't created.
> Please would it be possible to make the program work with other extensions ?
> Maybe the UTF-8 support would be also nice, but i think it not really
> necessary.
>
> Again, BIG THANKS FOR YOUR WORK ! It will save time to many people !
>
> Sincerely,
>
> Tomáš Fiala
>
>
>
> Diego Spano wrote / napísal(a):
>
>> Tomas, I´m sending you the program as a zip file.
>>
>> Let me explain how dows it works.
>>
>> 1- just open a cmd window and go to collect\your_collect\import
>> folder (I assume you have documents there. If not go to where they
>> are!)
>> 2- Run the following command: "dir /b /s >item.dir". This will
>> generate a txt file containing all folders with images and texts
>> inside them. Close cmd window.
>> 3- Copy indexador.exe in collect\your_collect\import folder.
>> 4- Run indexador.exe.
>> 5- The program lets you add 4 metadata to your documents. The first
>> one is Title and I´t cannot be disabled. If you wish to add more
>> metadata simply enable the check box and write the metadata name.
>> 6- The program begins reading item.dir and will use last part of the
>> path of each line as the document title. Suppose you have the following
>>
> item.dir:
>
>> \import\book name 1\image1.tif
>> \import\book name 1\image1.txt
>> \import\book name 1\image2.tif
>> \import\book name 1\image2.txt
>> \import\book name 1\image3.tif
>> \import\book name 1\image3.txt
>> \import\book name 2\image1.tif
>> \import\book name 2\image1.txt
>> \import\book name 2\image2.tif
>> \import\book name 2\image2.txt
>> \import\book name 3\image1.tif
>> \import\book name 3\image1.txt
>> \import\book name 3\image2.tif
>> \import\book name 3\image2.txt
>> \import\book name 3\image3.tif
>> \import\book name 3\image3.txt
>>
>> The last part of the path for the first line is "book name 1" so the
>> program assume that this is the document title. Obviously, you can
>> edit metadata value. When you press start button the program read
>> item.dir until it finds that the path has changed, this indicates that
>> "book name 2" starts. So, title changes too. And this repeats until
>> end of file.You will find inside each folder a file named hhmmssn.item
>> (the name is the creation time of the
>> file) that contains metadata and a list of pages descriptions.
>>
>> Well, hope this help you. Let me know if you have any problem.
>>
>> Regards
>>
>> Diego Spano
>> Archivo Digital
>> Secretaria de DD. HH.
>> Ministerio de Justicia y DD. HH.
>> Tel.: 5167-6550
>>
>>
>>
>> -----Mensaje original-----
>> De: Tomáš Fiala [mailto:tomas.fiala@ulib.sk] Enviado el: Lunes, 08 de
>> Enero de 2007 03:41 a.m.
>> Para: Diego Spano
>> CC: greenstone-users@list.scms.waikato.ac.nz
>> Asunto: Re: [greenstone-users] Tool for writing item files - Paged
>> Image collection - pagedImage plugin -
>>
>> Hello Diego,
>>
>> fantastic ! That is the solution I was looking for ! Please allow me
>> to thank you for developing this cool program. Without it, using
>> PagedImage plug would be much harder for me.
>>
>> Please send me the program to following emails: tomas.fiala@ulib.sk ;
>> tom.fiala@gmail.com. (I hope that the filters will not delete the
>> attachment) or just upload it to http://rapidshare.com/ a send me the
>>
> link.
>
>> Again, big thanks !
>>
>> Sincerely,
>>
>> Tomas Fiala
>>
>>
>> Diego Spano wrote / napísal(a):
>>
>>
>>> Hi Tomas, I developed a little program that creates .item files with
>>> the metadata you want to assign. With this program we have procesed
>>> thousands of images. How does this program work? Suppose you have
>>> this
>>>
>>>
>> folder structure:
>>
>>
>>> \import\doc 1\image1.tif
>>> \import\doc 1\image1.txt
>>> \import\doc 1\image2.tif
>>> \import\doc 1\image2.txt
>>> \import\doc 1\image3.tif
>>> \import\doc 1\image3.txt
>>> \import\doc 2\image1.tif
>>> \import\doc 2\image1.txt
>>> \import\doc 2\image2.tif
>>> \import\doc 2\image2.txt
>>> \import\doc 3\image1.tif
>>> \import\doc 3\image1.txt
>>> \import\doc 3\image2.tif
>>> \import\doc 3\image2.txt
>>> \import\doc 3\image3.tif
>>> \import\doc 3\image3.txt
>>>
>>> You have folders and inside them you have tiffs and txt files. Both
>>> files have the same name and no matter how many files you have inside
>>> them. In the example i named the files imagex.txt and imagex.tif but
>>> you can put the name you want, you only have to take in account that
>>> image and text files MUST have the same name.
>>>
>>> The program will create a.item file inside doc1, doc2 and doc3
>>> folders. Is a .exe file, so you have to run it in Windows. If your
>>> scenario is like mine, then I will send you the executable file. Let
>>> me
>>>
>>>
>> know!!!
>>
>>
>>> Bye
>>>
>>> Diego Spano
>>> Archivo Digital
>>> Secretaria de DD. HH.
>>> Ministerio de Justicia y DD. HH.
>>> Tel.: 5167-6550
>>>
>>>
>>>
>>> -----Mensaje original-----
>>> De: greenstone-users-bounces@list.scms.waikato.ac.nz
>>> [mailto:greenstone-users-bounces@list.scms.waikato.ac.nz] En nombre
>>> de Tomáš Fiala Enviado el: Jueves, 04 de Enero de 2007 07:32 a.m.
>>> Para: greenstone-users@list.scms.waikato.ac.nz
>>> Asunto: [greenstone-users] Tool for writing item files - Paged Image
>>> collection - pagedImage plugin -
>>>
>>> Hello,
>>>
>>> I am creating pagedimage collection. I have a book with 500 pages
>>> (Images+OCR) and its very uneasy to write .item files manually.
>>>
>>> Please, does anyone know a tool which generates the .item files
>>> automatically ???
>>>
>>> What are the other ways of putting OCR+Image files together ?
>>>
>>> Many thanks for your help !
>>>
>>> Sincerely,
>>>
>>> Tomas Fiala
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>
> --
> Department of digitization
> University library Bratislava
> http://www.ulib.sk
>
>
>
>