Re: [greenstone-users] Tool for writing item files - Paged Image collection - pagedImage plugin -

From Tomáš Fiala
Date Wed, 10 Jan 2007 11:25:22 +0100
Subject Re: [greenstone-users] Tool for writing item files - Paged Image collection - pagedImage plugin -
In-Reply-To (45A4A6DB-2060300-ulib-sk)
Hello,

I just realized that UTF-8 is already supported. Sorry about that.

Tomáš

Tomáš Fiala wrote:
> Hi Diego,
>
> thank you for your help and for creating an easy and fast program for
> generating .item files!
> It will be a big help for me! The only problem is that it
> processes only files with the .tif extension.
> I tried to process .gif and .jpg files, but then hhmmssn.item wasn't
> created. Would it be possible to make the program work with
> other extensions? UTF-8 support would also be nice, but I
> don't think it is really necessary.
> Again, BIG THANKS FOR YOUR WORK! It will save many people time!
>
> Sincerely,
>
> Tomáš Fiala
>
>
>
> Diego Spano wrote:
>> Tomas, I'm sending you the program as a zip file.
>>
>> Let me explain how it works.
>> 1- Open a cmd window and go to the collect\your_collect\import folder
>> (I assume your documents are there. If not, go to where they are!)
>> 2- Run the following command: "dir /b /s >item.dir". This will generate
>> a text file listing all the folders with the images and texts inside
>> them. Close the cmd window.
>> 3- Copy indexador.exe into the collect\your_collect\import folder.
>> 4- Run indexador.exe.
>> 5- The program lets you add 4 metadata fields to your documents. The
>> first one is Title and it cannot be disabled. If you wish to add more
>> metadata, simply enable the check box and write the metadata name.
>> 6- The program begins reading item.dir and uses the last part of the
>> path of each line as the document title. Suppose you have the
>> following item.dir:
>>
>> import\book name 1\image1.tif
>> import\book name 1\image1.txt
>> import\book name 1\image2.tif
>> import\book name 1\image2.txt
>> import\book name 1\image3.tif
>> import\book name 1\image3.txt
>> import\book name 2\image1.tif
>> import\book name 2\image1.txt
>> import\book name 2\image2.tif
>> import\book name 2\image2.txt
>> import\book name 3\image1.tif
>> import\book name 3\image1.txt
>> import\book name 3\image2.tif
>> import\book name 3\image2.txt
>> import\book name 3\image3.tif
>> import\book name 3\image3.txt
>>
>> The last part of the path for the first line is "book name 1", so the
>> program assumes that this is the document title. You can, of course,
>> edit the metadata value. When you press the Start button, the program
>> reads item.dir until it finds that the path has changed, which
>> indicates that "book name 2" starts, so the title changes too. This
>> repeats until the end of the file. Inside each folder you will find a
>> file named hhmmssn.item (the name is the creation time of the file)
>> that contains the metadata and a list of page descriptions.
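
The grouping described above can be sketched in Python. This is not the code of indexador.exe (which is not shown in this thread), only an illustration under stated assumptions: the function name group_by_folder is hypothetical, the lines of item.dir are Windows-style paths, the listing contains only file entries as in the example, and files from the same folder are adjacent.

```python
from itertools import groupby
from pathlib import PureWindowsPath


def group_by_folder(listing_lines):
    """Group consecutive listing lines by their parent folder.

    Returns a list of (title, [file names]) tuples, where the title is
    the last component of the folder path, as described in the message.
    """
    paths = [PureWindowsPath(line.strip()) for line in listing_lines if line.strip()]
    groups = []
    # groupby starts a new group whenever the parent folder changes,
    # which mirrors "read item.dir until the path has changed".
    for folder, members in groupby(paths, key=lambda p: p.parent):
        groups.append((folder.name, [p.name for p in members]))
    return groups


# Hypothetical sample data in the shape of the item.dir example above.
listing = [
    r"import\book name 1\image1.tif",
    r"import\book name 1\image1.txt",
    r"import\book name 2\image1.tif",
    r"import\book name 2\image1.txt",
]
for title, files in group_by_folder(listing):
    print(title, files)
```

A real "dir /b /s" listing also contains the folder entries themselves and item.dir, so those lines would need to be filtered out before grouping.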
>>
>> Well, I hope this helps. Let me know if you have any problems.
>>
>> Regards
>>
>> Diego Spano Archivo Digital Secretaria de DD. HH. Ministerio de
>> Justicia y DD. HH. Tel.: 5167-6550
>>
>>
>> -----Original Message-----
>> From: Tomáš Fiala [mailto:tomas.fiala@ulib.sk]
>> Sent: Monday, 08 January 2007 03:41 a.m.
>> To: Diego Spano
>> CC: greenstone-users@list.scms.waikato.ac.nz
>> Subject: Re: [greenstone-users] Tool for writing item files - Paged Image
>> collection - pagedImage plugin -
>>
>> Hello Diego,
>>
>> Fantastic! That is the solution I was looking for! Please allow me to
>> thank you for developing this cool program. Without it, using the
>> PagedImage plug would be much harder for me.
>>
>> Please send the program to the following addresses: tomas.fiala@ulib.sk ;
>> tom.fiala@gmail.com (I hope the filters will not delete the
>> attachment), or just upload it to http://rapidshare.com/ and send me the
>> link.
>>
>> Again, big thanks !
>>
>> Sincerely,
>>
>> Tomas Fiala
>>
>>
>> Diego Spano wrote:
>>
>>> Hi Tomas, I developed a little program that creates .item files with
>>> the metadata you want to assign. With this program we have processed
>>> thousands of images. How does this program work? Suppose you have this
>>> folder structure:
>>
>>> import\doc 1\image1.tif
>>> import\doc 1\image1.txt
>>> import\doc 1\image2.tif
>>> import\doc 1\image2.txt
>>> import\doc 1\image3.tif
>>> import\doc 1\image3.txt
>>> import\doc 2\image1.tif
>>> import\doc 2\image1.txt
>>> import\doc 2\image2.tif
>>> import\doc 2\image2.txt
>>> import\doc 3\image1.tif
>>> import\doc 3\image1.txt
>>> import\doc 3\image2.tif
>>> import\doc 3\image2.txt
>>> import\doc 3\image3.tif
>>> import\doc 3\image3.txt
>>>
>>> You have folders, and inside them you have TIFF and txt files. Each
>>> image and its text file have the same name, and it does not matter how
>>> many files each folder contains. In the example I named the files
>>> imagex.txt and imagex.tif, but you can use any name you want; you only
>>> have to take into account that image and text files MUST have the same
>>> name.
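
The same-name rule can be illustrated with a short Python sketch. Nothing here comes from Diego's program: the helper names pair_pages and render_item are hypothetical, the set of image extensions is an assumption, and the .item layout (a "<Title>value" metadata line followed by "page:image:text" lines) is my understanding of the plain-text format read by Greenstone's paged image plugin, so please check it against the plugin documentation before relying on it.

```python
from pathlib import PurePath

# Assumed set of image extensions; not taken from the original program.
IMAGE_EXTENSIONS = {".tif", ".gif", ".jpg"}


def pair_pages(file_names):
    """Pair each image with the text file that shares its base name."""
    names = sorted(file_names)
    texts = {PurePath(n).stem: n for n in names if PurePath(n).suffix.lower() == ".txt"}
    pages = []
    for name in names:
        path = PurePath(name)
        if path.suffix.lower() in IMAGE_EXTENSIONS:
            # An image without a matching text file gets an empty text slot.
            pages.append((name, texts.get(path.stem, "")))
    return pages


def render_item(title, pages):
    """Build the text of an .item file (assumed layout, see the note above)."""
    lines = [f"<Title>{title}"]
    for number, (image, text) in enumerate(pages, start=1):
        lines.append(f"{number}:{image}:{text}")
    return "\n".join(lines) + "\n"


# Hypothetical folder contents in the shape of the example above.
files = ["image1.tif", "image1.txt", "image2.tif", "image2.txt"]
print(render_item("doc 1", pair_pages(files)))
```

Plain sorting puts image10 before image2, so the sketch assumes zero-padded or otherwise sortable file names.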
>>>
>>> The program will create a .item file inside the doc 1, doc 2 and doc 3
>>> folders. It is an .exe file, so you have to run it in Windows. If your
>>> scenario is like mine, I will send you the executable file. Let me
>>> know!!!
>>
>>> Bye
>>>
>>> Diego Spano
>>> Archivo Digital
>>> Secretaria de DD. HH. Ministerio de Justicia y DD. HH. Tel.: 5167-6550
>>>
>>>
>>>
>>> -----Original Message-----
>>> From: greenstone-users-bounces@list.scms.waikato.ac.nz
>>> [mailto:greenstone-users-bounces@list.scms.waikato.ac.nz] On behalf
>>> of Tomáš Fiala
>>> Sent: Thursday, 04 January 2007 07:32 a.m.
>>> To: greenstone-users@list.scms.waikato.ac.nz
>>> Subject: [greenstone-users] Tool for writing item files - Paged Image
>>> collection - pagedImage plugin -
>>>
>>> Hello,
>>>
>>> I am creating a paged image collection. I have a book with 500 pages
>>> (images + OCR) and it is very tedious to write .item files manually.
>>>
>>> Does anyone know of a tool that generates the .item files
>>> automatically?
>>>
>>> What other ways are there of putting OCR and image files together?
>>>
>>> Many thanks for your help !
>>>
>>> Sincerely,
>>>
>>> Tomas Fiala

--
Department of digitization
University library Bratislava
http://www.ulib.sk