I just realized that UTF-8 is already supported, I'm sorry.
Tomáš Fiala wrote / napísal(a):
> Hi Diego,
> thank you for your help and for creating an easy and fast program for
> generating .item files !
> It will be a big help for me ! The only problem about it is that it
> process just files with .tif extension.
> I tried to process .gif and .jpg, but then hhmmssn.item wasn't
> created. Please would it be possible to make the program work with
> other extensions ? Maybe the UTF-8 support would be also nice, but i
> think it not really necessary.
> Again, BIG THANKS FOR YOUR WORK ! It will save time to many people !
> Tomáš Fiala
> Diego Spano wrote / napísal(a):
>> Tomas, I´m sending you the program as a zip file.
>> Let me explain how dows it works.
>> 1- just open a cmd window and go to collectyour_collectimport
>> folder (I
>> assume you have documents there. If not go to where they are!)
>> 2- Run the following command: "dir /b /s >item.dir". This will
>> generate a
>> txt file containing all folders with images and texts inside them.
>> Close cmd
>> 3- Copy indexador.exe in collectyour_collectimport folder. 4- Run
>> 5- The program lets you add 4 metadata to your documents. The first
>> one is
>> Title and I´t cannot be disabled. If you wish to add more metadata
>> enable the check box and write the metadata name.
>> 6- The program begins reading item.dir and will use last part of the
>> path of
>> each line as the document title. Suppose you have the following
>> importbook name 1image1.tif
>> importbook name 1image1.txt
>> importbook name 1image2.tif
>> importbook name 1image2.txt
>> importbook name 1image3.tif
>> importbook name 1image3.txt
>> importbook name 2image1.tif
>> importbook name 2image1.txt
>> importbook name 2image2.tif
>> importbook name 2image2.txt
>> importbook name 3image1.tif
>> importbook name 3image1.txt
>> importbook name 3image2.tif
>> importbook name 3image2.txt
>> importbook name 3image3.tif
>> importbook name 3image3.txt
>> The last part of the path for the first line is "book name 1" so the
>> assume that this is the document title. Obviously, you can edit metadata
>> value. When you press start button the program read item.dir until it
>> that the path has changed, this indicates that "book name 2" starts. So,
>> title changes too. And this repeats until end of file.You will find
>> each folder a file named hhmmssn.item (the name is the creation time
>> of the
>> file) that contains metadata and a list of pages descriptions.
>> Well, hope this help you. Let me know if you have any problem.
>> Diego Spano Archivo Digital Secretaria de DD. HH. Ministerio de
>> Justicia y DD. HH. Tel.: 5167-6550
>> -----Mensaje original-----
>> De: Tomáš Fiala [mailto:firstname.lastname@example.org] Enviado el: Lunes, 08 de
>> Enero de 2007 03:41 a.m.
>> Para: Diego Spano
>> CC: email@example.com
>> Asunto: Re: [greenstone-users] Tool for writing item files - Paged Image
>> collection - pagedImage plugin -
>> Hello Diego,
>> fantastic ! That is the solution I was looking for ! Please allow me to
>> thank you for developing this cool program. Without it, using PagedImage
>> plug would be much harder for me.
>> Please send me the program to following emails: firstname.lastname@example.org ;
>> email@example.com. (I hope that the filters will not delete the
>> attachment) or just upload it to http://rapidshare.com/ a send me the
>> Again, big thanks !
>> Tomas Fiala
>> Diego Spano wrote / napísal(a):
>>> Hi Tomas, I developed a little program that creates .item files with
>>> the metadata you want to assign. With this program we have procesed
>>> thousands of images. How does this program work? Suppose you have this
>> folder structure:
>>> importdoc 1image1.tif
>>> importdoc 1image1.txt
>>> importdoc 1image2.tif
>>> importdoc 1image2.txt
>>> importdoc 1image3.tif
>>> importdoc 1image3.txt
>>> importdoc 2image1.tif
>>> importdoc 2image1.txt
>>> importdoc 2image2.tif
>>> importdoc 2image2.txt
>>> importdoc 3image1.tif
>>> importdoc 3image1.txt
>>> importdoc 3image2.tif
>>> importdoc 3image2.txt
>>> importdoc 3image3.tif
>>> importdoc 3image3.txt
>>> You have folders and inside them you have tiffs and txt files. Both
>>> files have the same name and no matter how many files you have
>>> inside them. In the example i named the files imagex.txt and
>>> imagex.tif but you can put the name you want, you only have to take
>>> in account that image and text files MUST have the same name.
>>> The program will create a.item file inside doc1, doc2 and doc3
>>> folders. Is a .exe file, so you have to run it in Windows. If your
>>> scenario is like mine, then I will send you the executable file. Let me
>>> Diego Spano
>>> Archivo Digital
>>> Secretaria de DD. HH. Ministerio de Justicia y DD. HH. Tel.: 5167-6550
>>> -----Mensaje original-----
>>> De: firstname.lastname@example.org
>>> [mailto:email@example.com] En nombre
>>> de Tomáš Fiala Enviado el: Jueves, 04 de Enero de 2007 07:32 a.m.
>>> Para: firstname.lastname@example.org
>>> Asunto: [greenstone-users] Tool for writing item files - Paged Image
>>> collection - pagedImage plugin -
>>> I am creating pagedimage collection. I have a book with 500 pages
>>> (Images+OCR) and its very uneasy to write .item files manually.
>>> Please, does anyone know a tool which generates the .item files
>>> automatically ???
>>> What are the other ways of putting OCR+Image files together ?
>>> Many thanks for your help !
>>> Tomas Fiala
> greenstone-users mailing list
Department of digitization
University library Bratislava