Re: [greenstone-users] Tool for writing item files - Paged Image collection - pagedImage plugin -

From Michael Dewsnip
DateWed, 10 Jan 2007 16:55:32 +1300
Subject Re: [greenstone-users] Tool for writing item files - Paged Image collection - pagedImage plugin -
In-Reply-To (!~!UENERkVCMDkAAQACAAAAAAAAAAAAAAAAABgAAAAAAAAAyqR8sqfeLUK9jdmD6lzpfOKAAAAQAAAArb1StSFewEyLDVOUihvINgEAAAAA-jus-gov-ar)
Hi Diego, Arthur,

It's great to hear about the work you've been doing on programs for
generating .item files, and from the discussion on the lists it's
obvious that many people would find these useful.

We'd be happy to link to these programs on the Greenstone Wiki (and host
them, if necessary), if you are willing to share them. Or, if you wanted
to GPL your code we could even include them in future releases of
Greenstone.

Keep up the good work!

All the best,

Michael

Diego Spano wrote:

>Tomas, I´m sending you the program as a zip file.
>
>Let me explain how dows it works.
>
>1- just open a cmd window and go to collectyour_collectimport folder (I
>assume you have documents there. If not go to where they are!)
>2- Run the following command: "dir /b /s >item.dir". This will generate a
>txt file containing all folders with images and texts inside them. Close cmd
>window.
>3- Copy indexador.exe in collectyour_collectimport folder.
>4- Run indexador.exe.
>5- The program lets you add 4 metadata to your documents. The first one is
>Title and I´t cannot be disabled. If you wish to add more metadata simply
>enable the check box and write the metadata name.
>6- The program begins reading item.dir and will use last part of the path of
>each line as the document title. Suppose you have the following item.dir:
>
> importbook name 1image1.tif
> importbook name 1image1.txt
> importbook name 1image2.tif
> importbook name 1image2.txt
> importbook name 1image3.tif
> importbook name 1image3.txt
> importbook name 2image1.tif
> importbook name 2image1.txt
> importbook name 2image2.tif
> importbook name 2image2.txt
> importbook name 3image1.tif
> importbook name 3image1.txt
> importbook name 3image2.tif
> importbook name 3image2.txt
> importbook name 3image3.tif
> importbook name 3image3.txt
>
>The last part of the path for the first line is "book name 1" so the program
>assume that this is the document title. Obviously, you can edit metadata
>value. When you press start button the program read item.dir until it finds
>that the path has changed, this indicates that "book name 2" starts. So,
>title changes too. And this repeats until end of file.You will find inside
>each folder a file named hhmmssn.item (the name is the creation time of the
>file) that contains metadata and a list of pages descriptions.
>
>Well, hope this help you. Let me know if you have any problem.
>
>Regards
>
>Diego Spano
>Archivo Digital
>Secretaria de DD. HH.
>Ministerio de Justicia y DD. HH.
>Tel.: 5167-6550
>
>
>
>-----Mensaje original-----
>De: Tomáš Fiala [mailto:tomas.fiala@ulib.sk]
>Enviado el: Lunes, 08 de Enero de 2007 03:41 a.m.
>Para: Diego Spano
>CC: greenstone-users@list.scms.waikato.ac.nz
>Asunto: Re: [greenstone-users] Tool for writing item files - Paged Image
>collection - pagedImage plugin -
>
>Hello Diego,
>
>fantastic ! That is the solution I was looking for ! Please allow me to
>thank you for developing this cool program. Without it, using PagedImage
>plug would be much harder for me.
>
>Please send me the program to following emails: tomas.fiala@ulib.sk ;
>tom.fiala@gmail.com. (I hope that the filters will not delete the
>attachment) or just upload it to http://rapidshare.com/ a send me the link.
>
>Again, big thanks !
>
>Sincerely,
>
>Tomas Fiala
>
>
>Diego Spano wrote / napísal(a):
>
>
>>Hi Tomas, I developed a little program that creates .item files with
>>the metadata you want to assign. With this program we have procesed
>>thousands of images. How does this program work? Suppose you have this
>>
>>
>folder structure:
>
>
>>importdoc 1image1.tif
>>importdoc 1image1.txt
>>importdoc 1image2.tif
>>importdoc 1image2.txt
>>importdoc 1image3.tif
>>importdoc 1image3.txt
>>importdoc 2image1.tif
>>importdoc 2image1.txt
>>importdoc 2image2.tif
>>importdoc 2image2.txt
>>importdoc 3image1.tif
>>importdoc 3image1.txt
>>importdoc 3image2.tif
>>importdoc 3image2.txt
>>importdoc 3image3.tif
>>importdoc 3image3.txt
>>
>>You have folders and inside them you have tiffs and txt files. Both
>>files have the same name and no matter how many files you have inside
>>them. In the example i named the files imagex.txt and imagex.tif but
>>you can put the name you want, you only have to take in account that
>>image and text files MUST have the same name.
>>
>>The program will create a.item file inside doc1, doc2 and doc3
>>folders. Is a .exe file, so you have to run it in Windows. If your
>>scenario is like mine, then I will send you the executable file. Let me
>>
>>
>know!!!
>
>
>>Bye
>>
>>Diego Spano
>>Archivo Digital
>>Secretaria de DD. HH.
>>Ministerio de Justicia y DD. HH.
>>Tel.: 5167-6550
>>
>>
>>
>>-----Mensaje original-----
>>De: greenstone-users-bounces@list.scms.waikato.ac.nz
>>[mailto:greenstone-users-bounces@list.scms.waikato.ac.nz] En nombre de
>>Tomáš Fiala Enviado el: Jueves, 04 de Enero de 2007 07:32 a.m.
>>Para: greenstone-users@list.scms.waikato.ac.nz
>>Asunto: [greenstone-users] Tool for writing item files - Paged Image
>>collection - pagedImage plugin -
>>
>>Hello,
>>
>>I am creating pagedimage collection. I have a book with 500 pages
>>(Images+OCR) and its very uneasy to write .item files manually.
>>
>>Please, does anyone know a tool which generates the .item files
>>automatically ???
>>
>>What are the other ways of putting OCR+Image files together ?
>>
>>Many thanks for your help !
>>
>>Sincerely,
>>
>>Tomas Fiala
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>
>
>
>------------------------------------------------------------------------
>
>_______________________________________________
>greenstone-users mailing list
>greenstone-users@list.scms.waikato.ac.nz
>https://list.scms.waikato.ac.nz/mailman/listinfo/greenstone-users
>
>