|Date||Thu Jul 2 21:14:45 2009|
|Subject||[greenstone-devel] Merging separate PDF page files - Running pdftk from within Greenstone|
I am building a digital library to display newspaper pages and am importing an individual (searchable image) PDF file for each newspaper page. Metadata will include information on individual articles, including the pages on which articles occur. In addition to full text searching, I want to include the facility to search by article title. Some of the articles run over several pages and I want the user to be able to view a found article as a single PDF file. This would require that the separate page files making up the article be merged together to form a single file.
I know that pdftk.exe can merge PDF files. Is it possible to run pdftk.exe from inside Greenstone by modifying one of its Perl macros? If so, I could have the system read the page numbers for an article from the metadata, then produce and deliver a single PDF file for that article in real time.
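To make the idea concrete, here is a rough sketch of the merge step on its own, outside Greenstone. It is written in Python only for illustration and says nothing about how Greenstone would actually call it. The helper names, the `page_NNN.pdf` file naming pattern, and the directory layout are all assumptions of mine; the only part taken from pdftk itself is the `cat output` form of the command line.

```python
import subprocess
from pathlib import Path


def page_files_for_article(page_numbers, page_dir, pattern="page_{:03d}.pdf"):
    """Map article page numbers to per-page PDF paths.

    The naming pattern is hypothetical; adjust it to match how the
    page files are really named in the collection.
    """
    return [Path(page_dir) / pattern.format(n) for n in page_numbers]


def build_merge_command(page_files, output_file, pdftk="pdftk"):
    """Build the argument list for: pdftk p1.pdf p2.pdf ... cat output out.pdf"""
    if not page_files:
        raise ValueError("at least one page file is required")
    return [pdftk, *map(str, page_files), "cat", "output", str(output_file)]


def merge_article(page_numbers, page_dir, output_file, pdftk="pdftk"):
    """Merge the page PDFs for one article into a single PDF file.

    Requires pdftk to be installed; pass pdftk="pdftk.exe" or a full
    path on Windows if it is not on the PATH.
    """
    files = page_files_for_article(page_numbers, page_dir)
    cmd = build_merge_command(files, output_file, pdftk=pdftk)
    # check=True raises CalledProcessError if pdftk exits with an error.
    subprocess.run(cmd, check=True)
    return output_file
```

For an article on pages 3 and 4, `merge_article([3, 4], "pages", "article.pdf")` would run pdftk on `pages/page_003.pdf` and `pages/page_004.pdf`. The open question is still where a call like this could be hooked into Greenstone so that it runs when the user requests the article.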