|Date||Fri, 01 Jun 2007 12:17:07 +1200|
|Subject||Re: [greenstone-users] MARC --> GREENSTONE|
I've taken a look at this. MARCPlug uses the perl library MARC:Record to read marc files. MARC records can be in two different encodings, MARC-8, and UTF-8. This library defaults to using the MARC-8 encoding when reading the files, and I dont believe it handles the records automatically if they are in fact in UTF-8. So the problem is not how greenstone handles the encoding, but seems to be with the MARC:Record reader before the data even gets to greenstone.
I looked up MARC:Record on cpan, turns out that there is a "encoding" function that has been added to the Record library, that allows the encoding to be set, however this function isnt in the MARC perl library currently in greenstone. You could try updating the MARC libraries used by greenstone perhaps, and/or using the encoding function below to set the encoding to UTF-8 in MARCPlug.pm
Hope this helps,
-- DL Consulting Greenstone Digital Library and Digitisation Specialists email@example.com www.dlconsulting.com
Tom□□ Fiala wrote: