Are you by any chance building collections containing PDF files? There is a
problem with the handling of certain PDF files which causes two strange
characters to be added into the Title metadata extracted from these files.
This means the doc.xml files fail to parse correctly, and during building you
get the error you report.
There is a simple patch to this problem, so if this sounds like you, let me
know and I'll send it to you and the list.
All the best,
Doug Carter wrote:
> Hi all,
> Since moving to 2.41, I'm seeing an error that I've not seen before in
> the .../etc/fail.log:
> doc.xml: no plugin could process this file
> In the past, I only saw recognizable file names, with "failed to convert"
> messages. I have multiple collections, and each has from 5-20 of these
> lines in their fail.log. I don't know where this message is coming from,
> if it's a real problem, or how I can get more information about the error.
> Any ideas?
> Doug Carter
> Mercy Corps
> greenstone-devel mailing list