From: Alberto B. <al...@me...> - 2006-02-10 22:01:07
Hi all,

I'm trying to convert newspaper PDF files to DjVu while retaining the text stream as metadata, so that the text can be searched in the djview application. I've compiled gsdjvu against gs 8.15 and installed it. When I then run djvudigital with --lines or --words, I get this error:

*** msepdjvu: corrupted file (syntax error in text comment)
*** (csepdjvu.cpp:1170)
*** 'bool Comments::parse_comment_line(BufferByteStream&)'

It seems like an encoding problem, but I don't know how to track it down; any hint is welcome :)

The text in the PDF was probably generated in a Windows environment with the windows-1252 charset (the Windows Latin-1-like charset), and the system where I compiled gsdjvu and where I'm running it has a UTF-8 locale.

Has anyone had the same problem? Does anyone have a hint about what I can do to complete the conversion?

Thanks,
Alberto
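
P.S. To show the kind of mismatch I suspect, here is a minimal Python sketch. It is only a guess at the cause, not a confirmed diagnosis of the msepdjvu error. The helper name is_valid_utf8 and the sample word are made up for illustration, and I'm assuming the Latin-1-like Windows charset is what Python calls cp1252. An accented letter stored as a single Windows byte is not a valid UTF-8 sequence, so a reader expecting UTF-8 could reject it:

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Hypothetical sample: an Italian word with an accented letter.
word = "perch\u00e9"

as_utf8 = word.encode("utf-8")      # accent stored as two bytes
as_cp1252 = word.encode("cp1252")   # accent stored as one byte (0xE9)

print(is_valid_utf8(as_utf8))
print(is_valid_utf8(as_cp1252))
```

If the text comments passed to msepdjvu contain such single-byte accented characters, that might be what the parser is complaining about, but I have not verified this.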