Re: [pdftohtml] Trouble wirh Unicode Cyrillic...
Status: Beta
Brought to you by:
meshko
|
From: Mikhail K. <me...@cs...> - 2002-07-16 00:18:09
|
> You definitely don't want to modify the tables in the source code -- > it's much easier to add external encoding files. As Mikhail said, you > could try using the UTF-8 encoding, or try downloading the Cyrillic > support package, and then selecting the KOI8-R encoding. provided that Bolgarians use KOI8-R :) I doubt that, because Bolgarian most likely has a couple of extra characters. E.g. Ukrainian has special encoding KOI8-U etc. > > Unfortunately I don't completely understand yet how encoding handling > > works in xpdf. > > I know how it works in Xpdf :-), but I have no idea how that affects > pdftohtml. Unless I broke something, it should work exactly like pdftotext. |