Huffman dictionary encoding issue
In SingleTreeDictionaryEntry, the dictionary cfg file is read using new StreamReader(fileStream) (http://msdn.microsoft.com/en-us/library/yhfzs7at.aspx), which assumes UTF-8 encoding, but the bundled "Huffman Dictionary GBR.cfg" file is not UTF-8 encoded. As a result, accented characters and other non-ASCII text come out as garbage (particularly noticeable on the Irish EPG). At least one other person has encountered this issue; see the thread https://sourceforge.net/projects/epgcollector/forums/forum/1125946/topic/4075540
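To illustrate the failure mode, here is a minimal Python sketch (not the project's C# code). It assumes the dictionary was saved in a single-byte code page such as Windows-1252; I have not confirmed which encoding the file actually uses, so treat cp1252 as a stand-in.

```python
# Assumption: the dictionary uses a single-byte code page such as Windows-1252.
# The real encoding of the bundled file is not confirmed here.
raw = "é".encode("cp1252")  # the single byte 0xE9

# Decoding those bytes as UTF-8 cannot interpret 0xE9 on its own,
# so the character is replaced with U+FFFD (the "garbage" seen in the EPG).
as_utf8 = raw.decode("utf-8", errors="replace")

# Decoding with the encoding the file was written in recovers the character.
as_cp1252 = raw.decode("cp1252")

print(repr(as_utf8))
print(repr(as_cp1252))
```

The same mismatch applies to any byte above 0x7F in the file, which is why plain ASCII entries look fine and only accented characters are affected.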
I have verified that transcoding the dictionary to UTF-8 fixes the issue. Transcoded file attached.
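For anyone who wants to repeat the transcoding on another dictionary, something along these lines should work. It is only a sketch: the function name `transcode_to_utf8` is made up for this example, and the default source encoding (cp1252) is an assumption that needs checking against the actual file.

```python
from pathlib import Path


def transcode_to_utf8(src, dst, source_encoding="cp1252"):
    """Re-encode a text file as UTF-8.

    source_encoding is an assumption: pass whatever encoding the
    original file was really saved in.
    """
    # Work on raw bytes so line endings are preserved exactly.
    data = Path(src).read_bytes()
    # Strict decoding raises UnicodeDecodeError if the assumed encoding
    # is wrong for some byte, instead of silently corrupting the output.
    text = data.decode(source_encoding)
    Path(dst).write_bytes(text.encode("utf-8"))
```

Example call (the output file name is hypothetical): `transcode_to_utf8("Huffman Dictionary GBR.cfg", "Huffman Dictionary GBR.utf8.cfg")`. Writing to a separate file keeps the original around for comparison.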
I didn't look through the other dictionaries/classes to determine whether this is limited to this file or whether other dictionaries have the same problem.
OpenTV GBR dictionary - UTF-8 encoded
Relevant files changed to UTF-8 in v3.2