From: Jim <li...@yg...> - 2005-06-23 08:01:28
|
On Wed, 22 Jun 2005, Andrej wrote: > we would like to index a Japanese website. The pages are using utf-8 > character encoding. The ht://Dig FAQ states that ht://Dig cannot index > Japanese pages yet, since they require 16-bit characters, which is not > supported by ht://Dig. > > Has there been an update lately concerning this problem or do you know of a > possible workaround that will enable us to index the pages nonetheless? I am not aware of any progress in this area. The 3.2.x code still lacks support for multi-byte characters. There are still plans to add some level of Unicode support to a future version, but when such a version might become available is a complete unknown. Jim |