by fail how do you mean - not the expected results - currently the tokenisation is not very unicode aware, development is currently focused on lower bit unicode as a primary focus, this should be easy to address in the tokenisation code however
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
by fail how do you mean - not the expected results - currently the tokenisation is not very unicode aware, development is currently focused on lower bit unicode as a primary focus, this should be easy to address in the tokenisation code however