Menu

#1 utf -> kh -> utf conversion is not lossless

open
None
5
2010-02-09
2010-02-07
criss
No

I think we need to work more on this.

python ../src/utf82kh.py test.txt
output: test-k.txt

python ../src/kh2utf8.py ../build/test-k.txt > test-k-u.txt
output: test-k-u.txt

ideally the test-k-u.txt should be the file identical to the test.txt

we need to test these with many samples.

Discussion

  • criss

    criss - 2010-02-09

    Assigned to ashu:

    Ashu you can use the sample file attached to try out if there is leak in encoding, also you can find the large hindi text in corpus in trunk/coupus/hindi.txt

     
  • criss

    criss - 2010-02-09

    Sample hindi file

     
  • criss

    criss - 2010-02-09

    Forgot to assigne

     
  • criss

    criss - 2010-02-09
    • assigned_to: nobody --> ashuverma89
     

Log in to post a comment.

MongoDB Logo MongoDB