Some-NLP-experiments Code
Brought to you by:
kopinjol
File | Date | Author | Commit |
---|---|---|---|
README.md | 2022-08-14 | | [ae7087] Update README.md |
blog1.txt | 2022-08-14 | | [9fd0ab] Add files via upload |
driver_t.py | 2022-08-14 | | [9fd0ab] Add files via upload |
tokenite.py | 2022-08-14 | | [9fd0ab] Add files via upload |
Some NLP experiments starting with a tokenization attempt in Python.
The script tokenite.py reads the text file "blog1.txt" and tries to tokenize it. The code does not work as is, but it is close to working. Any suggestions will be greatly appreciated.
I define a class called text and define its methods inside it. The method count defines a generator, which I use in the method named t_tok. If you look closely at lines 66 to 72 of tokenite.py, you will see that I modify the upper limit of the for loop from inside the loop. This does not work, and I don't see why.
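
One possible explanation, assuming the loop at lines 66 to 72 has the form `for i in range(limit):` (tokenite.py is not reproduced here, so this is a guess): Python evaluates `range(limit)` once, before the first iteration, so changing `limit` inside the loop body does not change how many times the loop runs. A `while` loop re-checks its condition on every pass and does pick up the change. The sketch below shows the difference; the function names `visit_with_for` and `visit_with_while` are made up for illustration and are not part of tokenite.py.

```python
def visit_with_for(n):
    """Try to shrink the loop bound from inside a for loop."""
    limit = n
    visited = []
    for i in range(limit):   # range(limit) is built once, right here
        visited.append(i)
        limit -= 1           # rebinding limit has no effect on the range
    return visited


def visit_with_while(n):
    """Same idea, but the bound is re-checked on every iteration."""
    limit = n
    visited = []
    i = 0
    while i < limit:         # condition is re-evaluated each pass
        visited.append(i)
        limit -= 1           # this change is seen by the next check
        i += 1
    return visited


if __name__ == "__main__":
    print(visit_with_for(6))    # [0, 1, 2, 3, 4, 5]
    print(visit_with_while(6))  # [0, 1, 2]
```

If this is indeed the cause, rewriting the loop in t_tok as a `while` loop with an explicit index would be one way to make the shrinking limit take effect.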