Menu

Tree [9fd0ab] main /
 History

HTTPS access


File Date Author Commit
 README.md 2022-08-14 Kopinjol Baishya Kopinjol Baishya [ae7087] Update README.md
 blog1.txt 2022-08-14 Kopinjol Baishya Kopinjol Baishya [9fd0ab] Add files via upload
 driver_t.py 2022-08-14 Kopinjol Baishya Kopinjol Baishya [9fd0ab] Add files via upload
 tokenite.py 2022-08-14 Kopinjol Baishya Kopinjol Baishya [9fd0ab] Add files via upload

Read Me

Some-NLP-experiments

Some NLP experiments starting with a tokenization attempt in Python.
The code tokenite.py reads a text file "blog1.txt" and tries to tokenize it. The code doesnot work as is, but is almost on the verge of working. Any suggestions will be greatly appreciated.

I define a class called text and define methods inside it. The method count defines a generator which I use in the method named t_tok. But if you look closely at 66 to 72 you will see that I am modifying the outer limit of the for loop while in the loop. It doesnot work. But I dont see the reason why.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.