Pipeline for training Language Models using PyTorch. Inspired by Yandex Data School NLP Course (week 03: Language Modeling) Prepared text file with space-separated words on each line.
Features
- Statistical Language Modeling
- Scripts for training statistical language models
- Scripts for validation statistical language models using perplexity
- Scripts for generation new sequences using statistical language models
- RNN Language Modeling
- Various implemented models
License
MIT LicenseFollow Pipeline for training Language Models
Other Useful Business Software
Go From AI Idea to AI App Fast
Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Pipeline for training Language Models!