Pipeline for training Language Models using PyTorch. Inspired by Yandex Data School NLP Course (week 03: Language Modeling) Prepared text file with space-separated words on each line.
Features
- Statistical Language Modeling
- Scripts for training statistical language models
- Scripts for validation statistical language models using perplexity
- Scripts for generation new sequences using statistical language models
- RNN Language Modeling
- Various implemented models
License
MIT LicenseFollow Pipeline for training Language Models
Other Useful Business Software
Forever Free Full-Stack Observability | Grafana Cloud
Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Pipeline for training Language Models!