Name | Modified | Size | Downloads / Week |
---|---|---|---|
NVIDIA NeMo Curator 0.6.0 source code.tar.gz | 2025-01-07 | 2.4 MB | |
NVIDIA NeMo Curator 0.6.0 source code.zip | 2025-01-07 | 2.6 MB | |
README.md | 2025-01-07 | 343 Bytes | |
Totals: 3 Items | | 5.0 MB | 0 |
What's changed
- Synthetic Data Generation for Text Retrieval
  - LLM-based Filters
    - Easiness
    - Answerability
  - Q&A Retrieval Generation Pipeline
- Parallel Dataset Curation for Machine Translation
  - Load/Write Bitext Files
  - Heuristic filtering (Histogram, Length Ratio)
  - Classifier filtering (Comet, Cometoid)
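
To give a rough idea of what a length-ratio heuristic for parallel text does, here is a minimal sketch in plain Python. It does not use the NeMo Curator API: the function names `length_ratio` and `keep_pair`, the whitespace tokenization, and the `max_ratio=3.0` threshold are all assumptions made for this illustration.

```python
def length_ratio(src: str, tgt: str) -> float:
    """Ratio of the longer segment's token count to the shorter one's.

    Tokens are whitespace-separated words (an assumption for this sketch).
    """
    src_len = len(src.split())
    tgt_len = len(tgt.split())
    if src_len == 0 or tgt_len == 0:
        # An empty side can never be a valid translation pair.
        return float("inf")
    return max(src_len, tgt_len) / min(src_len, tgt_len)


def keep_pair(src: str, tgt: str, max_ratio: float = 3.0) -> bool:
    """Keep a sentence pair only if its length ratio is within max_ratio."""
    return length_ratio(src, tgt) <= max_ratio


# Hypothetical English-German pairs, used only to exercise the filter.
pairs = [
    ("The cat sat on the mat.", "Die Katze saß auf der Matte."),
    ("Hello.", "Dies ist ein sehr langer Satz ohne Bezug zum Quelltext."),
]
filtered = [pair for pair in pairs if keep_pair(*pair)]
```

The second pair is dropped because one side is many times longer than the other, which usually signals a misaligned or noisy pair. A real pipeline would likely tune the threshold per language pair.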