Dataset of GPT-2 outputs for research in detection, biases, and more
kaldi-asr/kaldi is the official location of the Kaldi project
A PyTorch-based Speech Toolkit
Harness LLMs with Multi-Agent Programming
The most accurate natural language detection library for Python
Access large language models from the command-line
Trained models & code to predict toxic comments
Underthesea - Vietnamese NLP Toolkit
An easy-to-use LLMs quantization package with user-friendly apis
LLM based data scientist, AI native data application
Superlinked is a Python framework for AI Engineers
Python implementation of TextRank algorithms
Adding guardrails to large language models
Phi-3.5 for Mac: Locally-run Vision and Language Models
Model Context Protocol tool support for LangChain
Simple, Pythonic building blocks to evaluate LLM applications
Data and tools for generating and inspecting OLMo pre-training data
Persian NLP Toolkit
WikiChat is an improved RAG
ReFT: Representation Finetuning for Language Models
TextWorld is a sandbox learning environment for the training
Research-oriented chatbot framework
Large Language Model Text Generation Inference
Standalone, small, language-neutral
Integrating LLMs into structured NLP pipelines