Petastorm library enables single machine or distributed training
MARS5 speech model (TTS) from CAMB.AI
A TTS model capable of generating ultra-realistic dialogue
AutoGluon: AutoML for Image, Text, and Tabular Data
AI-Powered Personalized Learning Assistant
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
A Repo For Document AI
A Systematic Framework for Interactive World Modeling
Automatically translates the text of a video based on a subtitle file
Making RAG Simpler with Small and Open-Sourced Language Models
Automate native Android apps with AI using accessibility APIs
Ultra-Efficient LLMs on End Device
LLM-based Reinforcement Learning audio edit model
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Text-space optimizer that trains reusable natural-language skills
A straightforward method for training your LLM
Public opinion analysis system
Visual Causal Flow
"Big Model" trains a visual multimodal VLM with 26M parameters
Interface for OuteTTS models
Simple, Pythonic building blocks to evaluate LLM applications
Tensor search for humans
Context database designed specifically for AI Agents
Models for the spaCy Natural Language Processing (NLP) library