26m function call model that runs on incredibly small devices
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
TensorRT LLM provides users with an easy-to-use Python API
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
gpt-oss-120b and gpt-oss-20b are two open-weight language models
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Offline inference engine for art, real-time voice conversations
NeuTTS model built from small LLM backbones
On-device TTS model by Neuphonic
Python framework for building scalable multi-agent systems
Question and Answer based on Anything
Bidirectional token-classification model for identifiable info
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Play ChatGPT and other LLM with Xiaomi AI Speaker
Supercharge Your Model Training
Language-model investigation agent with a terminal UI
Voice Recognition to Text Tool
950 line, minimal, extensible LLM inference engine built from scratch
State-of-the-art Parameter-Efficient Fine-Tuning
Official PyTorch Implementation
Multi-lingual large voice generation model, providing inference
AIMET is a library that provides advanced quantization and compression
A 0.1B Omni model trained from scratch
Claude Code, but it runs on your Mac for free
Numerical differential equation solvers in JAX