Centralized agent control plane for governing runtime agent behavior
A nearly-live implementation of OpenAI's Whisper
An experimental version of the DeepSeek model
Advancing Open-source World Models
Contexts Optical Compression
Multi-modal large language model designed for audio understanding
Open-source framework for intelligent speech interaction
Concatenate a directory full of files into a single prompt
Easy Docker setup for Stable Diffusion with a user-friendly UI
A high-quality rapid TTS voice cloning model
Parse files for optimal RAG
Shell command execution server implementing the Model Context Protocol
The behavior guidance framework for customer-facing LLM agents
Paste Markdown and AI responses into Word and Excel instantly
Book about interpretable machine learning
A 0.1B Omni model trained from scratch
MOSS-TTS-Nano is an open-source, multilingual, tiny speech generation model
Learn it. Build it. Ship it for others
Qwen3-ASR is an open-source series of ASR models
An open-source toolkit for monitoring Large Language Models (LLMs)
A Multi-Modal World Model for Reconstruction, Generation, and Simulation
Provides convenient access to the Anthropic REST API from any Python 3 application
TFX is an end-to-end platform for deploying production ML pipelines
Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python
Python framework for adversarial attacks and data augmentation