Accurate × Fast × Comprehensive
Instant voice cloning by MIT and MyShell. Audio foundation model
Renderer for the harmony response format to be used with gpt-oss
A Markdown-first memory system, a standalone library for any AI agent
Self-evolving autonomous agent framework
Collections of robotics environments
Deepfakes Software For All
The data structure for multimodal data
AI Toolkit for Healthcare Imaging
CogView4, CogView3-Plus and CogView3(ECCV 2024)
CNCF Sandbox Project
Stable Diffusion web UI
Create UIs for your machine learning model in Python in 3 minutes
General-purpose image editing model that delivers high-fidelity
Utilities intended for use with Llama models
Sharp Monocular Metric Depth in Less Than a Second
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Open-source autonomous AI software engineer
From Vibe Coding to Agentic Engineering
Provides code for running inference with the SegmentAnything Model
TensorFlow is an open source library for machine learning
Multimodal embedding and reranking models built on Qwen3-VL
Framework for building and orchestrating multi-agent AI systems
kaldi-asr/kaldi is the official location of the Kaldi project
OCR software, free and offline