Instant voice cloning by MIT and MyShell. Audio foundation model
Multilingual Document Layout Parsing in a Single Vision-Language Model
Claude Code skill implementing Manus-style persistent planning
CLIP, Predict the most relevant text snippet given an image
The simplest, fastest repository for training/finetuning models
DSPy: The framework for programming—not prompting—language models
PraisonAI application combines AutoGen and CrewAI or similar framework
A Python framework to write Kubernetes operators in just a few lines
Agent S: an open agentic framework that uses computers like a human
Build portable, production-ready MLOps pipelines
An extremely fast Python linter, written in Rust
The CUDA target for Numba
Agent-ready RPA suite with visual workflow automation tools engine
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Less Code, Lower Barrier, Faster Deployment
Full stack, modern web application generator
LLM training code for MosaicML foundation models
Official SeedVR2 Video Upscaler for ComfyUI
PersonaPlex code
Programmatic access to the AlphaGenome model
Open-source, code-first Python toolkit for building, evaluating, etc.
Optax is a gradient processing and optimization library for JAX
Mentat - The AI Coding Assistant
Formula recognition based on LaTeX-OCR and ONNXRuntime
Implementation of Python 3 running in the browser