A SOTA open-source image editing model
HunyuanVideo: A Systematic Framework For Large Video Generation Model
FlashInfer: Kernel Library for LLM Serving
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code
Open-source, code-first Python toolkit for building, evaluating, etc.
Train a 26M-parameter GPT from scratch in just 2h
The official Python client for the Huggingface Hub
Ultralytics YOLO
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Free, high-quality text-to-speech API endpoint to replace OpenAI
A high performance implementation of HDBSCAN clustering
MOSS‑TTS Family open‑source speech and sound generation model
Open source healthcare AI
Open source AI pair programmer for coding, debugging, automation
Autonomous LLM agent for end-to-end data science workflows
Open source RAG framework for building scalable modular AI apps
Secure local-first microVM sandbox for running untrusted code fast
HivisionIDPhotos: a lightweight and efficient AI ID photos tools
Enhances Tesseract OCR output using LLMs (local or API)
Clone a voice in 5 seconds to generate arbitrary speech in real-time
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Opensource browser using agents
gpt-4o for windows, macos and linux
Compress tool outputs, logs, files, and RAG chunks