Helps data scientists define testable self-documenting dataflows
Agentic, Reasoning, and Coding (ARC) foundation models
Personal AI, On Personal Devices
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Robust Speech Recognition via Large-Scale Weak Supervision
AI tool that removes hardcoded subtitles and text from videos locally
The official Python library for the OpenAI API
Image inpainting tool powered by SOTA AI Model
AI Fully Automated Short Video Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Official inference repo for FLUX.1 models
Lets make video diffusion practical
Everything you need to build state-of-the-art foundation models
Create videos with Stable Diffusion
A high-throughput and memory-efficient inference and serving engine
Model Context Protocol Server for Apache OpenDAL™
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Interact with your documents using the power of GPT
A lightweight audio-to-MIDI converter with pitch bend detection
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Tokenizer-Free TTS for Multilingual Speech Generation
Open-Sora: Democratizing Efficient Video Production for All
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Kimi Code CLI is your next CLI agent