Robust Speech Recognition via Large-Scale Weak Supervision
A high-throughput and memory-efficient inference and serving engine
Advanced language and coding AI model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Awesome multilingual OCR toolkits based on PaddlePaddle
Agentic, Reasoning, and Coding (ARC) foundation models
Petastorm library enables single-machine or distributed training
AI Toolkit for Healthcare Imaging
Official inference library for Mistral models
Image inpainting tool powered by SOTA AI models
The official Python library for the OpenAI API
Open-Sora: Democratizing Efficient Video Production for All
Model Context Protocol Server for Apache OpenDAL™
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
The Open Source Cowork Desktop to Unlock Your Exceptional Productivity
Synchronized Translation for Videos
Everything you need to build state-of-the-art foundation models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Generate audiobooks from e-books, voice cloning & 1107+ languages
Official inference repo for FLUX.2 models
Qwen-Image is a powerful image generation foundation model
Qwen2.5-VL is the multimodal large language model series
Python binding to the Apache Tika™ REST services