Awesome multilingual OCR toolkits based on PaddlePaddle
Python SDK for Claude Agent
Visual Causal Flow
From Images to High-Fidelity 3D Assets
Video Object and Interaction Deletion
Qwen3.5 is the large language model series developed by Qwen team
RGBD video generation model conditioned on camera input
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Open Source Speech Language Model
A multimodal model for brain response prediction
Claude Code image, a one-stop open source transit service
Controllable & emotion-expressive zero-shot TTS
Long-form streaming TTS system for multi-speaker dialogue generation
Contexts Optical Compression
Open-source framework for intelligent speech interaction
Audio foundation model excelling in audio understanding
Qwen3-ASR is an open-source series of ASR models
State of the art LLM and coding model
Analyze computation-communication overlap in V3/R1
Pushing the Limits of Mathematical Reasoning in Open Language Models
Foundational Models for State-of-the-Art Speech and Text Translation
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Example Discord bot written in Python that uses the completions API
Let us control diffusion models