Lets make video diffusion practical
Qwen3-TTS is an open-source series of TTS models
OCR expert VLM powered by Hunyuan's native multimodal architecture
The official repo of Qwen chat & pretrained large language model
Python SDK for Claude Agent
Robust Speech Recognition Across Languages, Dialects
Global weather forecasting model using graph neural networks and JAX
Provides convenient access to the Anthropic REST API from any Python 3
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Generate Any 3D Scene in Seconds
Official implementation of DreamCraft3D
Language modeling in a sentence representation space
Large Multimodal Models for Video Understanding and Editing
Pushing the Limits of Mathematical Reasoning in Open Language Models
Chat & pretrained large vision language model
A Conversational Speech Generation Model
Powerful open source image generation model
Example Discord bot written in Python that uses the completions API
Official code for Style Aligned Image Generation via Shared Attention
Tencent’s 36-language state-of-the-art translation model