Let's make video diffusion practical
Qwen3-TTS is an open-source series of TTS models
OCR expert VLM powered by Hunyuan's native multimodal architecture
The official repo of Qwen chat & pretrained large language model
Proxy that exposes Antigravity-provided Claude / Gemini models
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Python SDK for Claude Agent
Robust Speech Recognition Across Languages, Dialects
Global weather forecasting model using graph neural networks and JAX
Provides convenient access to the Anthropic REST API from any Python 3
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Continuous Autonomy for the AI SDK
Generate Any 3D Scene in Seconds
New set of lightweight, state-of-the-art open foundation models
Official implementation of DreamCraft3D
Language modeling in a sentence representation space
Advancing Formal Mathematical Reasoning via Reinforcement Learning
Large Multimodal Models for Video Understanding and Editing
Pushing the Limits of Mathematical Reasoning in Open Language Models
Chat & pretrained large vision language model
A Conversational Speech Generation Model
Powerful open-source image generation model
Example Discord bot written in Python that uses the completions API
Official code for Style Aligned Image Generation via Shared Attention
Tencent’s 36-language state-of-the-art translation model