Text and image to video generation: CogVideoX and CogVideo
Official Python inference and LoRA trainer package
Qwen3-TTS is an open-source series of TTS models
The most powerful local music generation model
Qwen3 is the large language model series developed by Qwen team
Official inference repo for FLUX.1 models
Awesome multilingual OCR toolkits based on PaddlePaddle
Recovering the Visual Space from Any Views
A Pragmatic VLA Foundation Model
Provides convenient access to the Anthropic REST API from any Python 3
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Native and Compact Structured Latents for 3D Generation
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
26m function call model that runs on incredibly small devices
Python SDK for Claude Agent
Open-Source Financial Large Language Models
Python inference and LoRA trainer package for the LTX-2 audio–video
ChatGLM-6B: An Open Bilingual Dialogue Language Model
1B text generation model based on the HRM architecture
Industrial-level controllable zero-shot text-to-speech system
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
DeepSeek Coder: Let the Code Write Itself
Qwen3-ASR is an open-source series of ASR models
A Unified Framework for Text-to-3D and Image-to-3D Generation