Text-to-video and image-to-video generation: CogVideoX and CogVideo
Qwen3-TTS is an open-source series of TTS models
Official Python inference and LoRA trainer package
Awesome multilingual OCR toolkits based on PaddlePaddle
Qwen3 is the large language model series developed by the Qwen team
Provides convenient access to the Anthropic REST API from any Python 3
The most powerful local music generation model
Python SDK for Claude Agent
Official inference repo for FLUX.1 models
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Native and Compact Structured Latents for 3D Generation
Recovering the Visual Space from Any Views
Hackable and optimized Transformers building blocks
Visual Causal Flow
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Python inference and LoRA trainer package for the LTX-2 audio–video
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A Unified Framework for Text-to-3D and Image-to-3D Generation
Industrial-level controllable zero-shot text-to-speech system
26M function-calling model that runs on incredibly small devices
A Pragmatic VLA Foundation Model
Qwen3-ASR is an open-source series of ASR models
An experimental version of DeepSeek model
DeepSeek Coder: Let the Code Write Itself