Text and image to video generation: CogVideoX and CogVideo
Official Python inference and LoRA trainer package
Qwen3-TTS is an open-source series of TTS models
Awesome multilingual OCR toolkits based on PaddlePaddle
Proxy that exposes Antigravity provided claude / gemini models
The most powerful local music generation model
Official inference repo for FLUX.1 models
Qwen3 is the large language model series developed by Qwen team
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
gpt-oss-120b and gpt-oss-20b are two open-weight language models
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Provides convenient access to the Anthropic REST API from any Python 3
Recovering the Visual Space from Any Views
Diffusion Bee is the easiest way to run Stable Diffusion locally
26m function call model that runs on incredibly small devices
Python SDK for Claude Agent
Qwen3.5 is the large language model series developed by Qwen team
1B text generation model based on the HRM architecture
Open-Source Financial Large Language Models
Python inference and LoRA trainer package for the LTX-2 audio–video
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A multimodal model for brain response prediction
Claude Code image, a one-stop open source transit service
Industrial-level controllable zero-shot text-to-speech system