Text and image to video generation: CogVideoX and CogVideo
Proxy that exposes Antigravity provided claude / gemini models
Qwen3-TTS is an open-source series of TTS models
Official Python inference and LoRA trainer package
The most powerful local music generation model
Qwen3 is the large language model series developed by Qwen team
Awesome multilingual OCR toolkits based on PaddlePaddle
Recovering the Visual Space from Any Views
Python SDK for Claude Agent
Qwen3-ASR is an open-source series of ASR models
A Unified Framework for Text-to-3D and Image-to-3D Generation
Official inference repo for FLUX.1 models
Python inference and LoRA trainer package for the LTX-2 audio–video
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Open-Source Financial Large Language Models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Provides convenient access to the Anthropic REST API from any Python 3
Diffusion Bee is the easiest way to run Stable Diffusion locally
Claude Code image, a one-stop open source transit service
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
A multimodal model for brain response prediction
Qwen3.5 is the large language model series developed by Qwen team
Visual Causal Flow
DeepSeek Coder: Let the Code Write Itself
ChatGLM-6B: An Open Bilingual Dialogue Language Model