Text- and image-to-video generation: CogVideoX and CogVideo
Official Python inference and LoRA trainer package
Qwen3-TTS is an open-source series of TTS models
Proxy that exposes Antigravity-provided Claude / Gemini models
The most powerful local music generation model
Official inference repo for FLUX.1 models
Qwen3 is the large language model series developed by the Qwen team
Awesome multilingual OCR toolkits based on PaddlePaddle
Recovering the Visual Space from Any Views
A Pragmatic VLA Foundation Model
Provides convenient access to the Anthropic REST API from any Python 3
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Native and Compact Structured Latents for 3D Generation
gpt-oss-120b and gpt-oss-20b are two open-weight language models
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Qwen3.5 is the large language model series developed by the Qwen team
26M function-calling model that runs on incredibly small devices
Python SDK for Claude Agent
Open-Source Financial Large Language Models
Extension index for stable-diffusion-webui
A multimodal model for brain response prediction
Python inference and LoRA trainer package for the LTX-2 audio–video
Industrial-level controllable zero-shot text-to-speech system
ChatGLM-6B: An Open Bilingual Dialogue Language Model
1B text generation model based on the HRM architecture