Text and image to video generation: CogVideoX and CogVideo
Proxy that exposes Antigravity-provided Claude / Gemini models
Qwen3-TTS is an open-source series of TTS models
Official Python inference and LoRA trainer package
The most powerful local music generation model
Qwen3 is the large language model series developed by the Qwen team
Awesome multilingual OCR toolkits based on PaddlePaddle
Recovering the Visual Space from Any Views
Python SDK for Claude Agent
A Unified Framework for Text-to-3D and Image-to-3D Generation
Python inference and LoRA trainer package for the LTX-2 audio–video
Native and Compact Structured Latents for 3D Generation
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Qwen3-ASR is an open-source series of ASR models
Official inference repo for FLUX.1 models
Open-Source Financial Large Language Models
Hackable and optimized Transformers building blocks
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A Pragmatic VLA Foundation Model
Provides convenient access to the Anthropic REST API from any Python 3
Claude Code mirror, a one-stop open-source relay service
Qwen3.5 is the large language model series developed by the Qwen team
A multimodal model for brain response prediction
Visual Causal Flow