Marrying Grounding DINO with Segment Anything & Stable Diffusion
Fast stable diffusion on CPU and AI PC
HY-Motion model for 3D character animation generation
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Chinese and English multimodal conversational language model
High-Resolution Image Synthesis with Latent Diffusion Models
Stable Diffusion built-in to Blender
ComfyUI wrapper nodes for WanVideo and related models
The Python code to reproduce illustrations from Machine Learning Book
Supercharge Your LLM with the Fastest KV Cache Layer
Large-language-model & vision-language-model based on Linear Attention
Fast multimodal LLM for real-time voice interaction and AI apps
Chat with it via text and voice
Easy-to-use and high-performance NLP and LLM framework
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Google Gen AI Python SDK provides an interface for developers
Open source libraries and APIs to build custom preprocessing pipelines
Qwen3-ASR is an open-source series of ASR models
Conversational voice AI agents
LLM abstractions that aren't obstructions
Flexible Photo Recrafting While Preserving Your Identity
ImageBind One Embedding Space to Bind Them All
Multi-tool for semantic search
Flowly is 100x faster than OpenClaw
Unified Multimodal Understanding and Generation Models