SOTA discrete acoustic codec models with 40/75 tokens per second
InvokeAI is a leading creative engine for Stable Diffusion models
ComfyUI wrapper nodes for WanVideo and related models
Unleashing 10,000+ Word Generation from Long Context LLMs
Document content and metadata extraction microservice
A python tool that uses GPT-4, FFmpeg, and OpenCV
Edit videos with Claude Code
Bailing is a voice dialogue robot similar to GPT-4o
Chinese and English multimodal conversational language model
One-click deployment (including offline integration package)
Framework for building, orchestrating, and deploying AI agents
Open source NLP guide with models, methods, and real use cases
Benchmark LLMs by fighting in Street Fighter 3
Using AI models to automatically provide commentary and edit videos
Public opinion analysis system
Paste Markdown and AI responses into Word Excel instantly fast
Model Context Protocol Server for Apache OpenDAL™
LISA: Reasoning Segmentation via Large Language Model
Agent Skill for generating 2D sprite sheets and map, transparent PNG
Conversational voice AI agents
Automate native Android apps with AI using accessibility APIs
Ultra-Efficient LLMs on End Device
Low-latency AI inference engine optimized for mobile devices
Framework for building neural networks
Generate Any 3D Scene in Seconds