A framework to enable multimodal models to operate a computer
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Open source machine learning framework to automate text conversations
Reverse-engineered Python API for Google Gemini web app
Generate audiobooks from e-books, voice cloning & 1107+ languages
AIHawk aims to easy job hunt process by automating job applications
Framework for orchestrating role-playing, autonomous AI agents
Powerful tool that lets you create and run intelligent agents
Open source AI VTuber platform with voice chat and Live2D avatars
Flowly is 100x faster than OpenClaw
A frontier, first-principles handbook
Kimi Code CLI is your next CLI agent
GitLab automatic code review tool based on large models
A TTS that fits in your CPU (and pocket)
A generative speech model for daily dialogue
InvokeAI is a leading creative engine for Stable Diffusion models
The best way to use Hermes Agent from the web or from your phone
Open source AI model for generating full songs from lyrics prompts
LTX-Video Support for ComfyUI
ChatGPT interface with better UI
Make websites accessible for AI agents
AI agent harness for AI coding agents
A Systematic Framework for Interactive World Modeling
Industrial-level controllable zero-shot text-to-speech system
DeepSeek Coder: Let the Code Write Itself