Taming Stable Diffusion for Lip Sync
GUI for a Vocal Remover that uses Deep Neural Networks
From Images to High-Fidelity 3D Assets
Python inference and LoRA trainer package for the LTX-2 audio–video
Asynchronous multi-platform robot framework written in Python
Making ALL Software Agent-Native
Offline Text To Speech synthesis for python
Automatically translates the text of a video based on a subtitle file
Open-source infrastructure for Computer-Use Agents. Sandboxes
Open source AI VTuber platform with voice chat and Live2D avatars
Chat with your documents using local AI
Document Image Parsing via Heterogeneous Anchor Prompting”
Awesome multilingual OCR toolkits based on PaddlePaddle
Run Local LLMs on Any Device. Open-source
Run LLM prompts from your shell
The agent that grows with you
Persistent AI memory using local Markdown knowledge graphs
A cross-platform Python library for differentiable programming
Definitions for AI/ML tasks like dataset creation
Improve your Baduk skills by training with KataGo
On-device Speech-to-Intent engine powered by deep learning
Enable AI to control your desktop, mobile and HMI devices
Private chat with local GPT with document, images, video, etc.
Parallax is a distributed model serving framework
Enterprise platform for building and orchestrating AI agent workflows