Conversational voice AI agents
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
Multi-user UI for managing and running Stable Diffusion workflows tool
Context-aware desktop AI assistant that understands screen content
Multilingual speech recognition and audio understanding model
OpenDAN is an open source Personal AI OS
Clone a voice in 5 seconds to generate arbitrary speech in real-time
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Unleashing 10,000+ Word Generation from Long Context LLMs
Accelerate local LLM inference and finetuning
State-of-the-art TTS model under 25MB
Revolutionizing Database Interactions with Private LLM Technology
A single-file tkinter-based Ollama GUI project
AI Agent Networks for Open Collaboration
A robust, efficient, low-latency speech-to-text library
RL implementations
PraisonAI application combines AutoGen and CrewAI or similar framework
Automate browser-based workflows with LLMs and Computer Vision
Python-based neural networks API
A step-by-step guide to build your own AI agent
The repository provides code for running inference with SAM 2
GLM-4 series: Open Multilingual Multimodal Chat LMs
OpenLIT is an open-source LLM Observability tool
The ultimate RAG for your monorepo
Framework for building AI-powered interactive digital humans and agent