Instantly generate AI-powered subtitles on your device
Stable Diffusion web UI
A system for agentic LLM-powered data processing and ETL
Open source NLP guide with models, methods, and real use cases
Video translation and dubbing tool powered by LLMs
Efficient few-shot learning with Sentence Transformers
A suite of advanced multi-modal LLMs
The GPU-powered AI application database
Stanford NLP Python library for many human languages
Stable Diffusion built-in to Blender
A Model Context Protocol (MCP) server
TextWorld is a sandbox learning environment for the training
OpenRecall is a fully open-source, privacy-first alternative
Open Source Speech Language Model
AI-assisted storyboard and video generation tool
Implementing large models into scenario-based applications
SQL-Driven RAG Engine
Build chatbots and conversational experiences using React
Framework for building real-time voice and multimodal AI agents
Knowledge Graph Generation from Any Text
Open-source multi-speaker long-form text-to-speech model
Controllable and fast Text-to-Speech for over 7000 languages
The python library for real-time communication
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Towards Human-Sounding Speech