Efficient few-shot learning with Sentence Transformers
A suite of advanced multi-modal LLMs
The GPU-powered AI application database
Stanford NLP Python library for many human languages
Stable Diffusion built-in to Blender
A Model Context Protocol (MCP) server
TextWorld is a sandbox learning environment for the training
Open Source Speech Language Model
AI-assisted storyboard and video generation tool
Implementing large models into scenario-based applications
SQL-Driven RAG Engine
Build chatbots and conversational experiences using React
Framework for building real-time voice and multimodal AI agents
Knowledge Graph Generation from Any Text
Controllable and fast Text-to-Speech for over 7000 languages
The python library for real-time communication
Open-source multi-speaker long-form text-to-speech model
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Towards Human-Sounding Speech
Extract schema, statistics and entities from datasets
A sound cloning tool with a web interface, using your voice
Structured data extraction and instruction calling with ML, LLM
Automated translation solution for visual novels
Web-based tool converts GitHub repository contents
Semantic search and document parsing tools for the command line