MCP server for interfacing with the Godot game engine
A multimodal model for brain response prediction
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Efficient multilingual NLP and text segmentation in Go
Open source text-to-speech tool, supports extra-long text
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Speech-AI-Forge is a project developed around TTS generation models
Your Personal AI super intelligence. Private, simple and powerful
Implementation of Imagen, Google's Text-to-Image Neural Network
Python binding to the Apache Tika™ REST services
The behavior guidance framework for customer-facing LLM agents
Implementation of Phenaki Video, which uses MaskGIT
Generate blog articles from video or audio
A text-to-speech, speech-to-text and speech-to-speech library
AI-powered bridge connecting LLMs and advanced AI agents
Workflow and speech recognition app
The official Python SDK for the ElevenLabs API
A community-supported supercharged version of paperless
Synchronized Translation for Videos
Persian NLP Toolkit
An open-source toolkit for monitoring Large Language Models (LLMs)
Browser extension and cross-platform desktop app based on the ChatGPT API
A TTS that fits in your CPU (and pocket)
Instantly generate AI-powered subtitles on your device
Open source visual editor for building React drag-and-drop pages