Voice Recognition to Text Tool
A minimalist command line knowledge base manager
Speech recognition module for Python
SOTA Open Source TTS
Stanford NLP Python library for many human languages
MTEB: Massive Text Embedding Benchmark
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Generate audiobooks from e-books, voice cloning & 1107+ languages
Speech-AI-Forge is a project developed around TTS generation model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
High accuracy RAG for answering questions from scientific documents
The behavior guidance framework for customer-facing LLM agents
A python parametric CAD scripting framework based on OCCT
Mozc - a Japanese Input Method Editor designed for multi-platform
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
The Ren'Py Visual Novel Engine
Tokenizer-Free TTS for Multilingual Speech Generation
Text and image to video generation: CogVideoX and CogVideo
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Implementation of Imagen, Google's Text-to-Image Neural Network
Claude Code skill implementing Manus-style persistent planning
A simple, high-quality voice conversion tool focused on ease of use
Statusline plugin for vim with prompts for several other applications
Mixture-of-Experts Vision-Language Models for Advanced Multimodal