Stanford NLP Python library for many human languages
SOTA Open Source TTS
A fast and lightweight IDE
MTEB: Massive Text Embedding Benchmark
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Generate audiobooks from e-books, voice cloning & 1107+ languages
Speech-AI-Forge is a project developed around TTS generation models
High accuracy RAG for answering questions from scientific documents
The behavior guidance framework for customer-facing LLM agents
A Python parametric CAD scripting framework based on OCCT
The Ren'Py Visual Novel Engine
Mozc - a Japanese Input Method Editor designed for multi-platform use
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Tokenizer-Free TTS for Multilingual Speech Generation
Implementation of Imagen, Google's Text-to-Image Neural Network
Statusline plugin for vim with prompts for several other applications
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Text and image to video generation: CogVideoX and CogVideo
Transforming Multimodal Content into Captivating Multilingual Audio
CLIP: predict the most relevant text snippet given an image
A simple, high-quality voice conversion tool focused on ease of use
Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Tools to ease the creation of snippets, syntax definitions, etc.