A lightweight text-to-speech model with zero-shot voice cloning
OCR software, free and offline
A minimalist command-line knowledge base manager
A simple tool for reading in poorly redacted documents
Edit PDF files with Nano Banana
Voice Recognition to Text Tool
SOTA Open Source TTS
A fast and lightweight IDE
Speech recognition module for Python
State-of-the-art TTS model under 25MB
Stanford NLP Python library for many human languages
High accuracy RAG for answering questions from scientific documents
A Family of Open Sourced Music Foundation Models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
A Python parametric CAD scripting framework based on OCCT
Generate audiobooks from e-books, voice cloning & 1107+ languages
tiktoken is a fast BPE tokeniser for use with OpenAI's models
MTEB: Massive Text Embedding Benchmark
Speech-AI-Forge is a project developed around TTS generation models
The Ren'Py Visual Novel Engine
Tokenizer-Free TTS for Multilingual Speech Generation
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Text and image to video generation: CogVideoX and CogVideo
Tools to ease the creation of snippets, syntax definitions, etc.