A Conversational Speech Generation Model
Towards Human-Level Text-to-Speech through Style Diffusion
A Python application to add watermarks (text or image) to PDF files
Retrieval Augmented Generation (RAG) framework
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
AI-powered tool to quickly remove watermarks from images flawlessly
Translate English to Bangla using CSV file format and range wise.
Overcoming Data Limitations for High-Quality Video Diffusion Models
mice stt tts
Python implementation of TextRank algorithms
dashAI: an interactive platform for training, evaluating and deploying
Free AI-powered church presentation & live sermon display app
Text to Speech Utility
AI-powered semantic indexing: automating the creation of book indexes
Ainee - AI Notetaking and Learning Companion
A graphical manager for ollama that can manage your LLMs
Clarity in the current fast-paced mess of Open Source innovation
Toolkit for audio, music, and speech generation
Guiding Instruction-based Image Editing via Multimodal Large Language
VITS2 backbone with multilingual-bert
Tool-integrated Reasoning LLM Agents
Code for the paper Language Models are Unsupervised Multitask Learners
AIlice is a fully autonomous, general-purpose AI agent