Foundation model for image generation
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A Redis HTTP interface with JSON output
An open-source toolkit for monitoring Large Language Models (LLMs)
Full git and GitHub integration with Sublime Text
Industrial-level controllable zero-shot text-to-speech system
Make bilingual EPUB books using AI translation
Underthesea - Vietnamese NLP Toolkit
Crowdsourcing platform for full text transcription and tagging
Translate a video from one language to another and embed dubbing
State-of-the-art open-source TTS
Modern desktop RSS reader built with Electron, React, and Fluent UI
Generate audiobooks from e-books, with voice cloning and support for 1107+ languages
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Speech-AI-Forge is a project developed around TTS generation models
A Python toolbox for gaining geometric insights
The most accurate natural language detection library for Python
Collection of Gemma 3 variants that are trained for performance
A full spaCy pipeline and models for scientific/biomedical documents
A fast TTS architecture with conditional flow matching
Create videos with Stable Diffusion
Google Gen AI Python SDK provides an interface for developers
Qwen3-Omni is a natively end-to-end, omni-modal LLM
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A community-supported supercharged version of paperless