Automatically translates the text of a video based on a subtitle file
Han Language Processing
A Web UI for easy subtitle generation using the Whisper model
Official PyTorch Implementation
Reading book source
Flowly is 100x faster than OpenClaw
Framework for building AI-powered interactive digital humans and agents
Conversational voice AI agents
Scalable generative AI framework built for researchers and developers
Multi-modal large language model designed for audio understanding
Stanford NLP Python library for many human languages
Large language model for livestream sales hosts
High-quality multi-lingual text-to-speech library by MyShell.ai
Easy-to-use Speech Toolkit including Self-Supervised Learning models
A Conversational Speech Generation Model
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
One-click deployment (including offline integration package)
Generate blog articles from video or audio
A very simple framework for state-of-the-art NLP
Transforming Multimodal Content into Captivating Multilingual Audio
Towards Studio-Grade Character Animation via In-Context Learning of 3D
FAIR Sequence Modeling Toolkit 2
Open source plain text editor designed for writing novels
Focus on prompting and generating
Towards Human-Level Text-to-Speech through Style Diffusion