Controllable and fast Text-to-Speech for over 7000 languages
Advanced AI Explainability for computer vision
LLM-based agent for general-purpose software engineering tasks
HivisionIDPhotos: a lightweight and efficient AI ID photo tool
Edit videos with Claude Code
HunyuanVideo: A Systematic Framework For Large Video Generation Model
OCR expert VLM powered by Hunyuan's native multimodal architecture
Audio foundation model excelling in audio understanding
LLM for live-stream sales hosts
Open source AI VTuber platform with voice chat and Live2D avatars
A Personalized LLM-powered Agent Framework
Official implementation of DreamCraft3D
Generative AI reference workflows
Fast file processing tool with drag & drop and batch rename
Translate English to Bangla from CSV files, over a specified range.
AI-powered semantic indexing: automating the creation of book indexes
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
A Conversational Speech Generation Model
100% offline, AI-powered PDF redaction
Obsei is a low-code, AI-powered automation tool
Transformers4Rec is a flexible and efficient library
Repo for YaYi Chinese LLMs based on LLaMA 2 & BLOOM
PDF Combiner is a user-friendly, GUI-based tool built in