GLM-4-Voice | End-to-End Chinese-English Conversational Model
Open-source abilities for OpenHome agents
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Graphical interface for yt-dlp, a tool for downloading YouTube videos.
Private chat with local GPT with document, images, video, etc.
Controllable and fast Text-to-Speech for over 7000 languages
Open source Loom alternative. Beautiful, shareable screen recordings
Changelog makes world-class developer pods
A lyric player component library that aims to look similar to iPad
Rust framework for building modular and scalable LLM-powered apps
Framework for building realtime multimodal voice AI agent apps
Examples and guides for using the Gemini API
The Tiny JavaScript Game Engine That Can!
A GPU-accelerated library containing highly optimized building blocks
Transcribe and translate audio offline on your personal computer
Generate high-definition story short videos with one click using AI
A tool for transcoding lossless audio files
"VideoRAG: Chat with Your Videos
Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop
Context data platform for building observable, self-learning AI agents
Foundational model for human-like, expressive TTS
Official MiniMax Model Context Protocol (MCP) server
Transcribe any audio to text, translate and edit subtitles 100% locally
A free multi-track audio editor and recorder
Open-source, Ad-free and Multi-source Music player.