Towards Human-Sounding Speech
Spring AI Alibaba examples for building and testing AI apps
Cross-platform, customizable ML solutions
Document Image Parsing via Heterogeneous Anchor Prompting”
Software that uses AI to perform real-time voice conversion
Abstraction layer over YouTube's internal API
Build Vision Agents quickly with any model or video provider
Framework for building realtime multimodal voice AI agents apps
Development skills for AI coding agents
Instantly generate AI-powered subtitles on your device
Private chat with local GPT with document, images, video, etc.
The Tiny JavaScript Game Engine That Can!
Get your documents ready for gen AI
Large Multimodal Models for Video Understanding and Editing
Freeverb3 DSP VST effect plugins
LLM Large Model of Selling Anchor
Controllable and fast Text-to-Speech for over 7000 languages
Build cross-modal and multimodal applications on the cloud
Rust binding generator, feature-rich, but seamless and simple
A system-wide equalizer for Windows 7 / 8 / 8.1 / 10 / 11
Multi-Format Audio-Encoder Front-end
Multichannel audio manipulation and batch processing for OpenMusic.
Free all in one audio/video ffmpeg batch encoder
Nyquist is a language for sound synthesis and music composition.
Stream audio from your PC to the internet with ease