Community-driven Swift package for the OpenAI public API
Voice Recognition to Text Tool
Towards Human-Sounding Speech
AI tool that turns Hacker News posts into daily podcast updates
Learn audio and video knowledge, organize materials
Encoding and decoding: RGB, YUV, H.264, AAC, FLV, MP4, RTMP
Convert files and web content into clean, usable Markdown easily
Open Source Speech Language Model
Use Microsoft Edge's online text-to-speech service from Python
A cycle-accurate Nintendo Game Boy Advance emulator
A React-based starter app for using the Live API over WebSockets
Build AI-powered semantic search applications
Framework for building real-time voice and multimodal AI agents
Command library suitable for Android. It implements audio and video
AI that sees your screen and listens to conversations
Instantly generate AI-powered subtitles on your device
A sound cloning tool with a web interface, using your voice
Spring AI Alibaba examples for building and testing AI apps
Online video editor built with Next.js, Remotion, and FFmpeg
Linus learns analog circuits
2023, the latest audio and video learning materials, projects
Build Vision Agents quickly with any model or video provider
Cross-platform, customizable ML solutions
Document Image Parsing via Heterogeneous Anchor Prompting