Self-hosted AI audio transcription
The next-generation file converter
Pythonic bindings for FFmpeg's libraries
A web-based GUI for quickly generating common FFmpeg command-line
Fast multimodal LLM for real-time voice interaction and AI apps
AI-powered tool for generating, optimizing, and translating subtitles
FFmate is a modern and powerful automation layer
A suite of advanced multi-modal LLMs
The Python library for real-time communication
Towards Human-Sounding Speech
Use Microsoft Edge's online text-to-speech service from Python
Command library suitable for Android. It implements audio and video
AI tool that turns Hacker News posts into daily podcast updates
Encode and decode: RGB, YUV, H264, AAC, FLV, MP4, RTMP
Complete, downloadable code for a cool Discord music bot
Convert files and web content into clean, usable Markdown easily
Open Source Speech Language Model
A cycle-accurate Nintendo Game Boy Advance emulator
A React-based starter app for using the Live API over WebSockets
Framework for building real-time voice and multimodal AI agents
Instantly generate AI-powered subtitles on your device
Cross-platform, customizable ML solutions
2023, the latest audio and video learning materials, projects
Spring AI Alibaba examples for building and testing AI apps
Online video editor built with Next.js, Remotion, and FFmpeg