Showing 22 open source projects for "audio processing"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    FFBox

    FFBox

    A multimedia transcoded treasure chest / a FFmpeg case

    FFBox is a graphical multimedia processing application that provides an accessible interface for working with FFmpeg operations such as encoding, conversion, and editing. It allows users to perform tasks like trimming, merging, and compressing media files without using command-line tools. The software supports a wide range of audio and video formats, making it suitable for diverse media workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Claude Code Video Vision

    Claude Code Video Vision

    Give Claude the ability to watch and understand videos

    ...It supports multiple backends for audio processing, including local and cloud-based options, enabling flexible deployment depending on privacy or performance requirements.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    AI-Media2Doc

    AI-Media2Doc

    AI tool converting video/audio into structured documents instantly

    AI-Media2Doc is a web-based application that uses large language models to convert video and audio content into structured, readable documents in a single workflow. It is designed to transform multimedia inputs into formats such as knowledge notes, summaries, mind maps, and social-style articles, making content easier to review and reuse. AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not uploaded externally. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    OmniTools

    OmniTools

    Self-hosted collection of powerful web-based tools for everyday tasks

    ...It’s designed to replace the random assortment of “free online tools” people use for quick tasks, while avoiding ads, tracking, and the need to upload sensitive files to unknown servers. A key design choice is that file processing happens entirely on the client side, meaning your data stays in your browser instead of being sent to the backend. The tool catalog spans both technical and non-technical needs, including image, video, audio, PDF, text, date/time, math, and data format utilities like JSON/CSV/XML helpers. It’s also packaged for straightforward self-hosting, with a lightweight Docker image and simple run commands, so it can be deployed quickly on a homelab or internal network.
    Downloads: 10 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    Bilidown

    Bilidown

    Bilibili video parsing download tool, supports 8K video

    bilidown is a command-line tool designed to download videos and audio content from the Bilibili platform with high flexibility and control. It supports downloading single videos, playlists, and series, allowing users to archive content efficiently. The tool integrates FFmpeg to merge audio and video streams when necessary, ensuring compatibility and high-quality output. It provides options for selecting resolution, format, and output structure, giving users control over the download process....
    Downloads: 25 This Week
    Last Update:
    See Project
  • 6
    Youwee

    Youwee

    A beautiful, cross-platform downloader for YouTube, TikTok, Instagram

    Youwee is a modern cross-platform media downloader built with Tauri and React that supports downloading content from over 1800 websites including YouTube, TikTok, and Instagram. It provides a polished graphical interface that simplifies media downloading while leveraging powerful tools like yt-dlp and FFmpeg under the hood. The application supports advanced features such as batch downloads, playlist management, and extraction of audio or subtitles. It also integrates AI capabilities,...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 7
    Handy STT

    Handy STT

    A free, open source, and extensible speech-to-text application

    Handy is a free, open-source, offline speech-to-text application built for privacy, accessibility, and extensibility. Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active...
    Downloads: 46 This Week
    Last Update:
    See Project
  • 8
    ScreenPipe

    ScreenPipe

    AI app store powered by 24/7 desktop history. open source

    Screenpipe is an AI app store powered by continuous desktop history recording. It operates entirely locally, offering developers a platform to build, distribute, and monetize AI applications that leverage comprehensive contextual data from users' desktop activities. ​
    Downloads: 33 This Week
    Last Update:
    See Project
  • 9
    VERT.sh

    VERT.sh

    The next-generation file converter

    VERT is a modern, privacy-focused file conversion platform that leverages WebAssembly to perform conversions entirely on the user’s device rather than relying on cloud-based processing. Built with Svelte and TypeScript, it provides a clean and responsive interface for converting a wide variety of file types, including images, audio, video, and documents. One of its defining characteristics is its local-first approach, which eliminates the need to upload files to external servers, thereby improving both privacy and performance. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 10
    ffmpeg-commander

    ffmpeg-commander

    A web-based GUI for quickly generating common FFmpeg command-line

    ...The interface is inspired by tools like HandBrake, aiming to lower the barrier to entry for FFmpeg usage. Overall, it acts as a bridge between ease of use and powerful multimedia processing capabilities.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Suno AI API

    Suno AI API

    Use API to call the music generation AI of suno.ai

    Suno API is an unofficial open-source interface that enables developers to programmatically interact with Suno’s AI music generation platform, allowing automated creation of songs, lyrics, and audio content through API calls. It replicates the behavior of Suno’s web-based creation tools by reverse engineering internal endpoints and exposing them through a developer-friendly interface built with Python and FastAPI. The system supports asynchronous processing, enabling efficient handling of multiple generation requests and making it suitable for scalable applications and automation pipelines. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    Agili Hacker Podcast

    Agili Hacker Podcast

    AI tool that turns Hacker News posts into daily podcast updates

    Hacker Podcast is an AI-powered project that turns top Hacker News stories into a Chinese podcast. It automatically fetches trending posts each day, processes the content with AI, and generates concise summaries before converting them into audio. This creates a hands-free way to stay updated on tech, startups, and developer discussions without reading long threads. Hacker Podcast combines content aggregation, natural language processing, and text-to-speech to deliver clear and digestible updates. Users can listen through web interfaces or podcast platforms, while also accessing written summaries for deeper reading. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Markdownify MCP Server

    Markdownify MCP Server

    Convert files and web content into clean, usable Markdown easily

    ...It also allows retrieval of existing Markdown files, making it useful for documentation, research, and AI-assisted workflows. By standardizing content into Markdown, it helps unify inputs across different sources for better processing and integration with AI tools and developer environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Live API Web Console

    Live API Web Console

    A react-based starter app for using the Live API over websockets

    ...It ships with demo branches that show grounded search, function calling, and visualization—one example has the model calling a function that renders Vega/Altair graphs directly in the UI. Under the hood there’s an event-emitting WebSocket client, an audio in/out processing layer, and a minimal scaffolded view so you can focus on your app logic rather than wiring.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Spring AI Alibaba Examples

    Spring AI Alibaba Examples

    Spring AI Alibaba examples for building and testing AI apps

    ...It is designed to help developers understand core concepts, explore practical implementations, and follow best practices when building AI-powered systems using the Spring ecosystem. Each module focuses on a specific use case such as chat, image processing, audio handling, graph workflows, and retrieval-augmented generation. The examples highlight how to integrate AI models, manage prompts, handle memory, and build multi-model or multi-agent workflows. Developers can explore individual project folders for detailed instructions and implementation guidance. Spring AI Alibaba Examples also supports experimentation through playground modules and encourages contributions to expand real-world AI use cases and improve development practices.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    clip-js

    clip-js

    online video editor built with nextjs, remotion and ffmpeg

    clip-js is a browser-based video editor built with modern web technologies such as Next.js and Remotion, designed to provide real-time editing and rendering directly in the browser. It enables users to create and edit video compositions using a timeline interface, combining video, audio, images, and text layers into a single project. The system uses a WebAssembly port of FFmpeg to perform high-quality rendering, allowing export of videos without relying on server-side processing. It includes interactive controls for trimming, splitting, and arranging media elements with precise timing. The editor supports dynamic adjustments such as opacity, positioning, and layering to fine-tune compositions. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    AutoSubs

    AutoSubs

    Instantly generate AI-powered subtitles on your device

    ...Users can customize subtitle styling, adjust timing, and export results in multiple formats, making it suitable for content creators, filmmakers, and editors. AutoSubs is designed with performance in mind, offering efficient processing through a Rust-based backend and supporting multiple operating systems including Windows, macOS, and Linux.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 18
    Free Fps. Video FPS Converter

    Free Fps. Video FPS Converter

    Desktop app to change a video FPS

    Free FPS is an open‑source desktop app and scripts to change a video file frame rate (FPS) using FFmpeg. Unlike video editors, it does not add effects or alter content - it only adjusts playback speed and, if needed, re-encodes audio as well. Useful if you work with multiple videos shot at different frame rates that cannot be combined or edited without interpolation or frame loss. Also doubles as a fast video compressor: keep the original FPS and raise compression (e.g., higher CRF or...
    Leader badge
    Downloads: 22 This Week
    Last Update:
    See Project
  • 19
    wasmboy

    wasmboy

    Game Boy / Game Boy Color Emulator Library

    wasmboy is a Game Boy and Game Boy Color emulator built using WebAssembly and JavaScript, designed to run efficiently in both browsers and Node environments. It leverages modern web technologies such as HTML5 canvas and the Web Audio API to deliver graphics and sound directly within a web interface. The project emphasizes portability and integration, allowing it to be embedded into other applications as a reusable dependency. It supports a wide range of emulator features including save...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Tone.js

    Tone.js

    A Web Audio framework for making interactive music in the browser

    Tone.js is an open source Web Audio framework for creating interactive music in the browser. It has advanced scheduling capabilities, synths, effects and more. Tone.js is built to be useful and familiar to both musicians and audio programmers. It has common DAW (digital audio workstation) features for those looking to schedule events and tinker with pre-built synths and effects.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    aws-lambda-ffmpeg

    aws-lambda-ffmpeg

    An S3-triggered Amazon Web Services Lambda function

    aws-lambda-ffmpeg is a project that enables running FFmpeg within AWS Lambda environments, allowing serverless media processing workflows. It packages FFmpeg binaries and provides examples of how to execute video and audio processing tasks in a serverless context. The system allows users to perform operations such as transcoding, resizing, and format conversion without managing dedicated servers. It leverages cloud storage services like S3 for input and output handling, enabling scalable processing pipelines. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    pulsorclip

    pulsorclip

    Download videos from almost any website

    ...The application focuses on a controlled workflow instead of instant downloads. Users first provide a media URL, then select format, quality, and container before processing the file. It includes both a web interface built with Next.js and a Telegram bot that offers the same guided experience through chat. Both share a common backend in a monorepo structure. The system is fully self-hosted and designed to run in a single Docker deployment including both web and bot services. Users can download videos and audio in formats such as MP4, WebM, MKV, MP3, and M4A, with server-side processing and progress tracking.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB