Showing 25 open source projects for "output"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    ...Moshi models two streams of audio: one corresponds to Moshi, and the other one to the user. At inference, the stream from the user is taken from the audio input, and the one for Moshi is sampled from the model's output. Along these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter codebook dependencies for a given time step, while a large, 7B parameter Temporal Transformer models the temporal dependencies.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. The project is built with extensibility in mind, enabling developers to add custom voices, integrate additional languages, and tailor the backend for different hardware or cloud environments. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    LiveAvatar

    LiveAvatar

    Streaming Real-time Audio-Driven Avatar Generation

    LiveAvatar is an open-source research and implementation project that provides a unified framework for real-time, streaming, interactive avatar video generation driven by audio and other control signals. It implements techniques from state-of-the-art diffusion-based avatar modeling to support infinite-length continuous video generation with low latency, enabling interactive AI avatars that maintain continuity and realism over extended sessions. The project co-designs algorithms and system...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    Mopidy

    Mopidy

    Mopidy is an extensible music server written in Python

    ...With Mopidy's extension support, you can easily add backends for new music sources. Mopidy is a Python application that runs in a terminal or in the background on Linux computers or Macs that have network connectivity and audio output. Out of the box, Mopidy is an HTTP server. If you install the Mopidy-MPD extension, it becomes an MPD server too. Many additional frontends for controlling Mopidy are available as extensions. You and the people around you can all connect their favorite MPD or web client to the Mopidy server to search for music and manage the playlist together.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    MediaCrate — Video/Audio Downloader

    MediaCrate — Video/Audio Downloader

    Download video and audio from over 1,000+ websites with one click

    MediaCrate is a lightweight desktop application for downloading video and audio from various websites, including YouTube, Instagram, TikTok, Facebook and many others. It's rather simple to use. Paste a link, select format and quality, and download. MediaCrate is designed with performance and simplicity in mind, maintaining minimal CPU usage while idle and a small memory footprint during downloads. Project links: Website: justagwas.com/projects/mediacrate GitHub:...
    Leader badge
    Downloads: 19 This Week
    Last Update:
    See Project
  • 7
    VCClient

    VCClient

    Software that uses AI to perform real-time voice conversion

    ...The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a graphical user interface and API access, making it suitable for casual users as well as developers who want to integrate voice transformation into their own applications. The project also supports GPU acceleration, enabling faster inference and smoother real-time performance on compatible hardware. Additionally, it includes tools for training and managing voice models, giving users the ability to create personalized voice profiles.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 8
    A2M — Audio to MIDI

    A2M — Audio to MIDI

    A2M is a desktop app that converts AUDIO TO MIDI in one click.

    A2M (Audio To MIDI) is a simple desktop tool for transcribing local audio files into MIDI files with one click. It is designed primarily for piano recording transcription, and works best on solo piano recordings. Using A2M is straightforward: Select an audio file, click Convert, and the application generates a MIDI file automatically in your Downloads/A2M folder. All processing is done locally on your device, no uploads, no accounts, and no telemetry. The app runs on CPU by...
    Leader badge
    Downloads: 61 This Week
    Last Update:
    See Project
  • 9
    Free Karaoke File Maker

    Free Karaoke File Maker

    Free Karaoke File Maker

    ...By default, it will be saved with 2 audio tracks of singer + melody. If you want to save only the melody without the singer's voice, you have to select the No Vocal option. To save the output file, click Save Folder and choose the location you want to save (Default: Desktop). If you are sure of the above preparations, you can change the file you want to change by holding down the mouse and dragging it onto the Drag & Drop Input File. (No internet needed) You can also change it by clicking Select File.
    Downloads: 8 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10

    MP3FileStructureAnalyser

    Analyse a MP3 file structure (ID3 and frame headers) into JSON objects

    The project contains a small command line tool (written in Python) that analyses the file structure (ID3 and frame headers) of MP3 files and projects the structure into a JSON object.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    vahatraker

    vahatraker

    a live MIDI sequencer for JACK

    ...Adhering to Unix philosophy, driving MIDI is the one thing it tries to do well, adhering to other doctrines - enabling expression, with added value of: - live editing - fast workflow - intuitive midi-in - unheard of time signatures - scenes a'la 'ton - fractal turtles - fits on a floppy - doesn't make sound Frankly speaking, vht was envisaged as a re-creation of seq24 in tracker form for author's "studio needs" and offers similar functionality (and limitations). It relies 100% on JACK audio connection kit for input/output/synch and uses jack_capture for rendering. The GUI has similar dependencies as gnome-calculator and tracker paradigm was chosen to allow for rhythmic gymnastics otherwise hard to convey. Low level stuff was done in C and wrapped in Python. Human interfacing part of contraption employs GTK through gobject introspection and was also contrived in the language we shall no longer spam abo
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12

    Midi_Master_Clock

    A simple command line Midi Master Clock

    MIDI Master Clock: Simple MIDI Song and Control Change Manager MIDI Master Clock is a lightweight, command-line utility for effortless MIDI device control and synchronization. It lets you define songs or cues in a simple text file, each with its own BPM and custom MIDI Control Change (CC) messages. What It Does: Reads Song Data: Configure song titles, BPM, and MIDI CCs (e.g., CC_ch0 10) in an easy-to-read data.txt file. Interactive Control: Select songs from a list, advance through...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    footswitch3basic

    footswitch3basic

    Audio Transcription software for Linux (Gstreamer) with a foot pedal

    ...There are timestamp macros, along with media player control macros for Play/Pause toggle Jump backwards Jump forwards Slower Faster Normal Audio speed Jump to timestamp This is the "BASIC" version stripped of some of the bells and whistles. As fs3 Basic uses LibreOffice for the transcription input, it means you are working in a world class editor, which can produce output files in most popular formats including MS Word files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    footswitch2basic

    footswitch2basic

    Audio Transcription software for Linux (Vlc) with a foot pedal

    ...Control of the media player from LibreOffice can be via Hotkeys or an integrated toolbar. As fs2 Basic uses LibreOffice for the transcription input, it means you are working in a world class editor, which can produce output files in most popular formats including MS Word files. If moving from Footswitch2 to Footswitch2 Basic, any existing database files will be converted for you.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    psgdump

    Dump psg/ym chip tune files to txt and midi format

    PSGDump tool is parser and converter for chip tune files. It supports PSG and YM input file formats, focusing on AY/YM chip tunes from ZX Spectrum and Atari ST. The tool produces text output of notes played and creates multi-track MIDI file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    superboucle

    superboucle

    Loop based software with jack transport, record and midi controllable

    SuperBoucle is a loop based software fully controllable with any midi device. SuperBoucle is also synced with jack transport. You can use it on live performance or for composition. SuperBoucle is composed of a matrix of sample controllable with external midi device like pad. SuperBoucle will send back information to midi device (light up led). Sample will always start and stop on a beat or group of beats. You can adjust duration of sample (loop period) in beat and offset in beat. But you...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Splitalbum is a shntool frontend for Linux. It helps you to split your audio cd images into separate tracks according to cue sheet. It also uses cuetools and mutagen for tagging and has a Qt GUI. Most of modern audio formats are supported.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Python Script to process a wave file and transform to movement (key frames) on Blender 2.5. Input: sound file (.wav) Output: Key frames direct into the selected object Work to meshes and bones (location, scale or rotation in any axis)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    slurry is a simple python program that plays sounds at random. it is being created primarily for an experimental film screening in June 2010. it will continue to be developed after this.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    streamplayer
    A very simple, CLI, stream player. It's just a mplayer wrapper.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Moved to Github: http://github.com/tremby/py-lyrics
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Convert text to International Morse Code. Input is ASCII text. Output can be: - . -..- - on the console, raw 8bit PCM suitable for piping to /dev/audio, .wav files or even (mp3|ogg). Good for headlines on your MP3 player or code practice.
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    Amphion is an audio server and a collection of clients designed to play audio tracks, maintain playlists, and tweak the output volume. Amphion lets you plug a server into your stereo and control the tracks played on your stereo from another machine.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ...It can be used both, for general audio processing/editing and as a fuzzbox. Based on a very flexible plug-in system, it has been coded in python and currently uses portaudio for sound input/output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB