Showing 57 open source projects for "text based"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Podcastfy.ai

    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    JSpeech

    JSpeech

    Java library designed to integrate Speech-to-Text

    jSpeech is a Java library designed to integrate Speech-to-Text (STT) capabilities, command control, and diarization (speaker identification) into applications in a simple, modular, and decoupled way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3,...
    Leader badge
    Downloads: 90 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    SY-FloorBoard
    A collection of graphical midi based editors, each made specifically for the Boss SY-1000 , Roland VG-99 guitar, or VB-99 bass synth units, which provide an alternative to the Boss/Roland version of midi editors, with full features and control of the SY-1000, VG-99, or VB-99 from a Windows, Mac, or Linux computer.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 6
    JMP3Renamer
    JMP3Renamer is a plugin-based renamer/tagger written in Java. It supports automatical assignment of the data to the files and magic cookies to specify the filename format. Currently available plugins: Discogs, MusicBrainz, Filename, Filetag, Mp3, Ogg
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    Midi_Master_Clock

    A simple command line Midi Master Clock

    .... - MIDI Sync Commands (Stop, Song Position Pointer (0), Start) to prepare your devices. - Accurate MIDI Clock signals based on the song's BPM, keeping your gear perfectly in sync.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    QuickBeat

    QuickBeat

    Fully Customizable Full-Fledged Audio Player Supporting Many Formats

    Full fledged music player and local library manager. Enjoy your media collection in one unified interface, have access to your local network, Deezer, Youtube-DL and Remote file, never miss information on your remote media via remote tags reading, tweak your music how you want using a variety of built-in audio effects and if that doesn't suit you, you can import your winamp plugins , immerse yourself with our selection of visualizers and if you get bored with them, import your winamp,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Media Converter
    Media Converter is a plugin based video and audio converter. It uses FFmpeg as its engine, which allows it to convert to a lot of formats.
    Leader badge
    Downloads: 117 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Pearl MATE Studio 11

    Pearl MATE Studio 11

    The Ultimate Audio Recording Studio for Linux

    This Update to the first release comes with the latest Low Latency kernel pre-installed as well as the Generic kernel and by many requests the MixBus DAW is back in place of Ardour. Minor bugs were fixed as well as software added. Here is whats new. 1) Mixbus 8 (Fully Functioning Demo) 2) Tuner Added 3) Pearl Extras Installer. Great for Producers coming from a Windows system using VST's. The script will install all packages needed to start using those plugins with searching google on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    MX Terminal

    MX Terminal

    A chat style app for the M32, X32, M-Air, X-Air digital consoles

    MX Terminal is a simple chat style app that can control the Midas and Behringer digital consoles including the M32, X32, M-Air and X-Air. Built on the popular Live Toolbox OSC engine, the text based interface provides the user full OSC,tidbit and the new English commands sets. Responses from the console can be returned in real world values (db, hz, etc.) instead of OSC values.And text to speech (TTS) is available for these responses for the visually impaired.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    XZVoice

    XZVoice

    Free and open source text-to-speech software

    Text-to-speech software developed by Electron + vue + ElementUI + js. The high-fidelity and flexible configuration of speech synthesis products opens up the closed loop of human-computer interaction and enables applications to sound realistically. A variety of timbres are available, and functions such as adjusting speech rate, intonation, and volume are provided.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    LT_Command

    A terminal app for the M32, X32, M-Air, X-Air digital consoles

    LT_Command is a terminal/console app that can control the Midas and Behringer digital consoles including the M32, X32, M-Air and X-Air. Built on the popular Live Toolbox OSC engine, the text based interface provides the user full OSC,tidbit and the new English commands sets. It can also be launched from other popular apps using the command line arguments. Responses from the console can be returned in OSC or real world values (db, hz, etc.). Output can be sent to the terminal screen or logged to a file.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16

    Audius downloader

    Audius.co downloader with visuals and option ?search=[query]

    <!DOCTYPE HTML PUBLIC> <html> <head> <title>Audius Downloader 1.2.ok</title> <meta name="generator" content="Easy Adobe Truetype Media Error webfonts v1.2.ok" /> <meta name="date" content="2020/10/28" /> <meta name="version" content="1.2.ok" /><meta name="description" content="Audius.co downloader with visuals and option ?search=[query]"> <script src="https://ajax.googleapis.com/ajax/libs/jquery/3.5.1/jquery.min.js"></script> <script type="text/javascript"> var ssqry=""; var...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    ABC is a text-based music notation language. This project aims to promote the ABC music notation language by providing cross-platform, open-source tools for manipulating, displaying and playing (via MIDI) ABC files. This project is here for historic reasons and there is not too much activity now. The abcmidi project is updated more frequently.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    chatbot_chung
    ...SORT gentext text generation algorythm option added (desktop version) .
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19

    AhoTTS Multilingual, a Multilingual TTS

    Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English

    Text-to-Speech conversor for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Canorus

    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, a MIDI playback of notes, chord markings, lyrics, import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond
    Downloads: 18 This Week
    Last Update:
    See Project
  • 21
    BeepComp

    BeepComp

    Text-based chiptune creator

    Compose chiptunes with text files! And make mp3 and wav files of your songs to share with the world :) The audio synthesizer engine comes with 10 channels (9 music + 1 drum). The retro "beep" sounds reminiscent of old video game consoles and vintage PCs will take you back to the 8-bit era. You can shape your sound with waveforms - square, sawtooth etc. - and add LFO, delay and volume envelopes. Enjoy the 80's-ish synth drums, also! Demo...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    Takt

    Takt

    Text-based Music Programming Tools

    Takt is a text-based language for describing note and chord sequences as well as algorithmic compositon rules with C-like syntax. This software package contains a REPL (read-eval-play loop) interpreter for Takt, a MIDI-file-to-Takt translator, and an Emacs interface program which enables note entry from a MIDI keyboard.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    NotesTyper

    Convert your text typing into music

    NotesTyper system converts text typing into music. You will need computer keyboard, microphone (notebook mic is ok) and Chrome or Firefox browser. NotesTyper has multiple operating modes and settings, which allow to produce different music from same text. Computer keyboards do no allow to input keypress velocity, but NotesTyper overcomes this restriction by processing microphone input level when you type. When generating music all nuances of your typing are taken into account: what keys were...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Nightingale

    Nightingale

    A community supported fork of the Songbird media player and library.

    Nightingale is a community created fork of the Songbird media player. It is developed by a proud community and we are equally proud to bring you the most extensible, feature-rich media experience on Windows, Mac, and Linux. See the official website at http://getnightingale.com for the source, builds, and information. On Sourceforge, we provide our releases, the binary deps for building, as well as builds for testing purposes.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB