Showing 69 open source projects for "text based"

View related business solutions
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Podcastfy.ai

    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 256 This Week
    Last Update:
    See Project
  • 4
    JSpeech

    JSpeech

    Java library designed to integrate Speech-to-Text

    jSpeech is a Java library designed to integrate Speech-to-Text (STT) capabilities, command control, and diarization (speaker identification) into applications in a simple, modular, and decoupled way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3,...
    Leader badge
    Downloads: 90 This Week
    Last Update:
    See Project
  • 6
    SY-FloorBoard
    A collection of graphical midi based editors, each made specifically for the Boss SY-1000 , Roland VG-99 guitar, or VB-99 bass synth units, which provide an alternative to the Boss/Roland version of midi editors, with full features and control of the SY-1000, VG-99, or VB-99 from a Windows, Mac, or Linux computer.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 7
    JMP3Renamer
    JMP3Renamer is a plugin-based renamer/tagger written in Java. It supports automatical assignment of the data to the files and magic cookies to specify the filename format. Currently available plugins: Discogs, MusicBrainz, Filename, Filetag, Mp3, Ogg
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    Midi_Master_Clock

    A simple command line Midi Master Clock

    .... - MIDI Sync Commands (Stop, Song Position Pointer (0), Start) to prepare your devices. - Accurate MIDI Clock signals based on the song's BPM, keeping your gear perfectly in sync.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    QuickBeat

    QuickBeat

    Fully Customizable Full-Fledged Audio Player Supporting Many Formats

    Full fledged music player and local library manager. Enjoy your media collection in one unified interface, have access to your local network, Deezer, Youtube-DL and Remote file, never miss information on your remote media via remote tags reading, tweak your music how you want using a variety of built-in audio effects and if that doesn't suit you, you can import your winamp plugins , immerse yourself with our selection of visualizers and if you get bored with them, import your winamp,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Pearl MATE Studio 11

    Pearl MATE Studio 11

    The Ultimate Audio Recording Studio for Linux

    This Update to the first release comes with the latest Low Latency kernel pre-installed as well as the Generic kernel and by many requests the MixBus DAW is back in place of Ardour. Minor bugs were fixed as well as software added. Here is whats new. 1) Mixbus 8 (Fully Functioning Demo) 2) Tuner Added 3) Pearl Extras Installer. Great for Producers coming from a Windows system using VST's. The script will install all packages needed to start using those plugins with searching google on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    MX Terminal

    MX Terminal

    A chat style app for the M32, X32, M-Air, X-Air digital consoles

    MX Terminal is a simple chat style app that can control the Midas and Behringer digital consoles including the M32, X32, M-Air and X-Air. Built on the popular Live Toolbox OSC engine, the text based interface provides the user full OSC,tidbit and the new English commands sets. Responses from the console can be returned in real world values (db, hz, etc.) instead of OSC values.And text to speech (TTS) is available for these responses for the visually impaired.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    XZVoice

    XZVoice

    Free and open source text-to-speech software

    Text-to-speech software developed by Electron + vue + ElementUI + js. The high-fidelity and flexible configuration of speech synthesis products opens up the closed loop of human-computer interaction and enables applications to sound realistically. A variety of timbres are available, and functions such as adjusting speech rate, intonation, and volume are provided.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    gentext_chung text and music generator
    gentext genmidi chung is a small text generation / midi piano music gen generator based on SORT dychotomic algorythm (inspired from ai neural networks RNN LSTM and Markov chains but not at all the same) .Trained with an input text file , it can generate random variants text / music stream in response to user input or freely (user enters empty input) or realtime non stop.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    LT_Command

    A terminal app for the M32, X32, M-Air, X-Air digital consoles

    LT_Command is a terminal/console app that can control the Midas and Behringer digital consoles including the M32, X32, M-Air and X-Air. Built on the popular Live Toolbox OSC engine, the text based interface provides the user full OSC,tidbit and the new English commands sets. It can also be launched from other popular apps using the command line arguments. Responses from the console can be returned in OSC or real world values (db, hz, etc.). Output can be sent to the terminal screen or logged to a file.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17

    Audius downloader

    Audius.co downloader with visuals and option ?search=[query]

    <!DOCTYPE HTML PUBLIC> <html> <head> <title>Audius Downloader 1.2.ok</title> <meta name="generator" content="Easy Adobe Truetype Media Error webfonts v1.2.ok" /> <meta name="date" content="2020/10/28" /> <meta name="version" content="1.2.ok" /><meta name="description" content="Audius.co downloader with visuals and option ?search=[query]"> <script src="https://ajax.googleapis.com/ajax/libs/jquery/3.5.1/jquery.min.js"></script> <script type="text/javascript"> var ssqry=""; var...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    ABC is a text-based music notation language. This project aims to promote the ABC music notation language by providing cross-platform, open-source tools for manipulating, displaying and playing (via MIDI) ABC files. This project is here for historic reasons and there is not too much activity now. The abcmidi project is updated more frequently.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    chatbot_chung
    ...SORT gentext text generation algorythm option added (desktop version) .
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20

    AhoTTS Multilingual, a Multilingual TTS

    Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English

    Text-to-Speech conversor for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Canorus

    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, a MIDI playback of notes, chord markings, lyrics, import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond
    Downloads: 18 This Week
    Last Update:
    See Project
  • 22
    subs2srs

    subs2srs

    Convert movies and TV shows to flashcards

    subs2srs allows you to create import files for Anki or other Spaced Repetition Systems (SRS) based on your favorite foreign language movies and TV shows to aid in the language learning process. See http://subs2srs.sourceforge.net/ for more information.
    Leader badge
    Downloads: 39 This Week
    Last Update:
    See Project
  • 23
    BeepComp

    BeepComp

    Text-based chiptune creator

    Compose chiptunes with text files! And make mp3 and wav files of your songs to share with the world :) The audio synthesizer engine comes with 10 channels (9 music + 1 drum). The retro "beep" sounds reminiscent of old video game consoles and vintage PCs will take you back to the 8-bit era. You can shape your sound with waveforms - square, sawtooth etc. - and add LFO, delay and volume envelopes. Enjoy the 80's-ish synth drums, also! Demo...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    Takt

    Takt

    Text-based Music Programming Tools

    Takt is a text-based language for describing note and chord sequences as well as algorithmic compositon rules with C-like syntax. This software package contains a REPL (read-eval-play loop) interpreter for Takt, a MIDI-file-to-Takt translator, and an Emacs interface program which enables note entry from a MIDI keyboard.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    NotesTyper

    Convert your text typing into music

    NotesTyper system converts text typing into music. You will need computer keyboard, microphone (notebook mic is ok) and Chrome or Firefox browser. NotesTyper has multiple operating modes and settings, which allow to produce different music from same text. Computer keyboards do no allow to input keypress velocity, but NotesTyper overcomes this restriction by processing microphone input level when you type. When generating music all nuances of your typing are taken into account: what keys were...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB