Showing 90 open source projects for "text based"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Podcastfy.ai

    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 256 This Week
    Last Update:
    See Project
  • 4
    JSpeech

    JSpeech

    Java library designed to integrate Speech-to-Text

    jSpeech is a Java library designed to integrate Speech-to-Text (STT) capabilities, command control, and diarization (speaker identification) into applications in a simple, modular, and decoupled way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 5
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3,...
    Leader badge
    Downloads: 90 This Week
    Last Update:
    See Project
  • 6
    SY-FloorBoard
    A collection of graphical midi based editors, each made specifically for the Boss SY-1000 , Roland VG-99 guitar, or VB-99 bass synth units, which provide an alternative to the Boss/Roland version of midi editors, with full features and control of the SY-1000, VG-99, or VB-99 from a Windows, Mac, or Linux computer.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 7
    Drumstick MIDI Monitor

    Drumstick MIDI Monitor

    MIDI monitor for Linux

    Drumstick MIDI Monitor is a MIDI monitor for Linux using ALSA sequencer and Qt5 user interface.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8
    JMP3Renamer
    JMP3Renamer is a plugin-based renamer/tagger written in Java. It supports automatical assignment of the data to the files and magic cookies to specify the filename format. Currently available plugins: Discogs, MusicBrainz, Filename, Filetag, Mp3, Ogg
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Midi_Master_Clock

    A simple command line Midi Master Clock

    .... - MIDI Sync Commands (Stop, Song Position Pointer (0), Start) to prepare your devices. - Accurate MIDI Clock signals based on the song's BPM, keeping your gear perfectly in sync.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10
    ARITA

    ARITA

    Extraordinary audio player for FreeBSD & GNU/Linux

    Unlike conventional audio players, ARITA aims to help music lovers to keep their precious collections well organized and tidy. Therefore it takes a different approach to arranging audio files. They must be emplaced into libraries: 'archives' or 'cuesheets'. Archive - compressionless container like CPIO. Instead of tags for description of tracks, a plain XML/YAML file is placed alongside each archive. Rather than embedding cover art in every track, put a single image next to a library. TIFF...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    QuickBeat

    QuickBeat

    Fully Customizable Full-Fledged Audio Player Supporting Many Formats

    Full fledged music player and local library manager. Enjoy your media collection in one unified interface, have access to your local network, Deezer, Youtube-DL and Remote file, never miss information on your remote media via remote tags reading, tweak your music how you want using a variety of built-in audio effects and if that doesn't suit you, you can import your winamp plugins , immerse yourself with our selection of visualizers and if you get bored with them, import your winamp,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Media Converter
    Media Converter is a plugin based video and audio converter. It uses FFmpeg as its engine, which allows it to convert to a lot of formats.
    Leader badge
    Downloads: 117 This Week
    Last Update:
    See Project
  • 13
    Pearl MATE Studio 11

    Pearl MATE Studio 11

    The Ultimate Audio Recording Studio for Linux

    This Update to the first release comes with the latest Low Latency kernel pre-installed as well as the Generic kernel and by many requests the MixBus DAW is back in place of Ardour. Minor bugs were fixed as well as software added. Here is whats new. 1) Mixbus 8 (Fully Functioning Demo) 2) Tuner Added 3) Pearl Extras Installer. Great for Producers coming from a Windows system using VST's. The script will install all packages needed to start using those plugins with searching google on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    MX Terminal

    MX Terminal

    A chat style app for the M32, X32, M-Air, X-Air digital consoles

    MX Terminal is a simple chat style app that can control the Midas and Behringer digital consoles including the M32, X32, M-Air and X-Air. Built on the popular Live Toolbox OSC engine, the text based interface provides the user full OSC,tidbit and the new English commands sets. Responses from the console can be returned in real world values (db, hz, etc.) instead of OSC values.And text to speech (TTS) is available for these responses for the visually impaired.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    XZVoice

    XZVoice

    Free and open source text-to-speech software

    Text-to-speech software developed by Electron + vue + ElementUI + js. The high-fidelity and flexible configuration of speech synthesis products opens up the closed loop of human-computer interaction and enables applications to sound realistically. A variety of timbres are available, and functions such as adjusting speech rate, intonation, and volume are provided.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    gentext_chung text and music generator
    gentext genmidi chung is a small text generation / midi piano music gen generator based on SORT dychotomic algorythm (inspired from ai neural networks RNN LSTM and Markov chains but not at all the same) .Trained with an input text file , it can generate random variants text / music stream in response to user input or freely (user enters empty input) or realtime non stop.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    LT_Command

    A terminal app for the M32, X32, M-Air, X-Air digital consoles

    LT_Command is a terminal/console app that can control the Midas and Behringer digital consoles including the M32, X32, M-Air and X-Air. Built on the popular Live Toolbox OSC engine, the text based interface provides the user full OSC,tidbit and the new English commands sets. It can also be launched from other popular apps using the command line arguments. Responses from the console can be returned in OSC or real world values (db, hz, etc.). Output can be sent to the terminal screen or logged to a file.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20

    Audius downloader

    Audius.co downloader with visuals and option ?search=[query]

    <!DOCTYPE HTML PUBLIC> <html> <head> <title>Audius Downloader 1.2.ok</title> <meta name="generator" content="Easy Adobe Truetype Media Error webfonts v1.2.ok" /> <meta name="date" content="2020/10/28" /> <meta name="version" content="1.2.ok" /><meta name="description" content="Audius.co downloader with visuals and option ?search=[query]"> <script src="https://ajax.googleapis.com/ajax/libs/jquery/3.5.1/jquery.min.js"></script> <script type="text/javascript"> var ssqry=""; var...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    ABC is a text-based music notation language. This project aims to promote the ABC music notation language by providing cross-platform, open-source tools for manipulating, displaying and playing (via MIDI) ABC files. This project is here for historic reasons and there is not too much activity now. The abcmidi project is updated more frequently.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    chatbot_chung
    ...SORT gentext text generation algorythm option added (desktop version) .
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23

    AhoTTS Multilingual, a Multilingual TTS

    Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English

    Text-to-Speech conversor for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Personal Radio Station OSE

    Personal Radio Station OSE

    Simple and efficent Radio Automation system linux based

    Gambas applications to manage and broadcast a little radio station in pure client server architecture. installation steps: 1) run settings module (preferences and db access information) 2) import using Music Manager application (use this module every time you would ingest new songs) 3) start main player application (PRS_ose) and keep broadcasting :D 4) add music strategies, commercials, etc with extra applications from the PRS suite :D Preferred language Gambas3 Database server...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Canorus

    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, a MIDI playback of notes, chord markings, lyrics, import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond
    Downloads: 18 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
MongoDB Logo MongoDB