Showing 216 open source projects for "text"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Outgrown Windows Task Scheduler? Icon
    Outgrown Windows Task Scheduler?

    Free diagnostic identifies where your workflow is breaking down—with instant analysis of your scheduling environment.

    Windows Task Scheduler wasn't built for complex, cross-platform automation. Get a free diagnostic that shows exactly where things are failing and provides remediation recommendations. Interactive HTML report delivered in minutes.
    Download Free Tool
  • 1
    Frescobaldi

    Frescobaldi

    LilyPond sheet music text editor

    Frescobaldi is a free and open source LilyPond sheet music text editor. Designed to be powerful yet lightweight and easy-to-use, Frescobaldi offers great functionality and a host of useful features such as music view with advanced two-way Point & Click, Midi capturing to enter music, a Snippet Manager and many more. Frescobaldi is named after Girolamo Frescobaldi (1583-1643), an Italian composer of keyboard music in the late Renaissance and early Baroque period.
    Downloads: 34 This Week
    Last Update:
    See Project
  • 2
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 4
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Podcastfy.ai

    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Tagify

    Tagify

    Lightweight, efficient Tags input component in Vanilla JS

    Transforms an input field or a textarea into a Tags component, in an easy, customizable way, with great performance and a small code footprint, exploded with features. Customizable HTML templates for the different areas of the component (wrapper, tags, dropdown, dropdown item, dropdown header, dropdown footer) Shows suggestions list (flexible settings & styling) at full (component) width or next to the typed texted (caret) Allows setting suggestions' aliases for easier fuzzy-searching....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8

    Omilo - a text to speech application

    Omilo is a simple text to speech application

    Omilo is a simple text to speech application for Windows and Linux using Festival, Flite, Marytts and Piper voices.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Buzz

    Buzz

    Transcribe and translate audio offline on your personal computer

    Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API. Get linux versions from: - https://flathub.org/apps/io.github.chidiwilliams.Buzz - https://snapcraft.io/buzz Home page of Buzz https://github.com/chidiwilliams/buzz Note for Windows: App is not signed, you will get a warning when you install it. ...
    Leader badge
    Downloads: 4,252 This Week
    Last Update:
    See Project
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 10
    Fx FloorBoard

    Fx FloorBoard

    Graphical editor software for many Boss and Roland effect & synth unit

    Editors for the BOSS GT-1, 3, 5, 6, 8, Pro, 10, 100, 001 Guitar Multi-Effects Processors, and the BOSS GT-1B,6B,10B Bass Multi-Effects Processors. And for the Katana Amplifier. and various Boss and Roland guitar synths. This software can visually edit parameters via Midi,USB, or Bluetooth(via external device on some units) on the Multi-Effects/Amp/Synth Processor.
    Leader badge
    Downloads: 684 This Week
    Last Update:
    See Project
  • 11
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 147 This Week
    Last Update:
    See Project
  • 12
    RemoteTTS

    RemoteTTS

    Tool to remotely activate Text-To-Speech (TTS) on a server

    The tool provides a simple TCP/UDP interface to let a remote machine perform TTS outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3,...
    Leader badge
    Downloads: 91 This Week
    Last Update:
    See Project
  • 14
    htmid

    htmid

    Generative Music For Beginners and Everyone Else

    Generative music is a fascinating and innovative approach to music creation that involves creating procedurally generated music that evolves and changes over time. Whether you're a beginner or a seasoned musician, this guide will introduce you to the world of generative music and show you how to create your own live music performances. Generative music is music that is ever-changing and created in real-time. It can be created by anyone, with or without musical experience. Learn how to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Drumstick MIDI Monitor

    Drumstick MIDI Monitor

    MIDI monitor for Linux

    Drumstick MIDI Monitor is a MIDI monitor for Linux using ALSA sequencer and Qt5 user interface.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 16
    SY-FloorBoard
    A collection of graphical midi based editors, each made specifically for the Boss SY-1000 , Roland VG-99 guitar, or VB-99 bass synth units, which provide an alternative to the Boss/Roland version of midi editors, with full features and control of the SY-1000, VG-99, or VB-99 from a Windows, Mac, or Linux computer.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 17

    abc2midiu

    Updated version of abc2midi

    This project is to create an updated version of abc2midi, a tool to generate MIDI files from abc music notation. As well as improving the code, the intention is to integrate features from and improve compatibilty with the abc 2.2 standard.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    Akkeyrdion

    Akkeyrdion

    Chromatic MIDI accordion with computer keyboard

    ...Supports both right and left hand system, currently: - C griff (right hand) - B griff (right hand) - Stradella (left hand standard bass system) Computer keyboard layout must be implemented as a text file in the "keyboard" folder. Currently only italian layout is implemented [UPDATE: added a ISO-fr file in osx release], if you want to implement a layout for your keyboard just read the instruction in the "ISO-it" file (please let me know if it works!). Current version: 1.0 Release notes -> https://sourceforge.net/p/akkeyrdion/blog/
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    JMP3Renamer
    JMP3Renamer is a plugin-based renamer/tagger written in Java. It supports automatical assignment of the data to the files and magic cookies to specify the filename format. Currently available plugins: Discogs, MusicBrainz, Filename, Filetag, Mp3, Ogg
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    Midi_Master_Clock

    A simple command line Midi Master Clock

    MIDI Master Clock: Simple MIDI Song and Control Change Manager MIDI Master Clock is a lightweight, command-line utility for effortless MIDI device control and synchronization. It lets you define songs or cues in a simple text file, each with its own BPM and custom MIDI Control Change (CC) messages. What It Does: Reads Song Data: Configure song titles, BPM, and MIDI CCs (e.g., CC_ch0 10) in an easy-to-read data.txt file. Interactive Control: Select songs from a list, advance through your set, or quit, all from the command line. Automated MIDI Commands: When you select a song, MIDI Master Clock automatically sends: - All specified MIDI Control Changes (CCs), ideal for triggering effects or states...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    audiochain

    audiochain

    The Java Multitrack Audio Recording Software AudioChain

    AudioChain is a software for recording and mixing sound. It is primarily designed to simplify your home recording tasks, while it is not limited to that. This software helps you focus on creating your song, which should be a very musical and creative process. AudioChain aims to avoid bothering you to handle with a complicated graphical user interface. The AudioChain software is a convenient alternative to a hardware multitrack recorder. All you need is a computer with a good audio...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    ARITA

    ARITA

    Extraordinary audio player for FreeBSD & GNU/Linux

    ...Optional 'signature' files can be created to keep checksums of tracks, so you may ask ARITA to verify them and make sure nothing is damaged due to a sudden failure of HDD/SSD. As for 'cuesheets': tracks are merged into a single continuous audio file and a supplementary text file, which provides information on where tracks start and end.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    QuickBeat

    QuickBeat

    Fully Customizable Full-Fledged Audio Player Supporting Many Formats

    Full fledged music player and local library manager. Enjoy your media collection in one unified interface, have access to your local network, Deezer, Youtube-DL and Remote file, never miss information on your remote media via remote tags reading, tweak your music how you want using a variety of built-in audio effects and if that doesn't suit you, you can import your winamp plugins , immerse yourself with our selection of visualizers and if you get bored with them, import your winamp,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Quick DICOM Tag Editor

    Quick DICOM Tag Editor

    View and edit DICOM tags (Windows & Mac & Linux)

    * General description - View and modify DICOM tags of a single file - View and modify DICOM tags from multiple files - Dump DICOM tags into a text file - Preview images (DICOM pixel data) * Update history - All update history list have been moved to the Wiki page
    Leader badge
    Downloads: 59 This Week
    Last Update:
    See Project
  • 25
    Coqui STT

    Coqui STT

    The deep learning toolkit for speech-to-text

    ...With Coqui, dubbing is a delight. Effortlessly clone the voice of your talent into another language and let the clone do the dub. With text-to-speech, experience the immediacy of script-to-performance. Cast from a wide selection of high-quality, directable, emotive voices or clone a voice to suit your needs. With Coqui text-to-speech, production times go from months to minutes.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next