text to free download

Showing 47 open source projects for "text to"

View related business solutions

Sound/Audio Python Clear Filters & Widen Search

AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
Outgrown Windows Task Scheduler?
Free diagnostic identifies where your workflow is breaking down—with instant analysis of your scheduling environment.

Windows Task Scheduler wasn't built for complex, cross-platform automation. Get a free diagnostic that shows exactly where things are failing and provides remediation recommendations. Interactive HTML report delivered in minutes.

Download Free Tool
1

PersonaPlex

PersonaPlex code

...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.

Downloads: 9 This Week

Last Update: 7 days ago
See Project
2

AudioCraft

Audiocraft is a library for audio processing and generation

AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. ...

Downloads: 1 This Week

Last Update: 2025-10-13
See Project
3

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...

Downloads: 7 This Week

Last Update: 2025-12-31
See Project
4

Moshi

A speech-text foundation model for real time dialogue

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps).

Downloads: 4 This Week

Last Update: 2024-11-05
See Project
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)

Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.

Learn More
5

Text to Chord

Turn words into chords

Convert words and sentences to 5 note chords you can use to inspire music creation. Have fun turning your name, your city name, your friends' names, your team's name, your pet's name into wild and original harmonies that go beyond serialism and classic jazz.

Downloads: 2 This Week

Last Update: 2025-12-06
See Project
6

Podcastfy.ai

Transforming Multimodal Content into Captivating Multilingual Audio

Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.

Downloads: 0 This Week

Last Update: 2024-11-16
See Project
7

Text to Waveform

Create synth presets from words

Convert words to waveforms you can load into a synthesizer oscillator to create synth presets. Have fun turning your name, your friends' names, your city name, your pet's name, your team's name into synth presets you can use to produce a track.

Downloads: 1 This Week

Last Update: 2023-12-09
See Project
8

Midi_Master_Clock

A simple command line Midi Master Clock

MIDI Master Clock: Simple MIDI Song and Control Change Manager MIDI Master Clock is a lightweight, command-line utility for effortless MIDI device control and synchronization. It lets you define songs or cues in a simple text file, each with its own BPM and custom MIDI Control Change (CC) messages. What It Does: Reads Song Data: Configure song titles, BPM, and MIDI CCs (e.g., CC_ch0 10) in an easy-to-read data.txt file. Interactive Control: Select songs from a list, advance through your set, or quit, all from the command line. Automated MIDI Commands: When you select a song, MIDI Master Clock automatically sends: - All specified MIDI Control Changes (CCs), ideal for triggering effects or states...

Downloads: 0 This Week

Last Update: 2025-06-17
See Project
9

psgdump

Dump psg/ym chip tune files to txt and midi format

PSGDump tool is parser and converter for chip tune files. It supports PSG and YM input file formats, focusing on AY/YM chip tunes from ZX Spectrum and Atari ST. The tool produces text output of notes played and creates multi-track MIDI file.

Downloads: 0 This Week

Last Update: 2022-09-19
See Project
Grafana: The open and composable observability platform
Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

Grafana is the open source analytics & monitoring solution for every database.

Learn More
10

18k-youtube-download

❤️ 18k-youtube-download with python and kivy Dev.Wk-18k

18k-youtube-download A simple project to make gui on kivy mixed with the function of downloading music from youtube with youtube_dl package

Downloads: 0 This Week

Last Update: 2022-02-25
See Project
11

DeepSpeech

Open source embedded speech-to-text engine

...If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.

Downloads: 16 This Week

Last Update: 2021-04-08
See Project
12

FastoCloud PRO

IPTV/NVR/CCTV/Video cloud https://fastocloud.com

IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch Mozaic Many Outputs Physical Inputs Streaming Protocols File Formats Presets Vods/Series server-side support Pay per view channels Channels on demand HTTP Live Streaming (HLS) server-side support Public API, client server communication via JSON RPC Protocol gzip compression Deep learning video analysis Supported deep learning frameworks: Tensorflow NCSDK Caffe ML Hardware:

Downloads: 0 This Week

Last Update: 2020-06-20
See Project
13

Defox text to speech and downloader

Written or imported text offline read or online download.

This software design to convert text to speech and download the converted speech. Description : • Installation setup with two languages (English, French) • Two areas called text reading and speech downloading • Many languages supported to download center Note 1: I'm a student yet and I'm not in the software designing industry. Therefore maybe I haven't software making skills.

1 Review

Downloads: 0 This Week

Last Update: 2019-09-27
See Project
14

oscp

multiplatform, small and handy audio/video player with network remote

...feats: - small and fast - zip/rar support - ncurses ui with easy navigation - gtk or classic winamp skins mode (with the ability to load random one) - network remotes (crossplatform linux/windows, written in pygtk, protocol is plain text, ie. easy to write own client) - scrobble (last.fm) - built-in equalizer and effects - lirc (remote) - slave libs statically compiled in (no deps) -- read more in wiki above

Downloads: 0 This Week

Last Update: 2019-06-08
See Project
15

Canorus

Music score editor

Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, a MIDI playback of notes, chord markings, lyrics, import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond

8 Reviews

Downloads: 24 This Week

Last Update: 2020-01-04
See Project
16

AngelReader

An E-book, Audio-book, & Library Loader in One Application

AngelReader: A minimalist but powerful GUI application that has the capacity to load [1] E-books in plain text format with the least use of both software and hardware resources. It can also load [2] Audio-books with the basic functions of play, stop, pause, and resume with the same minimalist economy that doesn't hog computer resources. When used in integration with the AngelReader Library Selector, it can function as a mini library management system for books in electronic formats. ...

Downloads: 0 This Week

Last Update: 2018-10-16
See Project
17

FM2TXT

RtlSdr listen to radio, recognize audio, and writes text file log

Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt

Downloads: 1 This Week

Last Update: 2017-12-17
See Project
18

puddletag

SImple, powerful audio tagger for GNU/Linux

...The usual tag editor features are supported like extracting tag information from filenames, renaming files based on their tags by using patterns and basic tag editing. Then there’re Functions, which can do things like replace text, trim it, do case conversions, etc. Actions can automate repetitive tasks. Doing web lookups using Amazon (including cover art), Discogs (does cover art too!), FreeDB and MusicBrainz is also supported. There’s quite a bit more, but I’ve reached my comma quota. Supported formats: ID3v1, ID3v2 (mp3), MP4 (mp4, m4a, etc.), VorbisComments (ogg, flac), Musepack (mpc), Monkey’s Audio (.ape) and WavPack (wv).

12 Reviews

Downloads: 6 This Week

Last Update: 2015-10-12
See Project
19

Analyzer for Files

A tool to look into file contents

Analyzer for Files (AoF) is a tool to look into file contents, analyze the structure with installed plug-ins, and show the results with several split windows including converted data and a tree if successful. It was designed as a workbench with a core and plug-in extensions. It can handle the normal plain-text file and data, complex binaries supported with the corresponding plug-ins. What's more, the developers can deploy and release their own plug-ins according to the plug-in developing standard. The development site is https://bitbucket.org/cadappl/anafile.

Downloads: 6 This Week

Last Update: 2015-05-17
See Project
20

Steel TTS

A cross-platform wrapper for common text-to-speech engines in Python

Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python. Steel currently supports the following TTS software: - Microsoft Speech API 5 (SAPI5) - eSpeak - NS Speech Synthesis - FreeTTS Documentation: http://sourceforge.net/p/steeltts/wiki/ Bug Tracker: http://sourceforge.net/p/steeltts/tickets/ If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead developer, Jasper Danielson, at jrd4@rice.edu.

Downloads: 3 This Week

Last Update: 2016-03-15
See Project
21

AarTon

AarTon is an automated text-to-speech application. It allows user to enter text in a web-based front-end and render these texts via a multi-channel sound card.

Downloads: 0 This Week

Last Update: 2013-11-14
See Project
22

Speect

Speect is a multilingual TTS system. It offers a full text-to-speech system with various API's, as well as an environment for research and development of TTS systems and voices. It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas, these bindings are derived from SWIG interface files and can easily be extended for other languages supported by SWIG.

Downloads: 0 This Week

Last Update: 2013-05-30
See Project
23

PhonoRipper

A GUI tool for creating audio CDs from analog sources.

...Features include a fully buffered WAV file recorder with mixer and VU meter (supporting both ALSA and OSS drivers), automatic detection of track borders, an intuitive drag'n'drop CD writing interface with CD-text support, CDDB queries of CD text data, and a WYSIWYG CD cover editor.

Downloads: 0 This Week

Last Update: 2012-11-17
See Project
24

EnKoDeur-Mixeur

EnKoDeur-Mixeur (EKD) is an open source software which makes videos, pictures and audio post-production. It can be also used to convert videos in many formats. It is written in python and use the PyQt4 bindings.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-30
See Project
25

Canorus - a music score editor

Canorus is the next generation music score editor. It is the official successor of NoteEdit. It uses Qt4, has scripting capibilities and uses modern development standards like eclipse, patterns and test units as base.

1 Review

Downloads: 0 This Week

Last Update: 2014-06-09
See Project