
Search Results for "converting transcript to audio"


37 projects for "converting transcript to audio" with 1 filter applied:

1

MARS5

MARS5 speech model (TTS) from CAMB.AI

...The model is built to handle prosodically challenging content such as sports commentary, anime dialogue, and other high-energy or highly varied speech patterns with realistic rhythm and intonation. To control speaker identity, MARS5 uses a short reference audio clip, typically between 2 and 12 seconds, from which it learns the voice characteristics. It supports two main inference modes: shallow clone, which is faster and only needs the reference audio, and deep clone, which additionally uses the transcript of the reference audio to increase similarity and naturalness at the cost of more computation.

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
2

Claude Code Video Vision

Give Claude the ability to watch and understand videos

...It supports multiple backends for audio processing, including local and cloud-based options, enabling flexible deployment depending on privacy or performance requirements.

Downloads: 2 This Week

Last Update: 4 days ago
See Project
3

EasyVoice

Open source text-to-speech tool, supports extra-long text

easyVoice is an open-source text-to-speech platform aimed at turning long-form text and novels into high-quality audio, with a strong focus on usability and scalability. It provides a web interface where users can paste or upload large texts and generate speech and subtitles in a single workflow, even for works exceeding 100,000 characters. The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure...

Downloads: 3 This Week

Last Update: 2026-01-26
See Project
4

Agili Hacker Podcast

AI tool that turns Hacker News posts into daily podcast updates

Hacker Podcast is an AI-powered project that turns top Hacker News stories into a Chinese podcast. It automatically fetches trending posts each day, processes the content with AI, and generates concise summaries before converting them into audio. This creates a hands-free way to stay updated on tech, startups, and developer discussions without reading long threads. Hacker Podcast combines content aggregation, natural language processing, and text-to-speech to deliver clear and digestible updates. Users can listen through web interfaces or podcast platforms, while also accessing written summaries for deeper reading. ...

Downloads: 3 This Week

Last Update: 7 days ago
See Project
5

VoxCPM

TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

...Trained on a large 1.8-million-hour bilingual corpus, VoxCPM can infer appropriate speaking style from context, dynamically adjusting intonation, rhythm, and emotional tone. It supports zero-shot voice cloning from a short reference audio clip, capturing timbre, accent, and pacing to closely mimic a target speaker without per-speaker fine-tuning.

Downloads: 37 This Week

Last Update: 2026-04-08
See Project
6

AI-Media2Doc

AI tool converting video/audio into structured documents instantly

AI-Media2Doc is a web-based application that uses large language models to convert video and audio content into structured, readable documents in a single workflow. It is designed to transform multimedia inputs into formats such as knowledge notes, summaries, mind maps, and social-style articles, making content easier to review and reuse. AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
7

VERT.sh

The next-generation file converter

VERT is a modern, privacy-focused file conversion platform that leverages WebAssembly to perform conversions entirely on the user’s device rather than relying on cloud-based processing. Built with Svelte and TypeScript, it provides a clean and responsive interface for converting a wide variety of file types, including images, audio, video, and documents. One of its defining characteristics is its local-first approach, which eliminates the need to upload files to external servers, thereby improving both privacy and performance. The system supports over 250 file formats and includes customizable conversion settings, allowing users to fine-tune output parameters. ...

Downloads: 0 This Week

Last Update: 2026-04-08
See Project
8

Streamer-Sales

Large language model for live-stream sales hosts

Streamer-Sales is an open-source large language model system designed specifically for e-commerce live streaming and automated product promotion. The project focuses on generating persuasive product descriptions and live presentation scripts that mimic the style of professional online sales hosts. By analyzing product characteristics and marketing information, the model can produce engaging explanations that emphasize benefits, features, and emotional appeal to encourage viewers to make...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
9

Perl Audio Converter

Linux Audio Converter / Tagger / CD Ripper

A Linux CLI tool for converting multiple audio types from one format to another. It supports the following audio formats: 3G2, 3GP, 8SVX, AAC, AC3, ADTS, AIFF, AL, AMB, AMR, APE, AU, AVR, BONK, CAF, CDR, CVU, DAT, DTS, DVMS, F32, F64, FAP, FLA, FLAC, FSSD, GSRT, HCOM, IMA, IRCAM, LA, MAT, AUD, MAT4, MAT5, M4A, M4R, MP2, MP3, MP4, MP4A, MPC, MPP, NIST, OFF, OFR, OFS, OPUS, OGA, OGG, PAF, PRC, PVF, RA, RAM, RAW, RF64, SD2, SF, SHN, SMP, SND, SOU, SPX, SRN, TAK, TTA, TXW, VOC, VMS, VQF, W64, WAV, WMA, and WV. ...

4 Reviews

Downloads: 10 This Week

Last Update: 2021-02-09
See Project
10

Linux-Intelligent-Ocr-Solution

Easy-OCR solution and Tesseract trainer for GNU/Linux

Linux-intelligent-ocr-solution (Lios) is free and open source software for converting print into text using either a scanner or a camera. It can also produce text from scanned images from other sources such as PDFs, images, folders containing images, or screenshots. The program is fully accessible to visually impaired users. A Tesseract Trainer GUI is also shipped with this package. Forum: https://groups.google.com/forum/#!forum/lios Video Tutorial :...

5 Reviews

Downloads: 6 This Week

Last Update: 2020-10-19
See Project
11

Subsonic

Subsonic is a web-based media streamer, providing ubiquitous access to your music and video collection. More than 20 apps are available for Android, iPhone, Windows Phone, BlackBerry, Roku, Chumby, Sonos etc. Supports virtually all media formats, converting files on the fly. Also includes a Podcast receiver and jukebox feature allowing you to control what's playing on your computer from your mobile phone.

38 Reviews

Downloads: 22 This Week

Last Update: 2019-11-10
See Project
12

EXP Soundboard

Simple soundboard app with hotkeys

A soundboard that supports almost all MP3s and WAVs. Sounds can be triggered with custom keyboard hotkeys and played through up to 2 outputs, e.g. your speakers and a virtual audio cable. It also allows your mic to pass into the virtual audio cable when Mic Injector is enabled. The soundboard also incorporates a save feature. REQUIREMENTS: Java 7. If you want sounds to be played through voice chat you'll need a virtual audio cable. (For Windows users I recommend the...

18 Reviews

Downloads: 1,179 This Week

Last Update: 2016-06-13
See Project
13

Graphical Youtube Downloader

GYD is a youtube-dl GUI based on Qt

GYD (Graphical Youtube Downloader) is a GUI for youtube-dl. It is easy to use and supports most youtube-dl features, plus some extras such as file conversion and, since version 0.3a, a "YouTube to MP3 / OGG" (video to audio) function. To use the extra features you must install ffmpeg. youtube-dl: http://rg3.github.com/youtube-dl/ NOTE: GYD 0.3.x is the last version using Qt4 (it does not work with Qt5).

Downloads: 1 This Week

Last Update: 2016-11-17
See Project
14

MediaEncodingCluster

MediaEncodingCluster is an enterprise-class video cluster environment with a platform-independent client-server architecture: a distributed video/audio converting and encoding tool built on a grid computing network design. More at http://docs.codergrid.de

2 Reviews

Downloads: 0 This Week

Last Update: 2020-07-10
See Project
15

MobileMate

A video and audio converting tool customized on Tinycore Linux.

MobileMate is an open source video and audio converting tool customized on Tinycore Linux. It uses Bash to glue together open source tools such as MPlayer, FFmpeg (Libav), Zenity, grep, and sed. It is a self-booting tiny Linux that is small, easy to localize to your language, and easy to customize to your needs.

Downloads: 0 This Week

Last Update: 2017-10-31
See Project
16

transqript

A program to transcribe audio files

transqript can be used to transcribe audio files of interviews etc. to text files.

1 Review

Downloads: 0 This Week

Last Update: 2012-07-16
See Project
17

MediaScan

A set of Python scripts for bulk conversion of media files. The scripts scan directory trees for video and audio files and convert them to AVI, Ogg, or MP3 as appropriate. Relies on mencoder, lame, mplayer, and oggenc.

Downloads: 0 This Week

Last Update: 2019-02-08
See Project
18

eMusicCenter

A framework for MP3 content. Its base is a database containing your music; on top of it, plugins can handle ripping, playing, and converting music content.

Downloads: 0 This Week

Last Update: 2013-05-07
See Project
19

QtAP

The Qt Audio Processor is an all-in-one audio file processing application that covers ripping, converting, tagging, and burning to, from, and between every available audio codec.

Downloads: 0 This Week

Last Update: 2017-12-31
See Project
20

MediaDropBox

A GUI for audio and video encoding and playing for portable devices using ffmpeg and mplayer.

Downloads: 0 This Week

Last Update: 2013-04-10
See Project
21

Tinger Converter

This software converts all kinds of audio and video files to MP3 format. It runs on any platform, including Windows, Linux, and Mac, and is particularly good at handling multiple tasks at once. Supported audio formats include mp3 and wma; supported video formats include rmvb, avi, and mkv.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-09
See Project
22

Speech Made Visible

Speech Made Visible is an experiment in showing some of the qualities of speech in printed text. Analyze a recording for attributes like pitch, intensity (loudness), and speed; then style the words in a transcript to suggest those characteristics.

1 Review

Downloads: 0 This Week

Last Update: 2015-06-29
See Project
23

LEG - The Linux Encoder GUI

LEG is the Linux Encoder GUI. It exists to make file conversions easier for users, whether converting plain AVI to MPEG or performing DVD rips and converting them for different devices such as iPods, smartphones, iPAQs, etc.

Downloads: 0 This Week

Last Update: 2013-03-13
See Project
24

lossless2lossy

Lossless2lossy is a conversion script for mass converting your ENTIRE music collection (or just one album) from one format to another while mirroring the directory structure and tags of the original format. Supports APE, FLAC, WavPack (and hybrid), Ogg, and MP3.

Downloads: 0 This Week

Last Update: 2017-11-12
See Project
25

ProteinMusic

ProteinMusic is a Java program that converts DNA sequences into music. The original idea for this project came from R. D. King at the University of Wales, Aberystwyth and C. G. Angus from the Shamen (King, R.D. & Angus, C.G. (1996)).

Downloads: 0 This Week

Last Update: 2014-03-17
See Project



© 2026 Slashdot Media. All Rights Reserved.
