Join/Login
Open Source Software
Business Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Open Source Software

Business Software

SourceForge Podcast

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Browse Open Source
Artificial Intelligence
Speech to Text Software
Search Results

Search Results for "text to speech file"

x

Sort By:

Relevance

Clear All Filters

OS

Windows 34
Linux 24
Mac 19
More...
BSD 6
ChromeOS 5
Desktop Operating Systems 1
Mobile Operating Systems 1

Category

Artificial Intelligence 41
Multimedia 14
Business 4
Education 3
Communications 2
Productivity 2
Scientific/Engineering 2
Database 1
Software Development 1
Text Editors 1

License

OSI-Approved Open Source 21
Creative Commons Attribution License 1
Other License 1

Translations

English 13
German 2
Spanish 2
Arabic 1
More...
Catalan 1
French 1
Korean 1
Polish 1

Programming Language

Python 10
C# 7
Java 6
C++ 5
More...
BASIC 1
C 1
Delphi/Kylix 1
Go 1
JavaScript 1
PHP 1
Tcl 1
VBScript 1

Status

Production/Stable 8
Beta 4
Alpha 2
Pre-Alpha 1

Showing 41 open source projects for "text to speech file"

View related business solutions

Speech to Text Clear Filters & Widen Search

Our Free Plans just got better! | Auth0 by Okta
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.

Try free now
Bright Data - All in One Platform for Proxies and Web Scraping
Say goodbye to blocks, restrictions, and CAPTCHAs

Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.

Get Started
1

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 81 This Week

Last Update: 2024-09-30
See Project
2

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip...

Downloads: 14 This Week

Last Update: 2024-05-05
See Project
3

TTS Voice Wizard

Speech to Text to Speech, sends text as OSC messages

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) Use TTS Voice Wizard's accessibility features to improve your VRChat experience (it works outside of VRChat too!) You can convert your Speech-to-Text and back to Speech through various Speech Recognition and Text-to-Speech methods. You can send what you say as OSC messages to VRChat to be displayed on your avatar using KillFrenzyAvatarText or VRChats...

Downloads: 27 This Week

Last Update: 2024-04-11
See Project
4

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.

Downloads: 4 This Week

Last Update: 21 hours ago
See Project
Cybersecurity Solutions to Protect, Detect and Respond Against Cyberattacks
Kroll's elite cyber risk experts deliver end-to-end cyber security services for organizations in a wide range of sectors, across the globe.

From system upgrades or a move to the cloud … to applications meant to improve the customer experience … and to integral third-party relationships, one misstep can cascade into IP theft, wire fraud, ransomware, data breaches and more; not to mention regulatory action, civil litigation and reputational damage. That’s why we’ve structured end-to-end solutions to manage the entire threat lifecycle.

Learn More
5

Whishper

Transcribe any audio to text, translate and edit subtitles 100% locall

Open-source, local-first audio transcription and subtitling suite with a simple web UI. Thanks to open-source technologies, Whishper can run 100% offline. Your data never leaves your computer. Whishper allows you to translate your transcriptions to and from more than 60 languages thanks to Argos Translate and LibreTranslate. Download the transcriptions in many formats (json, txt, vtt, srt). Easily edit your subtitles right in the Web-UI.

Downloads: 8 This Week

Last Update: 2024-09-10
See Project
6

Coqui STT

The deep learning toolkit for speech-to-text

Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. Multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure. Effortlessly clone the voices of your talent and have the clone handle the problems...

Downloads: 2 This Week

Last Update: 2022-09-03
See Project
7

SoundTranscriber

SoundTranscriber can be used to generate automatic transcription / aut

SoundTranscriber can be used to generate automatic transcription / aut

1 Review

Downloads: 6 This Week

Last Update: 2024-07-30
See Project
8

VATSG

Video automatic transcribe and translated subtitle generator

It generates srt format subtitle from videofile which can be any source language that whisper support , and then make translated subtitle file of your target language which deepl support. This is the subtitle generator(VATSG) which use [moviepy](https://github.com/Zulko/moviepy) to generate mp3 and then use [faster-whisper](https://github.com/guillaumekln/faster-whisper) to get text recognition and then use deepl-api to generate your target language subtitle file(srt format) If you...

Downloads: 5 This Week

Last Update: 2023-09-19
See Project
9

Conversations

App in java for chatting to a generative A.I. (involving tts and stt)

Java application for chatting to generative AI Llama3. * The user can speak into the microphone (speechToText), edit the recognized text and send it to the AI. * The AI responds and the server returns that response in real time, and the sentences converted to audio (textToSpeech), and the application broadcasts them through the speaker. The application is prepared so that only one user occupies the server's resources, so if the server is busy, in theory it will not let you connect...

Downloads: 1 This Week

Last Update: 2024-08-16
See Project
Control remote support software for remote workers and IT teams
Raise the bar for remote support and reduce customer downtime.

ConnectWise ScreenConnect, formerly ConnectWise Control, is a remote support solution for Managed Service Providers (MSP), Value Added Resellers (VAR), internal IT teams, and managed security providers. Fast, reliable, secure, and simple to use, ConnectWise ScreenConnect helps businesses solve their customers' issues faster from any location. The platform features remote support, remote access, remote meeting, customization, and integrations with leading business tools.

Learn More
10

Sintetizador GMS

Herramienta para Sintetizar voces en la PC

Herramienta para escuchar documentos, puede copiar y pegar, guardar documentos y generar audio hablado.

Downloads: 0 This Week

Last Update: 2024-03-20
See Project
11

Mice TTM

mice stt tts

Dieses Tool wird speziell für die Barrierefreiheit unter Linux entwickelt. Es ermöglicht das umwandeln/konvertieren/parsen von Texten die aus einer Spracherkennung stammen, in Diktate sowie das Ausführen von Makros. Dies funktioniert ohne Internet, da die Spracherkennung auf dem PC selbst erfolgt. Mausbewegungen auf benannte Wörter und dann entsprechend auswählen oder per Sprachbefehl klicken. Außerdem können Textpassagen z.B. unter Libreoffice Wirter per Sprachbefehl entsprechend...

Downloads: 0 This Week

Last Update: 2024-08-07
See Project
12

Translate-Subtitle-File

Subtitle Creation Assistant

Subtitle group machine translation assistant - [Function 1: Translate subtitle file] .srt .ass .vtt [Function 2: Voice to text] (Drag in video or audio to recognize subtitles) (The latest version v4.1.0 Update time 2021 2 May 23) 12 translation service providers can be configured, such as Google, Baidu, Tencent, Caiyun, IBM, Azure, Amazon, etc. (6 voice service providers can be configured: Alibaba Cloud, Xunfei, Tencent Cloud, IBM, Azure, Amazon ) Advantages: 1. You can use multiple service...

Downloads: 2 This Week

Last Update: 2022-09-29
See Project
13

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following...

Downloads: 35 This Week

Last Update: 2021-04-08
See Project
14

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More help: https://sourceforge.net/p/skrybotdomowy/wiki...

2 Reviews

Downloads: 4 This Week

Last Update: 2020-03-15
See Project
15

GoodByeCatpcha

Solver ReCaptcha v2 Free

An async Python library to automate solving ReCAPTCHA v2 by images/audio using Mozilla's DeepSpeech, PocketSphinx, Microsoft Azure’s, Google Speech and Amazon's Transcribe Speech-to-Text API. Also image recognition to detect the object suggested in the captcha. Built with Pyppeteer for Chrome automation framework and similarities to Puppeteer, PyDub for easily converting MP3 files into WAV, aiohttp for async minimalistic web-server, and Python’s built-in AsyncIO for convenience.

Downloads: 0 This Week

Last Update: 2020-06-24
See Project
16

CMU Sphinx

Speech Recognition Toolkit

Thank you for visiting! ----> Maintenance and improvement work has MOVED to https://cmusphinx.github.io/ Please go there for the most recent software and documentation. <---- CMUSphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.

58 Reviews

Downloads: 1,033 This Week

Last Update: 2024-01-11
See Project
17

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

..., free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognition CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3 users can also write own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian

4 Reviews

Downloads: 2 This Week

Last Update: 2018-07-23
See Project
18

insofts player

insofts-player Free media player, with which you can easily and conveniently view video and listen to audio files in various formats, without installing additional codecs. View streaming video, audio. Constantly updating the online media library Additional features: sound recording, uart protocol support, speech to text

2 Reviews

Downloads: 5 This Week

Last Update: 2018-06-09
See Project
19

MDictate

Speech to text using python, pocketsphinx, ready to deploy

Automated speech recognition software is extremely cumbersome. This project's aim is to incrementally improve the quality of an open-source and ready to deploy speech to text recognition system. Runs on Windows using the mdictate.exe, but the core workings are found in the mdictate.py script which should work on Windows/Linux/OS X. In version 1.0, we use pocketsphinx' default setup with a basic graphic interface.

Downloads: 0 This Week

Last Update: 2018-03-15
See Project
20

JAVT - Just Another Voice Transformer

Just Another Speech Recognition and Text to Speech software.

JAVT or Just Another Voice Transformer (formerly, it is called Just Another Video Transcriber) is a Speech Recognition software that also support text to Speech and simple media conversion. JAVT allows you to convert from video files to audio wav file using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and allow JAVT to read it out for you through text to speech conversion.

Downloads: 5 This Week

Last Update: 2020-08-19
See Project
21

JKVSRG Tamil and Amharic Translator

Attractive and user friendly Tamil and Amharic translator

JKVSRG Tamil and Amharic translator translates between Tamil and Amharic. User can transliterate English to Tamil and English to Amharic with the help of Tamil and Amharic Letter System which is included in this software. In addition, User can also translate multilingual text file to Tamil and Amharic text file. It is an interactive and user friendly translator because of its special feature speech synthesize (text to speech). It runs on windows. It takes a smaller amount of memory and few...

Downloads: 0 This Week

Last Update: 2017-03-25
See Project
22

JKVSRG Eng Amharic Translator

Translation between English and Amharic

JKVSRG English and Amharic translator translates English to Amharic and Amharic to English. User can transliterate English to Amharic with the help of Amharic Letter System which is included in this software. In addition, User can also translate English text file to Amharic text file and vice-versa while comparing it with other Amharic Translators. This translator is an interactive and user friendly translator because of its special feature speech synthesize (text to speech). It runs on all...

Downloads: 0 This Week

Last Update: 2017-03-21
See Project
23

XR3Capture

Take screen shots of your computer!

Comments: Capture your computer screen a lot easier with this app. System Requirements: Java 1.8.0_45++ required. GitHub (https://github.com/goxr3plus/XR3Capture)

1 Review

Downloads: 0 This Week

Last Update: 2017-02-10
See Project
24

Sluh

creating user interface for converting speech to text and voice contro

Проект по изучению пригодности\простоты к использованию CMUSphinx, pocketsphinx в пользовательских целях. Попытка создания программного интерфейса по распознаванию русской речи (преобразованию в текст) на базе CMUSphinx, pocketsphinx для голосового управления ПО и прочее.

Downloads: 0 This Week

Last Update: 2016-09-21
See Project
25

Speechalyzer

Process large speech data wrt transcription, labeling and annotation

Speechalyzer: a tool for the daily work of a 'speech worker' It is optimized to process large speech data sets with respect to transcription, labeling and annotation. It is implemented as a client server based framework in Java and interfaces software for speech recognition, synthesis, speech classification and quality evaluation. The application is mainly the processing of training data for speech recognition and classification models and performing benchmarking tests on speech-to-text...

Downloads: 0 This Week

Last Update: 2016-04-27
See Project

Previous
You're on page 1
2
Next

Related Searches

cmusphinx-zh-cn-5.2.tar.gz

speech recognition project

arabic text to speech

hindi text to speech

convert audio to srt

Related Categories

Artificial Intelligence

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
225 Broadway Suite 1600
San Diego, CA 92101
+1 (858) 454-5900

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2024 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: