Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Multimedia
Sound/Audio
Speech Software
Search Results

Search Results for "speech processing"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 22
Windows 22
Mac 17
More...
BSD 14
ChromeOS 8
Desktop Operating Systems 2
Game Consoles 1
Server Operating Systems 1

Category

Multimedia 26
Artificial Intelligence 12
Scientific/Engineering 12
Text Editors 5
Software Development 2
Business 1
Communications 1
Database 1
System 1

License

OSI-Approved Open Source 22

Translations

English 5
Brazilian Portuguese 1
French 1
German 1
More...
Spanish 1

Programming Language

C++ 9
C 8
Java 8
Perl 3
More...
Python 3
C# 2
Unix Shell 2
Visual Basic .NET 2
IDL 1
MATLAB 1
PHP 1

Status

Beta 9
Production/Stable 9
Pre-Alpha 3
Alpha 3
More...
Planning 1

Showing 26 open source projects for "speech processing"

View related business solutions

Speech Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
1

Moonshine Voice

Fast and accurate automatic speech recognition (ASR) for edge devices

moonshine is an open-source automatic speech recognition toolkit optimized for fast and accurate transcription on edge devices and local environments. The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows. ...

Downloads: 9 This Week

Last Update: 2026-06-02
See Project
2

Voxal voice changer

Transform your voice in real-time voxal voice changer

Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality. Whether you're looking to have fun, protect your privacy, or create engaging content,...

1 Review

Downloads: 34 This Week

Last Update: 2025-11-16
See Project
3

VCClient

Software that uses AI to perform real-time voice conversion

VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...

Downloads: 35 This Week

Last Update: 2026-03-23
See Project
4

Speech Signal Processing Toolkit (SPTK)

SPTK is a suite of speech signal processing tools for UNIX environments, e.g., LPC analysis, PARCOR analysis, LSP analysis, PARCOR synthesis filter, LSP synthesis filter, vector quantization techniques, and other extended versions of them.

9 Reviews

Downloads: 10 This Week

Last Update: 2023-05-10
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

AhoTTS - TTS for Basque and Spanish

Text-to-Speech for Basque and Spanish

Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/

1 Review

Downloads: 0 This Week

Last Update: 2022-05-03
See Project
6

AhoTTS Multilingual, a Multilingual TTS

Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English

Text-to-Speech conversor for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/

1 Review

Downloads: 0 This Week

Last Update: 2019-11-29
See Project
7

AhoTTS Iparrahotsa

TTS for Basque Lapurdian dialect

AhoTTS Iparrahotsa is the TTS developed at the Aholab Signal Processing Laboratory of the University of the Basque Country (UPV/EHU) for the Lapurdian dialect of Basque. This dialect is spoken at the Northern area of the Basque speaking area (French region). This project was funded by the Euroregion Aquitaine-Euskadi under grant EUSKADI-2012-004.

Downloads: 0 This Week

Last Update: 2016-04-07
See Project
8

yaafe

Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.

1 Review

Downloads: 0 This Week

Last Update: 2016-02-25
See Project
9

Modular Audio Recognition Framework

MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.

3 Reviews

Downloads: 0 This Week

Last Update: 2015-10-06
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

Accelerated Feature Extraction Tool

A fast GPU accelerated feature extraction software for speech analysis

A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is a specially designed to process very large audio data sets. It uses GPU acceleration if compatible GPU available (CUDA as weel as OpenCL, NVIDIA, AMD, and Intel GPUs are supported). CPU SSE intrinsic instruction set is used in cases where no compatible GPU present.

1 Review

Downloads: 0 This Week

Last Update: 2015-05-25
See Project
11

jaivox

Speech recognition application builder and library

Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.

Downloads: 0 This Week

Last Update: 2015-03-26
See Project
12

SetFon Speech Analyzer - Web Praat

SetFon focus is an interface web based for Praat resources (www.praat.org) wich focus speech sound annalysis; it is a gerent program for acoustic analysis PHP/Mysql based. Developed with the framework SIMP.

Downloads: 0 This Week

Last Update: 2015-11-13
See Project
13

Bermuda Text-to-Speech

This project includes basic NLP and DSP techniques for Text-to-Speech

See TTS demo at: http://rslp.racai.ro/index.php?page=tts This is an entirely written in JAVA project which includes a set of tools and methods designed to enable Multilingual Text-to-Speech (TTS) synthesis. We currently support English and Romanian but we will soon train more models and make them available for download. If you want to read more about our other NLP and TTS tools check out http://nlptools.racai.ro.

Downloads: 0 This Week

Last Update: 2014-03-24
See Project
14

InproTK

An Incremental Spoken Dialogue Processing Toolkit

InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/

Downloads: 0 This Week

Last Update: 2015-06-16
See Project
15

de-ess

De-essing software to reduce sibilance in speech using TSP

This de-esser uses a novel approach called Temporal Sibilance Processing. The idea is to distinguish between fricatives and voiced sections of the speech signal by the number of zero crossings in time. Most of the speech file is left untouched (the samples are directly copied from source to destination). Only fricatives that are long enough and loud enough are filtered. The advantage of this approach over traditional approaches is that the clarity of the remaining speech is completely unaffected.

Downloads: 0 This Week

Last Update: 2017-10-21
See Project
16

ASSP Library

Advanced Speech Signal Analysis library provides a structure to handle various file formats and a variety of analysis functions commonly used in speech processing.

Downloads: 0 This Week

Last Update: 2013-05-08
See Project
17

IUT-SimDSP

This is a simple DSP simulator for educational purposes: developed as a course supplement of CIT-4617 (Digital Signal Processing) at Islamic University of Technology (IUT). Written in C++.

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
18

semaine

The open source, multimodal interactive "Sensitive Artificial Listener" dialogue system created by the EU project SEMAINE.

Downloads: 0 This Week

Last Update: 2013-04-11
See Project
19

Temporal Twist

Application to adjust the tempo of audio files. Useful for speeding up podcasts without making the speaker sound like a chip-monk.

Downloads: 2 This Week

Last Update: 2014-06-15
See Project
20

Rizon Voice

Voice is a text to speech program with many features. Some of the features include: Reads Text, Rich Text and Word Documents aloud. Custom greeting. Professional document editor. Clipboard monitoring and processing. Good looking animated character.

Downloads: 0 This Week

Last Update: 2015-08-06
See Project
21

Matsig

Matsig is an object-oriented signal class library (Toolbox in MATLAB lingo) for MATLAB 6.5 and later. It implements a signal class, simplifying operations and manipulations common in audio signal processing and speech processing.

Downloads: 0 This Week

Last Update: 2013-04-17
See Project
22

AutoLinguist

Automatically translate english/french/german text to german/french/english text and output speech in appropriate language. All Automagically with the power of the inter-webs.

Downloads: 0 This Week

Last Update: 2016-06-12
See Project
23

Auvai Text to Speech

Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
24

MRCP4J

The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.

Downloads: 0 This Week

Last Update: 2013-04-25
See Project
25

Text to MP3 converter

text file to wav/mp3 converter (reader). use Microsoft Speech API compatible engines (not included). command line interface for batch processing. support dictionary for speech correction.

Downloads: 0 This Week

Last Update: 2015-04-17
See Project

Previous
You're on page 1
2
Next

Related Searches

voice changer

fortnite

echo sound

forensic audio analysis

speech synthesis

sapi 5 voices

tts

feature extraction matlab

mega voice command database

mfcc

Related Categories

Multimedia

Artificial Intelligence

Scientific/Engineering

Text Editors

Software Development

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise