Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Multimedia
Sound/Audio
Speech Software
Search Results

Search Results for "python source codes"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 30
Windows 25
Mac 22
More...
BSD 16
ChromeOS 10
Desktop Operating Systems 1

Category

Multimedia 35
- Sound/Audio 35
- Graphics 2
Artificial Intelligence 12
Scientific/Engineering 8
Communications 6
Software Development 6
Internet 3
Desktop Environment 2
Text Editors 2
Database 1
Education 1
Social sciences 1
System 1

License

OSI-Approved Open Source 34

Translations

English 14
Chinese (Simplified) 2
German 2
Arabic 1
More...
Brazilian Portuguese 1
French 1
Japanese 1
Russian 1
Spanish 1
Thai 1
Turkish 1

Programming Language

Python 35
C++ 7
C 3
Java 2
JavaScript 2
More...
MATLAB 2
Lisp 1
Tcl 1

Status

Beta 15
Production/Stable 8
Pre-Alpha 4
Planning 3
More...
Alpha 3

Showing 35 open source projects for "python source codes"

View related business solutions

Speech Python Clear Filters & Widen Search

Keep company data safe with Chrome Enterprise
Protect your business with AI policies and data loss prevention in the browser

Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.

Download Chrome
Incredable is the first DLT-secured platform that allows you to save time, eliminate errors, and ensure your organization is compliant all in one place.
For healthcare Providers and Facilities

Incredable streamlines and simplifies the complex process of medical credentialing for hospitals and medical facilities, helping you save valuable time, reduce costs, and minimize risks. With Incredable, you can effortlessly manage all your healthcare providers and their credentials within a single, unified platform. Our state-of-the-art technology ensures top-notch data security, giving you peace of mind.

Learn More
1

Moshi

A speech-text foundation model for real time dialogue

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and...

Downloads: 0 This Week

Last Update: 2024-11-05
See Project
2

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the...

Downloads: 25 This Week

Last Update: 2021-04-08
See Project
3

Defox text to speech and downloader

Written or imported text offline read or online download.

This software design to convert text to speech and download the converted speech. Description : • Installation setup with two languages (English, French) • Two areas called text reading and speech downloading • Many languages supported to download center Note 1: I'm a student yet and I'm not in the software designing industry. Therefore maybe I haven't software making skills. I'm worried about that. ! Note 2 : When you double click on the software maybe it will get some seconds...

1 Review

Downloads: 2 This Week

Last Update: 2019-09-27
See Project
4

FM2TXT

RtlSdr listen to radio, recognize audio, and writes text file log

Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt

Downloads: 2 This Week

Last Update: 2017-12-17
See Project
Desktop and Mobile Device Management Software
It's a modern take on desktop management that can be scaled as per organizational needs.

Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.

Learn More
5

yaafe

Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.

1 Review

Downloads: 0 This Week

Last Update: 2016-02-25
See Project
6

Vapp IVR framework

A Python library to create sophisticated multilingual IVR applications. NOTICE. The repository is frozen, please find the latest version of the software at https://github.com/sippy/vapp

Downloads: 0 This Week

Last Update: 2015-04-14
See Project
7

Steel TTS

A cross-platform wrapper for common text-to-speech engines in Python

Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python. Steel currently supports the following TTS software: - Microsoft Speech API 5 (SAPI5) - eSpeak - NS Speech Synthesis - FreeTTS Documentation: http://sourceforge.net/p/steeltts/wiki/ Bug Tracker: http://sourceforge.net/p/steeltts/tickets/ If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead...

Downloads: 2 This Week

Last Update: 2016-03-15
See Project
8

pyespeak

Python to eSpeak speech synthesis

ctypes Python module for eSpeak http://espeak.sf.net speech synthesis

Downloads: 0 This Week

Last Update: 2017-10-28
See Project
9

AarTon

AarTon is an automated text-to-speech application. It allows user to enter text in a web-based front-end and render these texts via a multi-channel sound card.

Downloads: 0 This Week

Last Update: 2013-11-14
See Project
Award-Winning Medical Office Software Designed for Your Specialty
Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.

Learn More
10

Voice keyboard

Voice keyboard/dictation. Aims to be a total substitute for a keyboard. Spell out words letter by letter (using code: alpha, bravo, ..). Arrow keys, modifiers work. Speak whole words (but whole word accuracy is not good). Attach commands to some word

Downloads: 0 This Week

Last Update: 2015-04-20
See Project
11

RNNLIB

RNNLIB is a recurrent neural network library for sequence learning problems. Applicable to most types of spatiotemporal data, it has proven particularly effective for speech and handwriting recognition. full installation and usage instructions given at http://sourceforge.net/p/rnnl/wiki/Home/

2 Reviews

Downloads: 0 This Week

Last Update: 2016-11-28
See Project
12

VoiceCode Programming by Voice Toolbox

VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The aim of the project is to make programming through voice input as easy and productive as with mouse and keyboard. For install, Use subversion, as described in this page: http://sourceforge.net/apps/mediawiki/voicecode/index.php?title=VCode_1_Doc/InstallationManual.

5 Reviews

Downloads: 1 This Week

Last Update: 2013-03-10
See Project
13

Speect

...It offers a full text-to-speech system with various API's, as well as an environment for research and development of TTS systems and voices. It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas, these bindings are derived from SWIG interface files and can easily be extended for other languages supported by SWIG. Speect is free and open source software. As a collection it is distributed under a MIT license.

Downloads: 0 This Week

Last Update: 2013-05-30
See Project
14

VEDICS

VEDICS (Voice Enabled Desktop Interaction and Control System) is an assistive software which lets the user to interact with the OS using voice commands. Using this software the user can access any element found on the user's screen.

1 Review

Downloads: 0 This Week

Last Update: 2013-05-28
See Project
15

Book2m4b

This is a Linux project that acts as a front end to cdparanoia, sox, and ffmpeg with the hope of making it incredibly simple to rip many audiobook cds into one mono, audiobook (m4b) format file for use in audio players capable of playing audiobooks.

Downloads: 0 This Week

Last Update: 2019-03-16
See Project
16

AIChatbot

An extensible (by plugin) chatbot project

Downloads: 0 This Week

Last Update: 2015-07-02
See Project
17

SWIPE' pitch extractor

This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.

Downloads: 2 This Week

Last Update: 2013-04-11
See Project
18

QWave

QWave: Qt-based waveform display and audio playback class library.

Downloads: 0 This Week

Last Update: 2013-05-01
See Project
19

Audio Trigger

Performs actions on detected volume threshold Examples : - Launch music on clap - Launch speech recording when you start speaking - Launch guard webcam when a significant sound is detected - Increase or decrease headphones volume when ambient noise pass

Downloads: 1 This Week

Last Update: 2013-04-01
See Project
20

Eve

Eve is a AI project written in python that takes commands verbally or textually to control the computer and eveyday functions.

Downloads: 0 This Week

Last Update: 2013-04-03
See Project
21

ASTA - Auto. Subtitle Timing Annotator

A collection of scripts and programs to automatically annotate video/audio for subtitles. Basically relies on a MARSYAS (Music Analysis, Retrieval and Synthesis for Audio Signals) plug-in for detecting human voice in polyphonic recordings.

Downloads: 0 This Week

Last Update: 2014-04-24
See Project
22

Skimpy PNG/ASCII/WAVE tools

A collection of tools for generating audio and visual (PNG/HTML/WAVE) for use in web sites including CAPTCHA challenges and PNG image creation tools with Javascript mouse tracking support.

Downloads: 0 This Week

Last Update: 2013-04-01
See Project
23

ASR-Builder

ASR-Builder provides an easy-to-use interface to the HTK toolkit, that allows users to build ASR systems. ASR-Builder provides a platform that performs house-keeping tasks when using HTK and also provides default training/testing/recognition scripts.

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
24

uListen

uListen is a TTS(Text To Speech) application. It can TALK you the web pages, chm files, pdf files and word files and plain text files.

Downloads: 3 This Week

Last Update: 2013-04-05
See Project
25

Annotation Graph Toolkit

AGTK is a suite of software components for building tools for annotating linguistic signals, time-series data which documents any kind of linguistic behavior (e.g. audio, video). The internal data structures are based on annotation graphs.

Downloads: 4 This Week

Last Update: 2013-04-25
See Project

Previous
You're on page 1
2
Next

Related Searches

deepspeech-0.9.3-models.scorer

convert txt file to .srt file

sdr

feature extraction matlab

ivr

sapi5

voice keyboard yamaha

rnnlib

voice and speech recognition software

tts voices

Related Categories

Multimedia

Artificial Intelligence

Scientific/Engineering

Communications

Software Development

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2025 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

×

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: