Search Results for "voicebuilding for text-to-speech synthesis"

Showing 161 open source projects for "voicebuilding for text-to-speech synthesis"

1

VALL-E

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)

We introduce a language modeling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech which is hundreds of times larger than existing systems. VALL...

Downloads: 19 This Week

Last Update: 2023-04-14
2

Podcastfy.ai

Transforming Multimodal Content into Captivating Multilingual Audio

Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, and YouTube videos, as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization...

Downloads: 5 This Week

Last Update: 2 days ago
3

PyGPT

Open source personal AI Assistant for Linux, Windows and Mac

PyGPT is a desktop application that allows you to talk to OpenAI's LLM models such as GPT4 and GPT3 using your own computer and OpenAI API. It allows you to talk in chat mode and in completion mode, as well as generate images using DALL-E 2. PyGPT also adds access to the Internet for GPT via Google Custom Search API and Wikipedia API and includes voice synthesis using Microsoft Azure Text-to-Speech API. Moreover, the application has implemented context memory support, context storage, history...

Downloads: 7 This Week

Last Update: 2024-08-29
4

DALL-E 2 - Pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch. The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer...

Downloads: 5 This Week

Last Update: 2023-10-19
5

Lobe Chat

An open-source, modern-design AI chat framework

LobeChat, unlock the superpower of your brain. Pioneering the new age of thinking and creating. Built for you, the Super Individual. LobeChat supports file upload and knowledge base functionality. You can upload various types of files including documents, images, audio, and video, as well as create knowledge bases, making it convenient for users to manage and search for files. Additionally, you can utilize files and knowledge base features during conversations, enabling a richer dialogue...

Downloads: 4 This Week

Last Update: 14 hours ago
6

ESP8266Audio

Arduino library to play MOD, WAV, FLAC, MIDI, RTTTL, MP3

Arduino library for parsing and decoding MOD, WAV, MP3, FLAC, MIDI, AAC, and RTTTL files and playing them on an I2S DAC or even using a software-simulated delta-sigma DAC with dynamic 32x-128x oversampling. ESP8266 is fully supported and most mature, but ESP32 is also mostly there with built-in DAC as well as external ones. For real-time, autonomous speech synthesis, check out ESP8266SAM, a library that uses this one and a port of an ancient formant-based synthesis program to allow your ESP8266...

Downloads: 3 This Week

Last Update: 2024-09-21
7

Alan AI for iOS

In-App assistant SDK to build a multimodal conversational UX for iOS

...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.

Downloads: 3 This Week

Last Update: 2024-07-01
8

Alan AI

In-App assistant SDK to build a multimodal conversational UX for websites

...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.

Downloads: 3 This Week

Last Update: 2024-10-07
9

elevenlabs-api

elevenlabs-api is an open source Java wrapper around the ElevenLabs

Elevenlabs-api is an open-source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API. Compiled JARs are available via the Releases tab. To access your ElevenLabs API key, head to the official website, where you can view your xi-API-key under the 'Profile' tab. To set up your ElevenLabs API key, you must register it with the ElevenLabsAPI Java API. For any public repository security, you should store your API key in an environment variable, or external from your...

Downloads: 3 This Week

Last Update: 2023-12-25
10

Paper2GUI

Convert AI papers to GUI

Convert AI papers to GUI. Make it easy and convenient for everyone to use cutting-edge artificial intelligence technology. Paper2GUI: An AI desktop app toolbox for ordinary people. It can be used immediately without installation. It already supports 40+ AI models, covering AI painting, speech synthesis, video frame interpolation, video super-resolution, object detection, image stylization, OCR recognition and other fields. Supports Windows, Mac, and Linux systems...

Downloads: 2 This Week

Last Update: 2024-09-20
11

Imagen - Pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network

Implementation of Imagen, Google's Text-to-Image Neural Network that beats DALL-E2, in Pytorch. It is the new SOTA for text-to-image synthesis. Architecturally, it is actually much simpler than DALL-E2. It consists of a cascading DDPM conditioned on text embeddings from a large pre-trained T5 model (attention network). It also contains dynamic clipping for improved classifier-free guidance, noise level conditioning, and a memory-efficient unit design. It appears neither CLIP nor prior network...

Downloads: 1 This Week

Last Update: 2024-10-07
12

Alan AI for Android

Assistant SDK to build a multimodal conversational UX for Android

...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.

Downloads: 0 This Week

Last Update: 2024-07-01
13

NÜWA - Pytorch

Implementation of NÜWA, attention network for text to video synthesis

Implementation of NÜWA, state of the art attention network for text-to-video synthesis, in Pytorch. It also contains an extension into video and audio generation, using a dual decoder approach. It seems as though a diffusion-based method has taken the new throne for SOTA. However, I will continue on with NUWA, extending it to use multi-headed codes + hierarchical causal transformer. I think that direction is untapped for improving on this line of work. In the paper, they also present a way...

Downloads: 0 This Week

Last Update: 2023-03-22
14

Video Diffusion - Pytorch

Implementation of Video Diffusion Models

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch. It uses a special space-time factored U-net, extending generation from 2D images to 3D videos. 14k for difficult moving mnist (converging much faster and better than NUWA) - wip. Any new developments for text-to-video synthesis will be centralized at Imagen-pytorch...

Downloads: 0 This Week

Last Update: 2024-05-03
15

Omilo - a text to speech application

Omilo is a simple text to speech application

Omilo is a simple text to speech application for Windows and Linux using Festival, Flite, MaryTTS and Piper voices.

3 Reviews

Downloads: 6 This Week

Last Update: 2024-09-20
16

MaryTTS

An open-source, multilingual text-to-speech synthesis system

MaryTTS is an open-source, multilingual Text-to-Speech Synthesis platform written in Java. It was originally developed as a collaborative project of DFKI’s Language Technology Lab and the Institute of Phonetics at Saarland University. It is now maintained by the Multimodal Speech Processing Group in the Cluster of Excellence MMCI and DFKI. As of version 5.2, MaryTTS supports German, British and American English, French, Italian, Luxembourgish, Russian, Swedish, Telugu, and Turkish; more...

Downloads: 14 This Week

Last Update: 2023-08-11
17

Speech Signal Processing Toolkit (SPTK)

SPTK is a suite of speech signal processing tools for UNIX environments, e.g., LPC analysis, PARCOR analysis, LSP analysis, PARCOR synthesis filter, LSP synthesis filter, vector quantization techniques, and other extended versions of them.

9 Reviews

Downloads: 20 This Week

Last Update: 2023-05-10
18

eGuideDog free software for the blind

eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.

16 Reviews

Downloads: 304 This Week

Last Update: 2 days ago
19

RHVoice

Free open source speech synthesizer for Russian and other languages

RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers...

Downloads: 2 This Week

Last Update: 2024-07-04
20

Text to Waveform

Create synth presets from words

Convert words to waveforms you can load into a synthesizer oscillator to create synth presets. Have fun turning your name, your friends' names, your city name, your pet's name, your team's name into synth presets you can use to produce a track.

Downloads: 1 This Week

Last Update: 2023-12-09
21

Accessible-Coconut

A GNU/Linux operating system accessible to the visually impaired.

Accessible-Coconut (AC) is a community-driven GNU/Linux operating system that is completely accessible for persons with visual impairment. AC is derived from Ubuntu-MATE. The goal is to make a free and open-source eyes-free desktop environment. Forum: https://groups.google.com/forum/#!forum/accessible-coconut Telegram forum: https://telegram.me/accessible_coconut Project home: https://zendalona.com/

Downloads: 82 This Week

Last Update: 2024-02-24
22

Nyquist

Nyquist is a language for sound synthesis and music composition.

Nyquist is a language for sound synthesis and music composition. It is implemented in C and C++ and runs on Win32, OSX, and Linux. Nyquist combines a powerful functional programming style with efficient signal-processing primitives. Nyquist is also embedded as a scripting language in Audacity.

3 Reviews

Downloads: 35 This Week

Last Update: 2024-08-14
23

Maia

MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.

Downloads: 0 This Week

Last Update: 2024-04-21
24

Alan AI for Flutter

SDK to build a multimodal conversational UX for Flutter apps

...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.

Downloads: 0 This Week

Last Update: 2023-04-19
25

MARS5-TTS

MARS5 is a fully open-source, hyper-realistic text-to-speech (TTS).

CAMB.AI introduces MARS5, a fully open-source (commercially usable) TTS with break-through prosody and realism available on our Github: https://www.github.com/camb-ai/mars5-tts MARS5 is able to replicate performances (from 2-3s of audio reference) in 140+ languages, even for extremely tough prosodic scenarios like sports commentary, movies, anime and more; hard prosody that most closed-source and open-source TTS models struggle with today. We're excited for you to try, build on and use...

Downloads: 0 This Week

Last Update: 2024-06-08