voice text free download

Showing 181 open source projects for "voice text"

1

TTS Voice Wizard

Speech to Text to Speech, sends text as OSC messages

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) Use TTS Voice Wizard's accessibility features to improve your VRChat experience (it works outside of VRChat too!) You can convert your Speech-to-Text and back to Speech through various Speech Recognition and Text-to-Speech methods. You can send what you say as OSC messages to VRChat to be displayed on your avatar using KillFrenzyAvatarText or VRChats...

Downloads: 41 This Week

Last Update: 2024-04-11
2

Voice Accounting For Blind & Mute People

Free & Easy AI Voice Accounting Software For Blind & Speechless People

Download the zip file above, extract it, and open index.html in a web browser such as Firefox (preferred) or Google Chrome. You can also view and download my full collection of software for people with disabilities here: https://sourceforge.net/projects/softwares-for-disabled-people/ The full collection includes the Voice Accounting Software as well.

Downloads: 2 This Week

Last Update: 2024-04-30
3

React Native Voice

React Native Voice Recognition library for iOS and Android

A speech-to-text library for React Native. Manually or automatically link the NativeModule. Drag the Voice.xcodeproj from the @react-native-voice/voice/ios folder to the Libraries group on Xcode in your project. Click on your main project file (the one that represents the .xcodeproj) select Build Phases and drag the static library, lib.Voice.a, from the Libraries/Voice.xcodeproj/Products folder to Link Binary With Libraries. The plugin provides props for extra customization. Every time you...

Downloads: 0 This Week

Last Update: 2023-06-21
4

Coqui TTS

A deep learning toolkit for Text-to-Speech, battle-tested in research

TTS is a library for advanced Text-to-Speech generation. Built on the latest research, it was designed to achieve the best trade-off among ease of training, speed, and quality. TTS comes with pre-trained models and tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. High-performance Deep Learning models for Text2Speech tasks. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings...

Downloads: 29 This Week

Last Update: 2023-12-12
5

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 51 This Week

Last Update: 2023-12-07
6

Rasa

Open source machine learning framework to automate text conversations

Rasa is an open source machine learning framework to automate text- and voice-based conversations. With Rasa, you can build contextual assistants on Facebook Messenger, Slack, Google Hangouts, Webex Teams, Microsoft Bot Framework, Rocket.Chat, Mattermost, Telegram, and Twilio or on your own custom conversational channels. Rasa helps you build contextual assistants capable of having layered conversations with lots of back-and-forths. In order for a human to have a meaningful exchange...

Downloads: 17 This Week

Last Update: 2024-04-18
7

DiscordBotClient

A patched version of Discord, with bot login support

A patched version of Discord, with bot login support. Discord Bot Client allows you to use your bot, just like any other user account, except for Friends and Groups.

Downloads: 22 This Week

Last Update: 2024-07-20
8

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.

Downloads: 13 This Week

Last Update: 4 days ago
9

Signal Desktop

Private messenger for Windows, Mac, and Linux

Say "hello" to a different messaging experience. An unexpected focus on privacy, combined with all of the features you expect. State-of-the-art end-to-end encryption (powered by the open source Signal Protocol) keeps your conversations secure. We can't read your messages or listen to your calls, and no one else can either. Privacy isn’t an optional mode, it’s just the way that Signal works. Every message, every call, every time. Share text, voice messages, photos, videos, GIFs and files...

Downloads: 10 This Week

Last Update: 4 days ago
10

PyGPT

Open source personal AI Assistant for Linux, Windows and Mac

PyGPT is a desktop application that allows you to talk to OpenAI's LLMs such as GPT-4 and GPT-3 using your own computer and the OpenAI API. It allows you to talk in chat mode and in completion mode, as well as generate images using DALL-E 2. PyGPT also adds access to the Internet for GPT via the Google Custom Search API and Wikipedia API and includes voice synthesis using the Microsoft Azure Text-to-Speech API. Moreover, the application has implemented context memory support, context storage, history...

Downloads: 13 This Week

Last Update: 2024-08-29
11

Amiga Memories

A walk along memory lane

Amiga Memories is a project (started & released in 2013) that aims to make video programmes that can be published on the internet. The images and sound produced by Amiga Memories are 100% automatically generated. The generator itself is implemented in Squirrel, the 3D rendering is done on GameStart 3D. An Amiga Memories video is mostly based on a narrative. The purpose of the script is to define the spoken and written content. The spoken text will be read by a voice synthesizer (Text To Speech...

Downloads: 2 This Week

Last Update: 2023-03-22
12

NVIDIA NeMo

Toolkit for conversational AI

NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI...

Downloads: 3 This Week

Last Update: 2024-09-06
13

Coqui STT

The deep learning toolkit for speech-to-text

... in post. With Coqui, dubbing is a delight. Effortlessly clone the voice of your talent into another language and let the clone do the dub. With text-to-speech, experience the immediacy of script-to-performance. Cast from a wide selection of high-quality, directable, emotive voices or clone a voice to suit your needs. With Coqui text-to-speech, production times go from months to minutes.

Downloads: 8 This Week

Last Update: 2022-09-03
14

Tock

Tock, the open source conversational AI toolkit

... by leveraging Tock APIs. Try Tock online, deploy in minutes using Docker running configurations, or set up your own platform in the cloud, on-premises, or even embedded. More and more teams and companies trust in Tock and open conversational platforms. Built-in connectors for numerous text/voice channels: Messenger, WhatsApp, Google Assistant, Alexa, Twitter and more.

Downloads: 2 This Week

Last Update: 2024-09-09
15

onepoint

Assistant tool that integrates coding, writing, and reading functions

Onepoint is an open-source AI assistant based on Electron, designed to create the ultimate desktop productivity tool. Its initial goal was to develop a smart floating window similar to Apple's intelligent assistant that does not take up desktop space or system performance and can be quickly accessed through global hotkeys for user convenience. With ChatGPT technology, users can continuously train onepoint to generate and reconstruct content with greater accuracy (onpoint), thereby improving...

Downloads: 1 This Week

Last Update: 2024-09-03
16

Feishu ChatGPT

Voice dialogue, role-playing, multi-topic discussion, picture creation

Feishu × (GPT-3.5 + DALL·E + Whisper) = flying-like work experience. Voice dialogue, role-playing, multi-topic discussion, picture creation, table analysis, document export. Golang language, it goes without saying! Master the gin framework proficiently, developing the backend is as natural as breathing! Familiar with the SDKs of DingTalk, Feishu, Qiwei and other platforms, and be able to develop and integrate a series of amazing functions! Proficient in platform-based detail thinking, let...

Downloads: 0 This Week

Last Update: 2023-11-20
17

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model

...-speed and ultra-lightweight models, and also cutting-edge technology. We provide production-ready streaming ASR and streaming TTS systems. Our frontend contains Text Normalization and Grapheme-to-Phoneme (G2P, including Polyphone and Tone Sandhi). Moreover, we use self-defined linguistic rules to adapt to Chinese context.

Downloads: 0 This Week

Last Update: 2024-06-13
18

nheko

Desktop client for Matrix using Qt and C++20

Desktop client for Matrix using Qt and C++20. The motivation behind the project is to provide a native desktop app for Matrix that feels more like a mainstream chat app (Element, Telegram etc) and less like an IRC client.

Downloads: 0 This Week

Last Update: 2024-06-28
19

VoiceSmith

[WIP] VoiceSmith makes training text to speech models easy

VoiceSmith makes it possible to train and infer on both single and multispeaker models without any coding experience. It fine-tunes a pretty solid text to speech pipeline based on a modified version of DelightfulTTS and UnivNet on your dataset. Both models were pretrained on a proprietary 5000 speaker dataset. It also provides some tools for dataset preprocessing like automatic text normalization. Windows (only CPU supported currently) or any Linux based operating system. If you want to run...

Downloads: 2 This Week

Last Update: 2023-03-24
20

Open Camera

Camera app for Android.

Open Camera is an Open Source Camera app for Android(TM) phones and tablets. * Get it on Google Play: https://play.google.com/store/apps/details?id=net.sourceforge.opencamera * Get it on F-Droid: https://f-droid.org/en/packages/net.sourceforge.opencamera/ * Or you can download the APK/source directly, see "Files" above. Android, Google Play and the Google Play logo are trademarks of Google LLC.

46 Reviews

Downloads: 1,673 This Week

Last Update: 2024-06-26
21

Quillnote

Take beautiful markdown notes and stay organized with task lists

Take beautiful markdown notes and stay organized with task lists. Take beautiful markdown notes whenever you feel inspired. Place them in notebooks and tag them accordingly. Stay organized by making task lists, set reminders and keep everything in one place by attaching related files. Quillnote is fully free and open-source. It will never show you ads, ask you for unnecessary permissions or upload your notes anywhere without you knowing.

Downloads: 3 This Week

Last Update: 2024-02-23
22

RHVoice

Free open source speech synthesizer for Russian and other languages

RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers...

Downloads: 0 This Week

Last Update: 2024-07-04
23

eGuideDog free software for the blind

eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.

16 Reviews

Downloads: 277 This Week

Last Update: 3 hours ago
24

Pearl MATE Studio 12

OSX Styled Powerful Audio Workstation

Pearl MATE Studio 12 runs on the Ubuntu 24.04 base with no snap support. This release does, however, support Flatpak. Text-to-speech on Pearl is managed with Pied, which downloads and selects natural voice models; the keyboard shortcut <Alt>+s plays back highlighted text. This release of Pearl MATE Studio is a lot lighter on preinstalled software, so users can choose what they want by installing through our software manager, Gdebi, or Synaptic which are pre...

Downloads: 71 This Week

Last Update: 2024-09-09
25

Pearl Desktop (PDE) 12

Stable Solid Compiz Powered Multimedia Workhorse

... Piper handles great-sounding natural text-to-speech voice models.

Downloads: 114 This Week

Last Update: 2024-09-09