voice to text writer free download

Showing 68 open source projects for "voice to text writer"

View related business solutions

Artificial Intelligence Linux Clear Filters & Widen Search

Payroll Services for Small Businesses | QuickBooks
Save 50% off for 3 months with QuickBooks Payroll when you Buy Now

Easily pay your team and access powerful tools, employee benefits, and supportive experts with the #1 online payroll service provider. Manage payroll and access HR and employee services in one place. Pay your team automatically once your payroll setup is complete. We'll calculate, file, and pay your payroll taxes automatically.

Learn More
Free CRM Software With Something for Everyone
216,000+ customers in over 135 countries grow their businesses with HubSpot

Think CRM software is just about contact management? Think again. HubSpot CRM has free tools for everyone on your team, and it’s 100% free. Here’s how our free CRM solution makes your job easier.

Get free CRM
1

TTS Voice Wizard

Speech to Text to Speech, sends text as OSC messages

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) Use TTS Voice Wizard's accessibility features to improve your VRChat experience (it works outside of VRChat too!) You can convert your Speech-to-Text and back to Speech through various Speech Recognition and Text-to-Speech methods. You can send what you say as OSC messages to VRChat to be displayed on your avatar using KillFrenzyAvatarText or VRChats...

Downloads: 23 This Week

Last Update: 2024-10-27
See Project
2

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 87 This Week

Last Update: 2024-09-30
See Project
3

PyGPT

Open source personal AI Assistant for Linux, Windows and Mac

PyGPT is a desktop application that allows you to talk to OpenAI's LLM models such as GPT4 and GPT3 using your own computer and OpenAI API. It allows you to talk in chat mode and in completion mode, as well as generate images using DALL-E 2. PyGPT also adds access to the Internet for GPT via Google Custom Search API and Wikipedia API and includes voice synthesis using Microsoft Azure Text-to-Speech API. Moreover, the application has implemented context memory support, context storage, history...

Downloads: 12 This Week

Last Update: 7 hours ago
See Project
4

Rasa

Open source machine learning framework to automate text conversations

Rasa is an open source machine learning framework to automate text-and voice-based conversations. With Rasa, you can build contextual assistants on Facebook Messenger, Slack, Google Hangouts, Webex Teams, Microsoft Bot Framework, Rocket.Chat, Mattermost, Telegram, and Twilio or on your own custom conversational channels. Rasa helps you build contextual assistants capable of having layered conversations with lots of back-and-forths. In order for a human to have a meaningful exchange...

Downloads: 5 This Week

Last Update: 2024-04-18
See Project
Red Hat Enterprise Linux on Microsoft Azure
Deploy Red Hat Enterprise Linux on Microsoft Azure for a secure, reliable, and scalable cloud environment, fully integrated with Microsoft services.

Red Hat Enterprise Linux (RHEL) on Microsoft Azure provides a secure, reliable, and flexible foundation for your cloud infrastructure. Red Hat Enterprise Linux on Microsoft Azure is ideal for enterprises seeking to enhance their cloud environment with seamless integration, consistent performance, and comprehensive support.

Learn More
5

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.

Downloads: 2 This Week

Last Update: 2024-10-29
See Project
6

Feishu ChatGPT

Voice dialogue, role-playing, multi-topic discussion, picture creation

Feishu × (GPT-3.5 + DALL·E + Whisper) = flying-like work experience. Voice dialogue, role-playing, multi-topic discussion, picture creation, table analysis, document export. Golang language, it goes without saying! Master the gin framework proficiently, developing the backend is as natural as breathing! Familiar with the SDKs of DingTalk, Feishu, Qiwei and other platforms, and be able to develop and integrate a series of amazing functions! Proficient in platform-based detail thinking, let...

Downloads: 2 This Week

Last Update: 2023-11-20
See Project
7

onepoint

Assistant tool that integrates coding, writing, and reading functions

Onepoint is an open-source AI assistant based on Electron, designed to create the ultimate desktop productivity tool. Its initial goal was to develop a smart floating window similar to Apple's intelligent assistant that does not take up desktop space or system performance and can be quickly accessed through global hotkeys for user convenience. With ChatGPT technology, users can continuously train onepoint to generate and reconstruct content with greater accuracy (onpoint), thereby improving...

Downloads: 2 This Week

Last Update: 2024-09-03
See Project
8

NVIDIA NeMo

Toolkit for conversational AI

NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI...

Downloads: 1 This Week

Last Update: 2024-10-29
See Project
9

Amiga Memories

A walk along memory lane

Amiga Memories is a project (started & released in 2013) that aims to make video programmes that can be published on the internet. The images and sound produced by Amiga Memories are 100% automatically generated. The generator itself is implemented in Squirrel, the 3D rendering is done on GameStart 3D. An Amiga Memories video is mostly based on a narrative. The purpose of the script is to define the spoken and written content. The spoken text will be read by a voice synthesizer (Text To Speech...

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
Auth0 Free: 25K MAUs + 5-Min Setup
Enterprise Auth, Zero Friction: Any Framework • 30+ SDKs • Universal Login

Production-ready login in 10 lines of code. SSO, MFA & social auth included. Scale seamlessly beyond free tier with Okta’s enterprise security.

Get Your API Keys
10

Voice Accounting For Blind & Mute People

Free & Easy AI Voice Accounting Software For Blind & Speechless People

Just download the above zip file, extract it and then open the index.html file on internet browsers like Firefox ( preferable ) or Google Chrome. Also, please view and download my full collection of softwares for people with disabilities, here : https://sourceforge.net/projects/softwares-for-disabled-people/ This full collection also includes the Voice Accounting Software as well.

Downloads: 0 This Week

Last Update: 2024-04-30
See Project
11

Coqui STT

The deep learning toolkit for speech-to-text

... in post. With Coqui, dubbing is a delight. Effortlessly clone the voice of your talent into another language and let the clone do the dub. With text-to-speech, experience the immediacy of script-to-performance. Cast from a wide selection of high-quality, directable, emotive voices or clone a voice to suit your needs. With Coqui text-to-speech, production times go from months to minutes.

Downloads: 4 This Week

Last Update: 2022-09-03
See Project
12

eGuideDog free software for the blind

eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.

16 Reviews

Downloads: 383 This Week

Last Update: 5 days ago
See Project
13

AppFlowy

Bring projects, wikis, and teams together with AI.

... with a beautiful rich-text editor that goes beyond just text and bullet points, offering 20+ content types, easy-to-use customized themes, keyboard shortcuts, and color options. It supports real-time team collaboration, enabling you to work with your friends and teammates on the same document in real time, similar to Google Docs. AppFlowy is powered by AppFlowy AI, which is accessible, collaborative, and contextual. Supercharge any type of work in a collaborative team workspace.

Downloads: 18 This Week

Last Update: 23 hours ago
See Project
14

AI Models Downloads

Open Source AI Models Your One-Stop Destination for AI Model Downloads

The download is a sample Voice Pack Trained and Used with Coqui TTS AIModels.org provides a comprehensive directory of AI models for download, ranging from text-to-speech, large language models, to text-to-image models. Our goal is to provide you with easy access to a vast array of pre-trained models, complete with examples and demonstrations. Discover the Power of AI Models AI models have come a long way, empowering various applications that improve our lives on a daily basis. Here...

1 Review

Downloads: 3 This Week

Last Update: 2023-12-17
See Project
15

Maia

MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.

Downloads: 0 This Week

Last Update: 2024-04-21
See Project
16

artikelschreiber

Frontend and Backend Code for ArtikelSchreiber.com and UNAIQUE.NET

Frontend and Backend Code for ArtikelSchreiber.com and UNAIQUE.NET Text Generator deutsch - Dein KI Text Generator kostenlos mit Künstlicher Intelligenz The Software as a Service can be found here: SEO Optimizer: Ghost Writer - Hausarbeiten schreiben mit KI and KI Text Generator This product includes software developed by Sebastian Enger, M.Sc. Copyright (c) 2023, Sebastian Enger, M.Sc. All rights reserved. Frontend and Backend Source Code for Project: https://github.com...

Downloads: 0 This Week

Last Update: 2023-06-24
See Project
17

SpeakLogPSU

SpeakLogPSU can speak chat messages with an individual voice if the NPC or player was configured or with a default one. You will never miss if someone talks to you. Voice cloning can be accomplished with cogui in less than five minutes without GPU. The result is archived and can be used the next time in game. Some TTS projects already started to add tag support to speak text with emotions or sing it. If a game designer has that in mind with a good chat log she can voiced her game...

Downloads: 0 This Week

Last Update: 2024-10-14
See Project
18

RemoteTTS

Tool to remotely activate Text-To-Speech (TTS) on a server

The tool provides a simple TCP/UDP interface to let a remote machine perform TTS outputs.

Downloads: 0 This Week

Last Update: 2024-02-25
See Project
19

Softwares For Blind, Deaf, Handicap

Easy AI Softwares for Blind, Deaf, Handicapped, Disabled People

Just download the above zip file, extract it first and then open the index.html file on internet browsers like Firefox ( preferable ) or Google Chrome. Also, keep NumLock ON while using the Numeric Keypad of any Keyboard. Can also attach an external USB keyboard, with seperate Numeric Keypad, if required. I have added some general guidelines for students, using these softwares, on the Wiki Page of this website. Please refer them for more instructions.

Downloads: 0 This Week

Last Update: 2024-06-05
See Project
20

Mycroft

Mycroft Core, the Mycroft Artificial Intelligence platform

Mycroft is the world’s leading open source voice assistant. It is private by default and completely customizable. Our software runs on many platforms, on desktop, our reference hardware, a Raspberry Pi, or your own custom hardware. Our open-source, modular system can be ported to your device or environment, at any price point. Whether you make voice-assistants, televisions, or microwaves. Whether you have a 5-room BnB or a 1000-room hotel. Your customers will get access to all the necessities...

Downloads: 63 This Week

Last Update: 2023-03-21
See Project
21

Project Alice

Main repository of Project Alice, contains main unit source code

Project Alice is a smart voice home assistant that is completely modular and extensible. It was first built around Snips therefore runs entirely offline and never sends or shares your voice interactions with anyone, Project Alice guarantees your privacy in your home or wherever you’re using Project Alice. However, as an option, since we've built Project Alice on top of Snips, Project Alice can be configured to use some online alternatives and fall backs (for example, using Amazon or Google’s...

Downloads: 1 This Week

Last Update: 2023-04-19
See Project
22

Jarvis Python AI Assistant

Python AI assistant

Jarvis is a voice commanding assistant service in Python 3.8 It can recognize human speech, talk to user and execute basic commands. Opens a web page (e.g 'Jarvis open youtube') Play music in Youtube (e.g 'Jarvis play mozart') Increase/decrease the speakers master volume (also can set max/mute speakers volume) (e.g 'Jarvis volume up!') Opens libreoffice suite applications (calc, writer, impress) (e.g 'Jarvis open calc') Tells about something, by searching on the internet (e.g 'Jarvis tells me...

Downloads: 12 This Week

Last Update: 2023-04-19
See Project
23

Parakeet

PAddle PARAllel text-to-speech toolKIT

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN) Parakeet aims to provide a flexible, efficient and state-of-the-art text-to-speech toolkit for the open-source community. It is built on PaddlePaddle dynamic graph and includes many influential TTS models. In order to facilitate exploiting the existing TTS models directly and developing the new ones, Parakeet selects typical models and provides...

Downloads: 0 This Week

Last Update: 2023-03-24
See Project
24

Translate-Subtitle-File

Subtitle Creation Assistant

Subtitle group machine translation assistant - [Function 1: Translate subtitle file] .srt .ass .vtt [Function 2: Voice to text] (Drag in video or audio to recognize subtitles) (The latest version v4.1.0 Update time 2021 2 May 23) 12 translation service providers can be configured, such as Google, Baidu, Tencent, Caiyun, IBM, Azure, Amazon, etc. (6 voice service providers can be configured: Alibaba Cloud, Xunfei, Tencent Cloud, IBM, Azure, Amazon ) Advantages: 1. You can use multiple service...

Downloads: 1 This Week

Last Update: 2022-09-29
See Project
25

PaddlePaddle models

Pre-trained and Reproduced Deep Learning Models

... detection, image segmentation, text recognition, speech synthesis, etc. An end-to-end development kit that meets the needs of enterprises for low-cost development and rapid integration. The model library of Flying Paddle is an industrial-level model library tailored around the actual R&D process of domestic enterprises, serving enterprises in many fields such as energy, finance, industry, and agriculture.

Downloads: 0 This Week

Last Update: 2022-08-01
See Project