Search Results for "open source speech to text software" - Page 11

Sort By:

Showing 542 open source projects for "open source speech to text software"

View related business solutions

Python Clear Filters & Widen Search

Build Securely on Azure with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

sense2vec

Contextually-keyed word vectors

sense2vec (Trask et. al, 2015) is a nice twist on word2vec that lets you learn more interesting and detailed word vectors. This library is a simple Python implementation for loading, querying and training sense2vec models. For more details, check out our blog post. To explore the semantic similarities across all Reddit comments of 2015 and 2019, see the interactive demo.

Downloads: 4 This Week

Last Update: 2024-08-16
See Project
2

textacy

NLP, before and after spaCy

textacy is a Python library for performing a variety of natural language processing (NLP) tasks, built on the high-performance spaCy library. With the fundamentals, tokenization, part-of-speech tagging, dependency parsing, etc., delegated to another library, textacy focuses primarily on the tasks that come before and follow after.

Downloads: 0 This Week

Last Update: 2025-01-22
See Project
3

Amiga Memories

A walk along memory lane

Amiga Memories is a project (started & released in 2013) that aims to make video programmes that can be published on the internet. The images and sound produced by Amiga Memories are 100% automatically generated. The generator itself is implemented in Squirrel, the 3D rendering is done on GameStart 3D. An Amiga Memories video is mostly based on a narrative. The purpose of the script is to define the spoken and written content. The spoken text will be read by a voice synthesizer (Text To...

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
4

Neomake

Asynchronous linting and make framework for Neovim/Vim

Neomake is an asynchronous linting and build framework for Vim and Neovim that predates and inspires newer tooling in this space. It runs “makers” (linters, compilers, format checkers, test commands) in the background and surfaces results as signs, virtual text, or via quickfix/location lists. The system is highly configurable: you can define per-filetype makers, chain multiple tools, and control when they run (on save, on write, on cursor hold, or manually). Run several makers concurrently...

Downloads: 0 This Week

Last Update: 2025-10-14
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.

Start Free
5

Gester

A game engine for console based text games

Gester is a game engine for console based text game. You make text based games easily using Gester. These games can be played by using the 'gest' command. The 'gest' command will invoke the game engine which will read the game script file (.gest file) and present the game on the command line Install Gester using pip: "pip install gester" See our PyPI page for more details: https://pypi.org/project/gester

1 Review

Downloads: 0 This Week

Last Update: 2023-04-12
See Project
6

DiffSinger

Singing Voice Synthesis via Shallow Diffusion Mechanism

DiffSinger is an open-source PyTorch implementation of a diffusion-based acoustic model for singing-voice synthesis (SVS) and also text-to-speech (TTS) in a related variant. The core idea is to view generation of a sung voice (mel-spectrogram) as a diffusion process: starting from noise, the model iteratively “denoises” while being conditioned on a music score (lyrics, pitch, musical timing).

Downloads: 29 This Week

Last Update: 2025-11-28
See Project
7

Xaut (formerly Xautomation)

Python library to automate X11. Simulate mouse movement and mouse clicks, keystrokes, and manipulate windows from Python. Alter the state of caps lock, num lock, and scroll lock. Also access the X11 clipboard. The C library would be useful for those wishing to automate X from another language as well.

Downloads: 0 This Week

Last Update: 2023-02-05
See Project
8

rust.vim

Vim configuration for Rust

rust.vim is the official Vim plugin from the Rust Lang Foundation team that adds support for Rust files in Vim and Neovim. It provides file-detection (so .rs files are recognised), syntax highlighting tuned for Rust’s syntax and macros, and supports formatting via rustfmt and other language-specific tooling. The plugin also integrates with other Vim tools and linters, helping you build a full Rust-editing workflow in Vim. Because Rust’s macro system and language features are more complex...

Downloads: 0 This Week

Last Update: 2025-10-14
See Project
9

Automatic YouTube subtitle generation

Using OpenAI's Whisper to automatically generate YouTube subtitles

Automatic YouTube subtitle generation is a command-line tool that combines YouTube downloading capabilities with AI-powered transcription using Whisper models. It allows users to download videos or audio from YouTube and automatically generate subtitles or transcripts. The tool processes media locally, extracting audio and applying speech recognition to produce accurate text outputs. It supports multiple languages and can handle different Whisper model sizes, balancing performance and...

Downloads: 0 This Week

Last Update: 2026-04-24
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

Pattern

Web mining module for Python, with tools for scraping

Pattern is an open-source Python library that provides tools for web mining, natural language processing, machine learning, and network analysis. The project integrates multiple capabilities into a single framework that allows developers to collect, process, and analyze textual data from the web. It includes modules for web scraping and crawling that can retrieve information from sources such as social media platforms, search engines, and online knowledge bases.

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
11

Diffident

Moved to https://codeberg.org/andybalaam/Diffident

Downloads: 0 This Week

Last Update: 2022-11-04
See Project
12

WaveRNN

WaveRNN Vocoder + TTS

WaveRNN is a PyTorch implementation of DeepMind’s WaveRNN vocoder, bundled with a Tacotron-style TTS front end to form a complete text-to-speech stack. As a vocoder, WaveRNN models raw audio with a compact recurrent neural network that can generate high-quality waveforms more efficiently than many traditional autoregressive models. The repository includes scripts and code for preprocessing datasets such as LJSpeech, training Tacotron to produce mel spectrograms, training WaveRNN on those...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
13

VoiceSmith

[WIP] VoiceSmith makes training text to speech models easy

VoiceSmith makes it possible to train and infer on both single and multispeaker models without any coding experience. It fine-tunes a pretty solid text to speech pipeline based on a modified version of DelightfulTTS and UnivNet on your dataset. Both models were pretrained on a proprietary 5000 speaker dataset. It also provides some tools for dataset preprocessing like automatic text normalization. Windows (only CPU supported currently) or any Linux based operating system. If you want to run...

Downloads: 0 This Week

Last Update: 2023-03-24
See Project
14

ostRAT

OpenSourceTelegramRAT - Remote PC access via Telegram Bot.

ostRAT is free and open source. GPLv3 Сomputer remote control software. Works via telegram bot. A lot of functions, for example: - Screenshot: sends a screenshot - Off: turns off the computer - Url: opens entered link - Write: sends your text to the computer - Move: changes mouse location with x and y - and more! WARNING: Using the bot is recommended only on your device.

2 Reviews

Downloads: 0 This Week

Last Update: 2022-11-01
See Project
15

AI Atelier

Based on the Disco Diffusion, version of the AI art creation software

Based on the Disco Diffusion, we have developed a Chinese & English version of the AI art creation software "AI Atelier". We offer both Text-To-Image models (Disco Diffusion and VQGAN+CLIP) and Text-To-Text (GPT-J-6B and GPT-NEOX-20B) as options. Making available complete source code of licensed works and modifications, which include larger works using a licensed work, under the same license. Copyright and license notices must be preserved. When a modified version is used to provide a...

Downloads: 1 This Week

Last Update: 2023-03-23
See Project
16

Fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python

Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated in the...

Downloads: 0 This Week

Last Update: 2022-06-27
See Project
17

Tensorflow Transformers

State of the art faster Transformer with Tensorflow 2.0

Imagine auto-regressive generation to be 90x faster. tf-transformers (Tensorflow Transformers) is designed to harness the full power of Tensorflow 2, designed specifically for Transformer based architecture. These models can be applied on text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. Images, for tasks like image classification, object detection, and segmentation. Audio, for tasks like...

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
18

AugLy

A data augmentations library for audio, image, text, and video

AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...

Downloads: 0 This Week

Last Update: 2022-03-29
See Project
19

OpenPrompt

An Open-Source Framework for Prompt-Learning

Prompt-learning is the latest paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks, which modifies the input text with a textual template and directly uses PLMs to conduct pre-trained tasks. OpenPrompt is a library built upon PyTorch and provides a standard, flexible and extensible framework to deploy the prompt-learning pipeline. OpenPrompt supports loading PLMs directly from huggingface transformers. In the future, we will also support PLMs implemented by other...

Downloads: 0 This Week

Last Update: 2022-08-10
See Project
20

Deep Daze

Simple command line tool for text to image generation

Simple command-line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). In true deep learning fashion, more layers will yield better results. Default is at 16, but can be increased to 32 depending on your resources. Technique first devised and shared by Mario Klingemann, it allows you to prime the generator network with a starting image, before being steered towards the text. Simply specify the path to the image you wish to use, and...

Downloads: 0 This Week

Last Update: 2022-03-13
See Project
21

Buzz XML/OPML editor

Buzz is a fast graphical editor for XML files with special support for OPML. Using the OPML convergence tools it will edit about any outline and many forms of indented text, including Python. In fact, Buzz was written with Buzz! It is written in P

Downloads: 4 This Week

Last Update: 2022-05-01
See Project
22

Interpret-Text

State-of-the-art explainers for text-based machine learning models

A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard. Interpret-Text builds on Interpret, an open source python package for training interpretable models and helping to explain blackbox machine learning systems. We have added extensions to support text models. Interpret-Text incorporates community-developed interpretability techniques for NLP models and a visualization dashboard to view the results....

Downloads: 0 This Week

Last Update: 2023-12-19
See Project
23

Rank-BM25

A Collection of BM25 Algorithms in Python

A collection of algorithms for querying a set of documents and returning the ones most relevant to the query. The most common use case for these algorithms is, as you might have guessed, to create search engines.

Downloads: 1 This Week

Last Update: 2024-11-01
See Project
24

Text-ly

Text.ly - An alternative for Notepad.

LOOKING FOR Text Editor? You've Come At The Right Place! Editing Your text for your simplicity A Text editor for Editing Text....! Just download and install and use as an alternative for typical Notepad. This application is compiled from the Pyinstaller library so don't mind there is a vulnerability or something the antivirus program might show it as malware or trojan this happens with most of the apps compiled from the Pyinstaller library. So No worries There is not any malware or virus...

Downloads: 0 This Week

Last Update: 2022-01-29
See Project
25

Model Search

Framework that implements AutoML algorithms

Model Search is an AutoML research system for discovering neural network architectures with minimal human intervention. Instead of hand-crafting models, you define a search space and objectives, then the system explores candidate architectures using controllers and population-based strategies. It supports multiple tasks (such as vision or text) by letting you express reusable building blocks—layers, cells, and topologies—that the search can recombine. Training, evaluation, and promotion of...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project