speech free download - SourceForge

Showing 71 open source projects for "speech"

View related business solutions

Software Development Linux Clear Filters & Widen Search

Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
1

AnySoftKeyboard

Android (f/w 2.1+) on screen keyboard for multiple languages

The only Android keyboard you'll ever need. Free as in speech and Free as in beer. Android (f/w 4.0.3+, API level 15+) on screen keyboard for multiple languages.

Downloads: 21 This Week

Last Update: 2026-02-08
See Project
2

spaCy

Industrial-strength Natural Language Processing (NLP)

spaCy is a library built on the very latest research for advanced Natural Language Processing (NLP) in Python and Cython. Since its inception it was designed to be used for real world applications-- for building real products and gathering real insights. It comes with pretrained statistical models and word vectors, convolutional neural network models, easy deep learning integration and so much more. spaCy is the fastest syntactic parser in the world according to independent benchmarks, with...

Downloads: 8 This Week

Last Update: 2026-03-29
See Project
3

Mercure

Server-sent live updates, protocol and reference implementation

...Because it is a thin layer on top of HTTP and SSE, Mercure is natively supported by modern web browsers, mobile applications, and IoT devices. A free (as in beer, and as in speech) reference server, as well as a hosted service, are available. Designed for REST and GraphQL. Private updates (JWT authorization). Presence API and subscription events. Event store. Compatible with serverless, PHP, and the like. Supports end-to-end encryption. Reading the documentation is an excellent way to discover Mercure.

Downloads: 2 This Week

Last Update: 1 day ago
See Project
4

TensorRT

C++ library for high performance inference on NVIDIA GPUs

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference. It includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for deep learning inference applications. TensorRT-based applications perform up to 40X faster than CPU-only platforms during inference. With TensorRT, you can optimize neural network models trained in all major frameworks, calibrate for lower precision with high accuracy, and deploy to hyperscale data centers,...

Downloads: 13 This Week

Last Update: 2026-03-25
See Project
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
5

OpenAI Go

The official Go library for the OpenAI API

OpenAI Go is the official Go client library for accessing the OpenAI API. It enables developers to integrate OpenAI’s models and features into Go applications with a clean and idiomatic interface. The library provides support for a wide range of API endpoints including chat completions, assistants, embeddings, image generation, audio processing, and batch jobs. It includes built-in tools for handling authentication, managing API requests, and parsing structured responses. The repository also...

Downloads: 1 This Week

Last Update: 7 days ago
See Project
6

Cookbook (Google Gemini)

Examples and guides for using the Gemini API

...It provides a structured learning path with quick-start tutorials for beginners and practical examples for advanced users. The repository covers a wide range of Gemini capabilities, including text, images, video, speech, robotics, and multimodal interactions. It highlights newly introduced features such as Gemini 2.5 models (Flash and Pro), Gemini’s native image generation, Veo for video generation, robotics-focused reasoning models, and Lyria for TTS and music generation. The Cookbook also includes tutorials on advanced API workflows such as grounding answers with external tools, batch-mode request handling, and live multimodal interactivity with LiveAPI. ...

Downloads: 3 This Week

Last Update: 9 hours ago
See Project
7

Stanza

Stanford NLP Python library for many human languages

...It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of speech and morphological features, to give a syntactic structure dependency parse, and to recognize named entities. The toolkit is designed to be parallel among more than 70 languages, using the Universal Dependencies formalism. Stanza is built with highly accurate neural network components that also enable efficient training and evaluation with your own annotated data.

Downloads: 2 This Week

Last Update: 2026-02-26
See Project
8

gse

Go efficient multilingual NLP and text segmentation

...Gse is implements jieba by golang, and try add NLP support and more feature. Support common, search engine, full mode, precise mode and HMM mode multiple word segmentation modes. Support user and embed dictionary, Part-of-speech/POS tagging, analyze segment info, stop and trim words. Support multilingual: English, Chinese, Japanese and others. Support Traditional Chinese. Support HMM cut text use Viterbi algorithm. Support NLP by TensorFlow (in work). Named Entity Recognition (in work). Supports with elastic search and bleve. run JSON RPC service.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
9

react-use

Component for React

...Tracks mouse hover state of some element. Display an element or video full-screen. Tracks location hash value. Tracks whether user is being inactive. Tracks an HTML element's intersection. Synthesizes speech from a text string. Tracks page navigation bar location state. Re-renders component, while tweening a number from 0 to 1. Tracks long press gesture of some element. Tracks state of a CSS media query. Tracks state of connected hardware devices. Returns a callback, which re-renders component when called. Tracks state of device's motion sensor. ...

Downloads: 0 This Week

Last Update: 2024-12-09
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

Bot Framework Web Chat

A highly-customizable web-based client for Azure Bot Services

This repository contains code for the Bot Framework Web Chat component. The Bot Framework Web Chat component is a highly-customizable web-based client for the Bot Framework V4 SDK. The Bot Framework SDK v4 enables developers to model conversation and build sophisticated bot applications. This repo is part of the Microsoft Bot Framework, a comprehensive framework for building enterprise-grade conversational AI experiences. Create a bot with the ability to speak, listen, understand, and learn...

Downloads: 0 This Week

Last Update: 2025-07-25
See Project
11

ESP8266Audio

Arduino library to play MOD, WAV, FLAC, MIDI, RTTTL, MP3

Arduino library for parsing and decoding MOD, WAV, MP3, FLAC, MIDI, AAC, and RTTL files and playing them on an I2S DAC or even using a software-simulated delta-sigma DAC with dynamic 32x-128x oversampling. ESP8266 is fully supported and most mature, but ESP32 is also mostly there with built-in DAC as well as external ones. For real-time, autonomous speech synthesis, check out ESP8266SAM, a library that uses this one and a port of an ancient format-based synthesis program to allow your ESP8266 to talk with low memory and no network required. All this code is released under the GPL, and all of it is to be used at your own risk. If you find any bugs, please let me know via the GitHub issue tracker or drop me an email. ...

Downloads: 0 This Week

Last Update: 2025-10-23
See Project
12

amazon-connect-wisdomjs

Gives you the power to build your own Wisdom widget

...Amazon Connect Wisdom connects relevant knowledge repositories with built-in connectors for third-party applications like Salesforce and ServiceNow, as well as internal wikis, FAQ stores, and file shares. With Wisdom, agents can search across connected repositories to find answers and quickly resolve customer issues. In addition, Wisdom uses real-time speech analytics and natural language processing (NLP) from Contact Lens for Amazon Connect to detect customer issues during calls, and then provide agents with recommendations and answers. Wisdom provides faster issue resolution and improved customer satisfaction.

Downloads: 0 This Week

Last Update: 2026-04-14
See Project
13

opencore-amr

Audio codecs extracted from Android Open Source Project

Library of OpenCORE Framework implementation of Adaptive Multi Rate Narrowband and Wideband (AMR-NB and AMR-WB) speech codec. Library of VisualOn implementation of Adaptive Multi Rate Wideband (AMR-WB) encoder and Advanced Audio Coding (AAC) encoder. Modified library of Fraunhofer AAC decoder and encoder.

19 Reviews

Downloads: 7,289 This Week

Last Update: 2025-08-21
See Project
14

JSpeech

Java library designed to integrate Speech-to-Text

jSpeech is a Java library designed to integrate Speech-to-Text (STT) capabilities, command control, and diarization (speaker identification) into applications in a simple, modular, and decoupled way.

1 Review

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
15

RemoteTTS

Tool to remotely activate Text-To-Speech (TTS) on a server

The tool provides a simple TCP/UDP interface to let a remote machine perform TTS outputs.

Downloads: 0 This Week

Last Update: 2024-02-25
See Project
16

jason

Jason is a fully-fledged interpreter for an extended version of AgentSpeak, a BDI agent-oriented logic programming language, and is implemented in Java. Using JADE a multi-agent system can be distributed over a network effortlessly. This project was moved to https://jason-lang.github.io

Downloads: 39 This Week

Last Update: 2023-10-22
See Project
17

linViex

linViex is a graphical programming environment for home automation tasks and other applications. It supports remote controls, sensor devices, power switches/dimmers, e-mail, text-to-speech conversion, media players and many more. Graphic symbols of functional objects can be interconnected to exchange and process data.

1 Review

Downloads: 0 This Week

Last Update: 2023-03-25
See Project
18

VideoSrt

Windows-GUI

...Recognize video/audio speech to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. Video recognition does not need to upload the original video, which is convenient, fast and time-saving.

Downloads: 33 This Week

Last Update: 2023-01-13
See Project
19

Flashlight library

A C++ standalone library for machine learning

Flashlight is a fast, flexible machine learning library written entirely in C++ by Facebook AI Research and the creators of Torch, TensorFlow, Eigen, and Deep Speech. Native support in C++ and simple extensibility make Flashlight a powerful research framework that's hackable to its core and enables fast iteration on new experimental setups and algorithms with little unopinionated and without sacrificing performance. In a single repository, Flashlight provides apps for research across multiple domains. ...

Downloads: 0 This Week

Last Update: 2022-05-27
See Project
20

Code Guide

Standards for developing consisten and sustainable HTML and CSS

...Enforce standards mode and more consistent rendering in every browser possible with this simple doctype at the beginning of every HTML page. Authors are encouraged to specify a lang attribute on the root html element, giving the document's language. This aids speech synthesis tools to determine what pronunciations to use.

Downloads: 1 This Week

Last Update: 2022-05-26
See Project
21

AugLy

A data augmentations library for audio, image, text, and video

...We designed AugLy to include many specific data augmentations that users perform in real life on internet platforms like Facebook's -- for example making an image into a meme, overlaying text/emojis on images/videos, reposting a screenshot from social media. While AugLy contains more generic data augmentations as well, it will be particularly useful to you if you're working on a problem like copy detection, hate speech detection, etc.

Downloads: 0 This Week

Last Update: 2022-03-29
See Project
22

SVoice (Speech Voice Separation)

We provide a PyTorch implementation of the paper Voice Separation

SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple...

Downloads: 2 This Week

Last Update: 5 days ago
See Project
23

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and compare the results. ...

Downloads: 0 This Week

Last Update: 2021-05-24
See Project
24

Euler

A distributed graph deep learning framework.

As a general data structure with strong expressive ability, graphs can be used to describe many problems in the real world, such as user networks in social scenarios, user and commodity networks in e-commerce scenarios, communication networks in telecom scenarios, and transaction networks in financial scenarios. and drug molecule networks in medical scenarios, etc. Data in the fields of text, speech, and images is easier to process into a grid-like type of Euclidean space, which is suitable for processing by existing deep learning models. Graph is a data type in non-Euclidean space and cannot be directly applied to existing methods, requiring a specially designed graph neural network system. Graph-based learning methods such as graph neural networks combine end-to-end learning with inductive reasoning, and are expected to solve a series of problems such as relational reasoning and interpretability that deep learning cannot handle.

Downloads: 0 This Week

Last Update: 2022-08-05
See Project
25

DeepLearning

Deep Learning (Flower Book) mathematical derivation

...At the same time, it also introduces deep learning techniques used by practitioners in the industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling and practical methods, and investigates topics such as natural language processing, Applications in speech recognition, computer vision, online recommender systems, bioinformatics, and video games. Finally, the Deep Learning book provides research directions covering theoretical topics including linear factor models, autoencoders, representation learning, structured probabilistic models, etc.

Downloads: 2 This Week

Last Update: 2022-08-02
See Project