speech free download - SourceForge

Showing 86 open source projects for "speech"

Filters: Artificial Intelligence, Java

1

Stanford CoreNLP

Stanford CoreNLP, a Java suite of core NLP tools

...Pipelines produce CoreDocuments, data objects that contain all of the annotation information, accessible with a simple API, and serializable to a Google Protocol Buffer. CoreNLP generates a variety of linguistic annotations, including parts of speech, named entities, dependency parses, and coreference.

Downloads: 4 This Week

Last Update: 2025-06-07
2

Apache OpenNLP

Apache OpenNLP is a machine learning-based NLP library that provides tools for text-processing tasks such as tokenization, sentence segmentation, and named entity recognition.

Downloads: 0 This Week

Last Update: 2025-12-06
3

Smile

Statistical machine intelligence and learning engine

Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers state-of-the-art performance. In a third-party benchmark, Smile significantly outperforms R, Python, Spark, H2O, and xgboost, and is a couple of times faster than its closest competitor. Memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM...

Downloads: 1 This Week

Last Update: 2026-01-07
4

elevenlabs-api

elevenlabs-api is an open source Java wrapper around the ElevenLabs

...To keep a public repository secure, store your API key in an environment variable or otherwise outside your source code. The most realistic and versatile AI speech software, ever. Eleven brings the most compelling, rich and lifelike voices to creators and publishers seeking the ultimate tools for storytelling. Generate top-quality spoken audio in any voice and style with the most advanced and multipurpose AI speech tool out there. Our deep learning model renders human intonation and inflections with unprecedented fidelity and adjusts delivery based on context.

Downloads: 4 This Week

Last Update: 2023-12-25
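
The advice in this entry to keep the API key outside the source code can be sketched in plain Java. This is a minimal illustration, not part of elevenlabs-api: the class name `ApiKeyLoader` and the variable name `ELEVENLABS_API_KEY` are assumptions chosen for the example; only `System.getenv` is standard Java.

```java
import java.util.Map;
import java.util.Optional;

// Hypothetical helper: reads an API key from the environment instead of source code.
final class ApiKeyLoader {
    // Assumed variable name for illustration; use whatever name your deployment defines.
    static final String ENV_VAR = "ELEVENLABS_API_KEY";

    // Takes the environment as a map so the lookup can be exercised without real variables.
    static Optional<String> fromEnvironment(Map<String, String> env) {
        String value = env.get(ENV_VAR);
        if (value == null || value.isBlank()) {
            return Optional.empty();
        }
        return Optional.of(value.trim());
    }

    public static void main(String[] args) {
        Optional<String> key = fromEnvironment(System.getenv());
        if (key.isEmpty()) {
            System.err.println("Set " + ENV_VAR + " before running.");
            return;
        }
        // Never print the key itself; report only that it was found.
        System.out.println("API key loaded (" + key.get().length() + " characters).");
    }
}
```

Passing the environment in as a `Map` keeps the lookup separate from the process state, so the same method works with `System.getenv()` in production and with a fixed map elsewhere.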
5

eGuideDog free software for the blind

eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.

16 Reviews

Downloads: 281 This Week

Last Update: 2 days ago
6

RS Media Robot Development Kit

A series of open source files and programs available to use for developing programs to work with the WowWee Robotics RSMedia Robot. These include a USB serial console, a cross-compiler, a firmware dump program, text-to-speech and source code.

Downloads: 5 This Week

Last Update: 3 days ago
7

Google2SRT

Download, save and convert multiple subtitles from YouTube videos

Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply enter the video's URL and Google2SRT will do the rest.

34 Reviews

Downloads: 57 This Week

Last Update: 2025-01-11
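
For context on the SubRip (.srt) format mentioned in this entry: an .srt file is plain text in which each cue has a sequence number, a `start --> end` time line in `HH:MM:SS,mmm` form, and the subtitle text. The following is a minimal sketch of writing one cue; the class and method names are hypothetical, and this is not Google2SRT's own code.

```java
import java.util.Locale;

// Hypothetical helper that formats a single SubRip cue.
final class SrtCue {
    // Converts a millisecond offset to the SubRip "HH:MM:SS,mmm" timestamp form.
    static String timestamp(long millis) {
        long hours = millis / 3_600_000;
        long minutes = (millis / 60_000) % 60;
        long seconds = (millis / 1_000) % 60;
        long ms = millis % 1_000;
        // Locale.ROOT keeps the digits ASCII regardless of the default locale.
        return String.format(Locale.ROOT, "%02d:%02d:%02d,%03d", hours, minutes, seconds, ms);
    }

    // Builds one cue: sequence number, time range, then the subtitle text.
    static String cue(int index, long startMs, long endMs, String text) {
        return index + "\n"
                + timestamp(startMs) + " --> " + timestamp(endMs) + "\n"
                + text + "\n";
    }

    public static void main(String[] args) {
        System.out.print(cue(1, 1_000, 3_500, "Hello"));
    }
}
```

In a complete file, consecutive cues are separated by a blank line.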
8

Conversations

Java app for chatting with a generative AI (using TTS and STT)

Java application for chatting with the generative AI Llama3.
* The user can speak into the microphone (speechToText), edit the recognized text and send it to the AI.
* The AI responds and the server returns the response in real time; the sentences are converted to audio (textToSpeech) and the application plays them through the speaker.
The application is designed so that only one user occupies the server's resources at a time, so if the server is busy, in theory it will not let you...

Downloads: 0 This Week

Last Update: 2025-10-15
9

Intelligent Java

Integrate with the latest language models, image generation and speech

...Generate audio from text; access DeepMind’s speech models. The only dependency is GSON, which must be added manually when using the IntelliJava jar. If you import the project through Maven, however, it will handle the dependency.

Downloads: 6 This Week

Last Update: 2023-04-16
10

navmol-ch

A fork of navmol (https://sourceforge.net/projects/navmol/)

NavMol with practical improvements: added menus, Mandarin support, text-to-speech, the ability to interrupt speech, and full internationalization of text, making it easier and more convenient to use.

Downloads: 0 This Week

Last Update: 2023-06-08
11

VnCoreNLP

A Vietnamese natural language processing toolkit

VnCoreNLP is a Java-based natural language processing toolkit tailored for Vietnamese. It offers a fast and accurate pipeline for essential NLP tasks, facilitating research and application development in Vietnamese language processing.

Downloads: 1 This Week

Last Update: 2025-04-25
12

jason

Jason is a fully-fledged interpreter for an extended version of AgentSpeak, a BDI agent-oriented logic programming language, and is implemented in Java. Using JADE, a multi-agent system can be distributed over a network effortlessly. This project has moved to https://jason-lang.github.io

Downloads: 19 This Week

Last Update: 2023-10-22
13

ASR for Medical Reporting

Automatic speech recognition system for medical reporting in Spanish.

This is a functional prototype of an automatic speech recognition (ASR) system for medical reporting in Spanish, built with the CMU Sphinx4 ASR toolkit. It uses a pre-trained acoustic model and a context-dependent language model for nuclear medicine diagnostics.

Downloads: 0 This Week

Last Update: 2020-07-15
14

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

...It is designed to integrate with your home environment and, for example, to let you build your own free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognizer CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3, users can also write their own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian

4 Reviews

Downloads: 0 This Week

Last Update: 2018-07-23
15

H.B.S.N

Speech Recognition System

H.B.S.N is a simple speech recognition program written in Java. It is a package of many sub-applications: Audio Player, Video Player, Email Client, Weather Application, Mp3 Tag Editor, Picture Viewer, Home Automation Application, Alarm / Timer, Folder Locker, Message Encrypt Application, and Income & Expenses Logging Application. Voice commands can also be used to open and close applications installed on the computer, open web sites, open folders on the HDD, control the built-in audio and video player, control the home automation system, read mail, read selected text, and speak the time and date (speaking clock) and the weather report. Each task has a default system command, and the defaults can be replaced with custom commands.

1 Review

Downloads: 0 This Week

Last Update: 2018-06-30
16

cbrTekStraktor

An application to automatically extract text from comic books.

cbrTekStraktor is an application that automatically extracts text from the text bubbles or speech balloons in comic book reader (CBR) files. Its main goal is to support analysis of comic book texts, but it can also be used for scanlation or similar purposes. The application also lets you manually define text areas in CBR files and includes a simple graphical editor for further processing of the extracted text.

Downloads: 3 This Week

Last Update: 2017-06-14
17

RDRPOSTagger

A Rule-based Part-of-Speech and Morphological Tagging Toolkit

RDRPOSTagger is a robust, easy-to-use and language-independent rule-based toolkit for Part-of-Speech (POS) and morphological tagging. It is fast in both the learning and tagging processes, and achieves very competitive accuracy in comparison to state-of-the-art results. RDRPOSTagger now supports pre-trained POS and morphological tagging models for Bulgarian, Czech, Dutch, English, French, German, Hindi, Italian, Portuguese, Spanish, Swedish, Thai and Vietnamese. ...

2 Reviews

Downloads: 0 This Week

Last Update: 2017-05-24
18

Welsh Natural Language Toolkit

...The WNLT project delivers four core NLP modules: a) Word Segmentation, for separating text into words; b) Sentence Boundary Disambiguation, for finding sentence boundaries; c) Part of Speech Tagger, for determining the part of speech of each word; d) Morphological Analyser, for identifying the root form (lemma) of words. The modules are written in Java and ‘wrapped’ for execution under the General Architecture for Text Engineering (GATE) framework. The project also includes CYMRIE, a Welsh adaptation of the GATE ANNIE Named Entity Recognition (NER) application, which covers a range of entities such as Persons, Organisations, Locations, and date and time expressions. ...

Downloads: 0 This Week

Last Update: 2017-05-26
19

XR3Capture

Take screen shots of your computer!

Capture your computer screen much more easily with this app. System requirements: Java 1.8.0_45 or later. GitHub (https://github.com/goxr3plus/XR3Capture)

1 Review

Downloads: 0 This Week

Last Update: 2017-02-10
20

Ansj Chinese word segmentation

Ansj word segmentation

A true Java implementation of ict. Its word segmentation is faster than the open source version of ict. Features: Chinese word segmentation, name recognition, part-of-speech tagging, and user-defined dictionaries. This is a Java implementation of Chinese word segmentation based on n-Gram+CRF+HMM. Segmentation speed reaches about 2 million words per second (tested on a MacBook Air), and accuracy can exceed 96%. It currently supports Chinese word segmentation, Chinese name recognition, user-defined dictionaries, keyword extraction, automatic summarization, and keyword tagging. ...

1 Review

Downloads: 1 This Week

Last Update: 2021-09-22
21

Welsh Natural Language Toolkit

WNLT is a suite of open source natural language modules for the Welsh

...The WNLT project delivers four core NLP modules: a) Word Segmentation, for separating text into words; b) Sentence Boundary Disambiguation, for finding sentence boundaries; c) Part of Speech Tagger, for determining the part of speech of each word; d) Morphological Analyser, for identifying the root form (lemma) of words. The modules are written in Java and ‘wrapped’ for execution under the General Architecture for Text Engineering (GATE) framework. The project also includes CYMRIE, a Welsh adaptation of the GATE ANNIE Named Entity Recognition (NER) application, which covers a range of entities such as Persons, Organisations, Locations, and date and time expressions.

Downloads: 0 This Week

Last Update: 2016-11-29
22

Modular Audio Recognition Framework

MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.

3 Reviews

Downloads: 0 This Week

Last Update: 2015-10-06
23

OCR For Visually Challenged Person

Provides a GUI for Tesseract OCR

It converts a scanned image into text, braille and audio formats. The image should be scanned at a resolution of at least 300 dpi for better accuracy.

Downloads: 0 This Week

Last Update: 2015-05-24
24

Hemera - Intelligent System

Hemera is a virtual intelligent system that brings together several advanced artificial intelligence technologies (speech, speech recognition, form recognition, motion recognition ...), with applications in daily tasks, domotics and robotics ...

Downloads: 0 This Week

Last Update: 2015-01-21
25

Smart Speech to Text

This software converts speech to text using a Java application and an Android application. You can also use it to search for text in Google. Offline speech to text is available if you don't have an Internet connection; the steps are in the guide file.

How to use:
1- Install software that turns the PC into a router (e.g. My Wifi Router), then connect your phone to the PC via Wi-Fi.
2- Install the Smart Text to Speech.apk file on your phone.
3- Open the "Smart Speech to Text.jar" Java application on the PC.
4- Launch Smart Speech to Text on your phone.
5- Click the "Speak Now" button in the Java application.
6- After you speak, tap the red circle button on your phone to stop recording and convert the speech to text, or wait a few seconds.

Notice: the speech that is converted depends on the language installed on your phone...

Downloads: 0 This Week

Last Update: 2014-09-17