Join/Login
Open Source Software
Business Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Open Source Software

Business Software

Articles
Case Studies
Learn
Blog
SourceForge Podcast

Menu

Help
Create
Join
Login

Home
Browse Open Source
Search Results

Search Results for "simple voice recognition"

x

Sort By:

Relevance

OS

Windows 169
Linux 139
Mac 109
More...
BSD 52
ChromeOS 31
Mobile Operating Systems 26
Desktop Operating Systems 4
Server Operating Systems 1

Category

Artificial Intelligence 124
Multimedia 50
Software Development 28
Scientific/Engineering 25
Communications 24
System 14
Business 11
Games 9
Internet 8
Education 7
Text Editors 4
Desktop Environment 3
Security 3
Formats and Protocols 2
Mobile 2
Productivity 2
Religion and Philosophy 1
Terminals 1

License

OSI-Approved Open Source 150
Public Domain 8
Other License 6
Creative Commons Attribution License 4
More...
GNU Free Documentation License 2

Translations

English 53
German 8
Spanish 6
French 5
More...
Italian 4
Russian 3
Brazilian Portuguese 2
Dutch 2
Norwegian 2
Afrikaans 1
Arabic 1
Bengali 1
Chinese (Simplified) 1
Czech 1
Danish 1
Finnish 1
Hindi 1
Javanese 1
Korean 1
Latin 1
Polish 1
Scottish Gaelic 1
Thai 1
Turkish 1
Welsh 1
Western Frisian 1

Programming Language

Status

Production/Stable 42
Beta 33
Alpha 22
Planning 14
More...
Pre-Alpha 12
Mature 4
Inactive 1

Showing 234 open source projects for "simple voice recognition"

View related business solutions

Employee monitoring software with screenshots
Clear visibility and insights into how employees work. Even remotely.

Stay productive working at any distance from anywhere with Monitask.

Learn More
Achieve perfect load balancing with a flexible Open Source Load Balancer
Take advantage of Open Source Load Balancer to elevate your business security and IT infrastructure with a custom ADC Solution.

Boost application security and continuity with SKUDONET ADC, our Open Source Load Balancer, that maximizes IT infrastructure flexibility. Additionally, save up to $470 K per incident with AI and SKUDONET solutions, further enhancing your organization’s risk management and cost-efficiency strategies.

Learn More
1

TTS Voice Wizard

Speech to Text to Speech, sends text as OSC messages

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) Use TTS Voice Wizard's accessibility features to improve your VRChat experience (it works outside of VRChat too!) You can convert your Speech-to-Text and back to Speech through various Speech Recognition and Text-to-Speech methods. You can send what you say as OSC messages to VRChat to be displayed on your avatar using KillFrenzyAvatarText or VRChats...

Downloads: 21 This Week

Last Update: 2024-04-11
See Project
2

Vosk Speech Recognition Toolkit

Offline speech recognition API for Android, iOS, Raspberry Pi

Vosk is an offline open source speech recognition toolkit. It enables speech recognition for 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech, Polish. More to come. Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API...

Downloads: 21 This Week

Last Update: 2024-04-22
See Project
3

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 34 This Week

Last Update: 2023-12-07
See Project
4

Lyrebird

Simple and powerful voice changer for Linux, written with Python & GTK

Simple and powerful voice changer for Linux, written with Python & GTK.

Downloads: 14 This Week

Last Update: 2024-06-27
See Project
AI-based, Comprehensive Service Management for Businesses and IT Providers
Modular solutions for change management, asset management and more

ChangeGear provides IT staff with the functions required to manage everything from ticketing to incident, change and asset management and more. ChangeGear includes a virtual agent, self-service portals and AI-based features to support analyst and end user productivity.

Learn More
5

Signal iOS

A private messenger for iOS

Signal is a free, open source, messaging app for simple private communication with friends. Say "hello" to a different way of chatting: Signal is all about privacy, but with all the features you expect from a chat app. State-of-the-art end-to-end encryption (backed by Signal's open source protocol) keeps your chats safe. Neither we can read your messages or listen to your calls, nor anyone else. Privacy is not an optional mode, it is how Signal works. In all your messages, all your calls...

Downloads: 25 This Week

Last Update: 5 days ago
See Project
6

Tesseract.js

A pure Javascript Multilingual OCR

Tesseract.js is a pure Javascript port of the popular Tesseract OCR engine. Tesseract.js' library supports more than 100 languages, automatic text orientation and script detection, a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser and on a server with NodeJS. Tesseract.js is a javascript library that gets words in almost any spoken language out of images. The main Tesseract.js functions (ex. recognize, detect) take an image...

Downloads: 19 This Week

Last Update: 2024-05-07
See Project
7

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.

Downloads: 8 This Week

Last Update: 2 days ago
See Project
8

Tox

A New Kind of Instant Messaging

Tox is a peer to peer (serverless) instant messenger that focuses on security and privacy. In today's world where digital surveillance is rampant, Tox offers users a communication software alternative that's free from prying eyes and ears, and is, quite literally free and without advertising. Tox comes with all the great features you'd expect from an instant messenger application, including voice calls, video calls, file sharing and screen sharing. Everything done on Tox is encrypted using...

Downloads: 8 This Week

Last Update: 2024-03-28
See Project
9

React Native Voice

React Native Voice Recognition library for iOS and Android

A speech-to-text library for React Native. Manually or automatically link the NativeModule. Drag the Voice.xcodeproj from the @react-native-voice/voice/ios folder to the Libraries group on Xcode in your project. Click on your main project file (the one that represents the .xcodeproj) select Build Phases and drag the static library, lib.Voice.a, from the Libraries/Voice.xcodeproj/Products folder to Link Binary With Libraries. The plugin provides props for extra customization. Every time you...

Downloads: 0 This Week

Last Update: 2023-06-21
See Project
The Voice API that just works | Twilio
Build a scalable voice experience with the API that's connecting millions around the world.

With Twilio Voice, you can build unique phone call experiences with one API, to create, receive, control and monitor calls with just a few lines of code. Create an engaging voice experience that you can quickly scale and modify with a wide array of customization options and resources.

Learn More
10

Alan AI for Android

Assistant SDK to build a multimodal conversational UX for Android

...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. Voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.

Downloads: 2 This Week

Last Update: 2024-07-01
See Project
11

OpenaiBot

Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant

If you don't have the instant messaging platform you need or you want to develop a new application, you are welcome to contribute to this repository. You can develop a new Controller by using Event.py. Compatibility with multiple LLMs and integration with GPT and third-party systems is handled by our llm-kira project on GitHub. It can accurately limit billing, with limits and ID binding. Supports asynchronous operations and can handle multiple requests simultaneously. Allows for private and...

Downloads: 3 This Week

Last Update: 2024-04-29
See Project
12

NSFWJS

Client-side indecent content checking powered by TensorFlow.js

NSFWJS is a simple JavaScript library that can quickly and quite accurately identify NSFW images, all in the client's browser. It is powered by TensorFlow.js and the NSFW detection model, and delivers around 90% accuracy that is improving each time. NSFWJS classifies images with percentages under five categories, namely: drawing and neutral, which are both safe for work; sexy, which includes sexually explicit images; and hentai and porn, which are pornographic drawings and images. NSFWJS...

Downloads: 3 This Week

Last Update: 2024-03-06
See Project
13

Alan AI for iOS

In-App assistant SDK to build a multimodal conversational UX for iOS

...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. Voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.

Downloads: 0 This Week

Last Update: 2024-07-01
See Project
14

Alan AI

In-App assistant SDK to build a multimodal conversational UX websites

...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice enable your app, you only need to get the Alan Client SDK and drop it to your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.

Downloads: 1 This Week

Last Update: 6 days ago
See Project
15

Recorder

HTML5 js recording mp3 wav ogg webm amr format

... of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support. Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: from simple recording to complex real-time voice Recognition (ASR), and even audio-related games, are handled with ease.

Downloads: 1 This Week

Last Update: 2024-04-09
See Project
16

Leku

Map location picker component for Android

Map location picker component for Android. Based on Google Maps. An alternative to Google Place Picker. Component library for Android that uses Google Maps and returns a latitude, longitude and an address based on the location picked with the Activity provided. Note that you have the voice_search_extra_language that is used for the language of the voice recognition. Replace it with the allowed voice recognition locale for your language. We encourage you to add these languages to this component...

Downloads: 0 This Week

Last Update: 2024-01-10
See Project
17

Saber

The cross-platform open-source app built for handwriting

Saber is the notes app built for handwriting. It's designed to be as simple and intuitive as possible, while still delivering unique features that you'll actually use. Additionally, Saber is available across all your devices, large and small, and syncs between them seamlessly. Only you can access your notes. You can sync your notes across devices knowing that they are encrypted and stored securely, and not even the server can read them. Notably, it can invert your notes when you're in dark mode...

Downloads: 2 This Week

Last Update: 2024-08-09
See Project
18

D++

C++ Discord API Bot Library - D++ is Lightweight and scalable

D++ is a lightweight and simple library for Discord written in modern C++. It is designed to cover as much of the API specification as possible and to have an incredibly small memory footprint, even when caching large amounts of data. It is created by the developer of TriviaBot and contributed to by a dedicated team of developers.

Downloads: 2 This Week

Last Update: 2024-05-10
See Project
19

WPPConnect

WPPConnect is an open source project

WPPConnect is an open-source project developed by the JavaScript community with the aim of exporting functions from WhatsApp Web to the node, which can be used to support the creation of any interaction, such as customer service, media sending, intelligence recognition based on phrases artificial and many other things, use your imagination. We are the best WhatsApp automation solution you have been looking for. We are a team that started an OpenSource project that performs automation...

Downloads: 2 This Week

Last Update: 2024-08-05
See Project
20

NVIDIA NeMo

Toolkit for conversational AI

NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI...

Downloads: 1 This Week

Last Update: 3 days ago
See Project
21

The SpeechBrain Toolkit

A PyTorch-based Speech Toolkit

SpeechBrain is an open-source and all-in-one conversational AI toolkit. It is designed to be simple, extremely flexible, and user-friendly. Competitive or state-of-the-art performance is obtained in various domains. SpeechBrain supports state-of-the-art methods for end-to-end speech recognition, including models based on CTC, CTC+attention, transducers, transformers, and neural language models relying on recurrent neural networks and transformers. Speaker recognition is already deployed...

Downloads: 0 This Week

Last Update: 2024-02-26
See Project
22

spaCy

Industrial-strength Natural Language Processing (NLP)

..., with an accuracy within 1% of the best available. It's blazing fast, easy to install and comes with a simple and productive API.

Downloads: 1 This Week

Last Update: 2024-06-22
See Project
23

flair

A very simple framework for state-of-the-art NLP

A very simple framework for state-of-the-art NLP. Developed by Humboldt University of Berlin and friends. A powerful NLP library. Flair allows you to apply our state-of-the-art natural language processing (NLP) models to your text, such as named entity recognition (NER), sentiment analysis, part-of-speech tagging (PoS), special support for biomedical texts, sense disambiguation and classification, with support for a rapidly growing number of languages. A text embedding library. Flair has simple...

Downloads: 0 This Week

Last Update: 2024-07-31
See Project
24

Tock

Tock, the open source conversational AI toolkit

Complete and autonomous NLU solution leveraging opensource libs, such as OpenNLP, Stanford, Duckling and more. Web, mobile, social networks, smart speakers and more. Create your bot once, connect it progressively to multiple channels as you need them. Simple graphical interfaces to build stories and models, manage multilingual and multichannel bots, better understand users with analytics. Program complex stories using Kotlin, Python or Node.js provided components, or integrate with any language...

Downloads: 0 This Week

Last Update: 2024-07-11
See Project
25

spaCy models

Models for the spaCy Natural Language Processing (NLP) library

spaCy is designed to help you do real work, to build real products, or gather real insights. The library respects your time, and tries to avoid wasting it. It's easy to install, and its API is simple and productive. spaCy excels at large-scale information extraction tasks. It's written from the ground up in carefully memory-managed Cython. If your application needs to process entire web dumps, spaCy is the library you want to be using. Since its release in 2015, spaCy has become an industry...

Downloads: 0 This Week

Last Update: 2 days ago
See Project

Previous
You're on page 1
2
3
4
5
Next

Related Searches

speech recognition project

ai

text to speech software

tamil text to speech

arabic text to speech

Related Categories

Artificial Intelligence

Software Development

Scientific/Engineering

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
225 Broadway Suite 1600
San Diego, CA 92101
+1 (858) 454-5900

Resources

Support
Site Documentation
Site Status

© 2024 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: