Search Results for "speech recognition" - Page 2


Showing 228 open source projects for "speech recognition"

1

spaCy models

Models for the spaCy Natural Language Processing (NLP) library

spaCy is designed to help you do real work, to build real products, or gather real insights. The library respects your time, and tries to avoid wasting it. It's easy to install, and its API is simple and productive. spaCy excels at large-scale information extraction tasks. It's written from the ground up in carefully memory-managed Cython. If your application needs to process entire web dumps, spaCy is the library you want to be using. Since its release in 2015, spaCy has become an industry...

Downloads: 1 This Week

Last Update: 2024-08-17
See Project
2

Smile

Statistical machine intelligence and learning engine

Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers state-of-the-art performance. Compared to this third-party benchmark, Smile significantly outperforms R, Python, Spark, H2O, and xgboost. Smile is a couple of times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM...

Downloads: 1 This Week

Last Update: 2024-05-22
See Project
3

TorchAudio

Data manipulation and transformation for audio signal processing

The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the same philosophy of providing strong GPU acceleration, having a focus on trainable features through the autograd system, and having consistent style (tensor names and dimension names). Therefore, it is primarily a machine learning library and not a general signal processing library. The benefits of PyTorch can be seen in torchaudio through having all the computations be through PyTorch...

Downloads: 0 This Week

Last Update: 2024-08-22
See Project
4

gse

Go efficient multilingual NLP and text segmentation

Efficient multilingual NLP and text segmentation in Go; supports English, Chinese, Japanese, and others. Gse is an implementation of jieba in Go that aims to add NLP support and more features. It supports multiple word segmentation modes: common, search engine, full, precise, and HMM. It supports user and embedded dictionaries, part-of-speech (POS) tagging, segment info analysis, and stop and trim words. Multilingual support: English, Chinese, Japanese, and others. Supports Traditional Chinese. Supports HMM cut text use...

Downloads: 0 This Week

Last Update: 2024-08-22
See Project
5

OpenaiBot

Refactoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant

If you don't have the instant messaging platform you need or you want to develop a new application, you are welcome to contribute to this repository. You can develop a new Controller by using Event.py. Compatibility with multiple LLMs and integration with GPT and third-party systems is handled by our llm-kira project on GitHub. It can accurately limit billing, with limits and ID binding. Supports asynchronous operations and can handle multiple requests simultaneously. Allows for private and...

Downloads: 0 This Week

Last Update: 2024-04-29
See Project
6

PHP Client For NLP Cloud

NLP Cloud serves high performance pre-trained or custom models for NER

NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, dialogue summarization, paraphrasing, intent classification, product description and ad generation, chatbot, grammar and spelling correction, keywords and keyphrases extraction, text generation, image generation, blog post generation, code generation, question answering, automatic speech recognition, machine translation, language detection, semantic search, semantic...

Downloads: 0 This Week

Last Update: 2024-03-12
See Project
7

Paper2GUI

Convert AI papers to GUI

Convert AI papers to GUI, making it easy and convenient for everyone to use cutting-edge artificial intelligence technology. Paper2GUI: an AI desktop app toolbox for ordinary people. It can be used immediately without installation. It already supports 40+ AI models, covering AI painting, speech synthesis, video frame interpolation, video super-resolution, object detection, image stylization, OCR, and other fields. Supports Windows, Mac, and Linux systems...

Downloads: 0 This Week

Last Update: 2024-09-20
See Project
8

Stanza

Stanford NLP Python library for many human languages

Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts...

Downloads: 0 This Week

Last Update: 2024-09-12
See Project
9

spaCy

Industrial-strength Natural Language Processing (NLP)

spaCy is a library built on the very latest research for advanced Natural Language Processing (NLP) in Python and Cython. Since its inception it was designed to be used for real world applications-- for building real products and gathering real insights. It comes with pretrained statistical models and word vectors, convolutional neural network models, easy deep learning integration and so much more. spaCy is the fastest syntactic parser in the world according to independent benchmarks, with...

Downloads: 0 This Week

Last Update: 2024-09-14
See Project
10

VideoSrt

Windows-GUI

... to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. Video recognition does not need to upload the original video, which is convenient, fast and time-saving.

Downloads: 21 This Week

Last Update: 2023-01-13
See Project
11

Flashlight library

A C++ standalone library for machine learning

Flashlight is a fast, flexible machine learning library written entirely in C++ by Facebook AI Research and the creators of Torch, TensorFlow, Eigen, and Deep Speech. Native support in C++ and simple extensibility make Flashlight a powerful research framework that's hackable to its core and enables fast iteration on new experimental setups and algorithms while staying unopinionated and without sacrificing performance. In a single repository, Flashlight provides apps for research across multiple...

Downloads: 0 This Week

Last Update: 2022-05-27
See Project
12

React Native Voice

React Native Voice Recognition library for iOS and Android

A speech-to-text library for React Native. Manually or automatically link the NativeModule. Drag the Voice.xcodeproj from the @react-native-voice/voice/ios folder to the Libraries group on Xcode in your project. Click on your main project file (the one that represents the .xcodeproj) select Build Phases and drag the static library, lib.Voice.a, from the Libraries/Voice.xcodeproj/Products folder to Link Binary With Libraries. The plugin provides props for extra customization. Every time you...

Downloads: 0 This Week

Last Update: 2023-06-21
See Project
13

Maia

MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.

Downloads: 0 This Week

Last Update: 2024-04-21
See Project
14

Tensorflow Transformers

State of the art faster Transformer with Tensorflow 2.0

... speech recognition and audio classification. Faster autoregressive decoding, TFLite support, and simple creation of TFRecords. Auto-batching of tf.data.Dataset or tf.ragged tensors. Everything is a dictionary (inputs and outputs). Multiple mask modes such as causal, user-defined, and prefix. tensorflow-text tokenizer support. Supports GPU, TPU, and multi-GPU training with wandb, multiple callbacks, and automatic TensorBoard.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
15

Vosk Desktop

Desktop software for controlling the Vosk Speech Recognition Toolkit

Downloads: 0 This Week

Last Update: 2023-08-10
See Project
16

Alan AI for Flutter

SDK to build a multimodal conversational UX for Flutter apps

...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice-enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.

Downloads: 0 This Week

Last Update: 2023-04-19
See Project
17

Eva AI

Eva is an A.I. assistant that helps users multi-task.

... RECOGNITION STABILITY AND ACCURACY * ADDED OPTION TO SELECT SPEECH RECOGNITION ENGLISH LANGUAGE [NOTE] Eva is dependent on the Windows 10/11 ecosystem. For any issues consult the user's manual. Commands customisation tutorial: https://github.com/CSharpTeoMan911/Eva/wiki/Commands-customisation Donations: https://www.paypal.com/donate/?hosted_button_id=V5H8D2XRGRPHU You can check details about the technology at: https://github.com/CSharpTeoMan911/Eva

Downloads: 4 This Week

Last Update: 2024-09-04
See Project
18

Mice TTM

mice stt tts

This tool is being developed specifically for accessibility on Linux. It converts and parses text produced by speech recognition into dictation, and it can execute macros. It works without an internet connection because the speech recognition runs on the PC itself. It can move the mouse to named words and then select them or click by voice command. In addition, by voice command, text passages (e.g. in LibreOffice Writer) can be correspondingly...

Downloads: 0 This Week

Last Update: 2024-08-07
See Project
19

VATSG

Video automatic transcribe and translated subtitle generator

It generates an SRT-format subtitle file from a video file in any source language that Whisper supports, and then produces a translated subtitle file in a target language that DeepL supports. This subtitle generator (VATSG) uses [moviepy](https://github.com/Zulko/moviepy) to generate an mp3, then [faster-whisper](https://github.com/guillaumekln/faster-whisper) to recognize the text, and then deepl-api to generate the target-language subtitle file (SRT format). If you...

Downloads: 3 This Week

Last Update: 2023-09-19
See Project
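The last step of the VATSG pipeline described above, writing recognized segments out as an SRT file, can be sketched roughly as follows. This is a minimal illustration and not VATSG's actual code: the `Segment` class and the `srt_timestamp` and `to_srt` helpers are hypothetical names, and the segments are assumed to arrive with start and end times in seconds, as a transcription step would typically provide.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical container for one recognized span of speech."""
    start: float  # start time in seconds
    end: float    # end time in seconds
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the SRT timestamp HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments) -> str:
    """Render segments as SRT cues: index, time range, text, blank line."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        time_range = f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}"
        blocks.append(f"{index}\n{time_range}\n{seg.text.strip()}\n")
    # Joining with "\n" leaves one blank line between consecutive cues.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [Segment(0.0, 1.25, "Hello."), Segment(1.25, 3.0, "Welcome back.")]
    print(to_srt(demo))
```

Translated text could be passed through the same `to_srt` helper to produce the target-language file, since only the `text` field changes between the two outputs.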
20

Alan AI for React Native

Build a multimodal conversational UX for apps created with React

...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice-enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.

Downloads: 0 This Week

Last Update: 2023-04-19
See Project
21

Alan AI for Cordova

Assistant SDK to build a multimodal conversational UX for Apache

...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice-enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.

Downloads: 0 This Week

Last Update: 2023-04-19
See Project
22

Rhasspy

Offline private voice assistant for many human languages

Rhasspy (ˈɹæspi) is an open-source, fully offline set of voice assistant services for many human languages that works well with Hermes protocol-compatible services (Snips.AI), Home Assistant and Hass.io, Node-RED, Jeedom, OpenHAB. Rhasspy will produce JSON events that can trigger action in home automation software, such as a Node-RED flow. Rhasspy comes with a snazzy web interface that lets you configure, program, and test your voice assistant remotely from your web browser. All of the web...

Downloads: 1 This Week

Last Update: 2023-04-19
See Project
23

Kashgari

Kashgari is a production-level NLP Transfer learning framework

Kashgari is a simple and powerful NLP transfer learning framework for building state-of-the-art models in 5 minutes for named entity recognition (NER), part-of-speech (PoS) tagging, and text classification tasks.

Downloads: 0 This Week

Last Update: 2024-08-09
See Project
24

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More help: https://sourceforge.net/p/skrybotdomowy/wiki...

2 Reviews

Downloads: 4 This Week

Last Update: 2020-03-15
See Project
25

EasyGradeXL

Uses speech recognition to enter grades in an Excel workbook

This application simplifies the tedious task of entering grades in an Excel workbook by using the Google text-to-speech API. This API currently supports 137 languages and a number of dialects. The application keeps a log of the grades in the order that they are entered and provides a readback function to easily check whether the grades were entered correctly. This application was developed using Microsoft Excel Version 2108. It currently only runs under Microsoft Windows.

Downloads: 0 This Week

Last Update: 2021-12-21
See Project