Search Results for "simple voice recognition"

Showing 234 open source projects for "simple voice recognition"

  • 1
    TTS Voice Wizard

    Speech to Text to Speech, sends text as OSC messages

    Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) Use TTS Voice Wizard's accessibility features to improve your VRChat experience (it works outside of VRChat too!) You can convert your Speech-to-Text and back to Speech through various Speech Recognition and Text-to-Speech methods. You can send what you say as OSC messages to VRChat to be displayed on your avatar using KillFrenzyAvatarText or VRChats...
    Downloads: 21 This Week
  • 2
    Vosk Speech Recognition Toolkit

    Offline speech recognition API for Android, iOS, Raspberry Pi

    Vosk is an offline open source speech recognition toolkit. It enables speech recognition for 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech, Polish. More to come. Vosk models are small (50 MB) but provide continuous large vocabulary transcription, zero-latency response with streaming API...
    Downloads: 21 This Week
  • 3
    Whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

    Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...
    Downloads: 34 This Week
  • 4
    Lyrebird

    Simple and powerful voice changer for Linux, written with Python & GTK

    Simple and powerful voice changer for Linux, written with Python & GTK.
    Downloads: 14 This Week
  • 5
    Signal iOS

    A private messenger for iOS

    Signal is a free, open source messaging app for simple private communication with friends. Say "hello" to a different way of chatting: Signal is all about privacy, but with all the features you expect from a chat app. State-of-the-art end-to-end encryption (backed by Signal's open source protocol) keeps your chats safe. Neither we nor anyone else can read your messages or listen to your calls. Privacy is not an optional mode, it is how Signal works. In all your messages, all your calls...
    Downloads: 25 This Week
  • 6
    Tesseract.js

    A pure JavaScript multilingual OCR

    Tesseract.js is a pure JavaScript port of the popular Tesseract OCR engine. The Tesseract.js library supports more than 100 languages, automatic text orientation and script detection, and a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser or on a server with Node.js. Tesseract.js is a JavaScript library that gets words in almost any language out of images. The main Tesseract.js functions (e.g. recognize, detect) take an image...
    Downloads: 19 This Week
  • 7
    sherpa-onnx

    Speech-to-text, text-to-speech, and speaker recognition

    Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.
    Downloads: 8 This Week
  • 8
    Tox

    A New Kind of Instant Messaging

    Tox is a peer to peer (serverless) instant messenger that focuses on security and privacy. In today's world where digital surveillance is rampant, Tox offers users a communication software alternative that's free from prying eyes and ears, and is, quite literally free and without advertising. Tox comes with all the great features you'd expect from an instant messenger application, including voice calls, video calls, file sharing and screen sharing. Everything done on Tox is encrypted using...
    Downloads: 8 This Week
  • 9
    React Native Voice

    React Native Voice Recognition library for iOS and Android

    A speech-to-text library for React Native. Manually or automatically link the NativeModule. Drag the Voice.xcodeproj from the @react-native-voice/voice/ios folder to the Libraries group on Xcode in your project. Click on your main project file (the one that represents the .xcodeproj) select Build Phases and drag the static library, lib.Voice.a, from the Libraries/Voice.xcodeproj/Products folder to Link Binary With Libraries. The plugin provides props for extra customization. Every time you...
    Downloads: 0 This Week
  • 10
    Alan AI for Android

    Assistant SDK to build a multimodal conversational UX for Android

    ...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.
    Downloads: 2 This Week
  • 11
    OpenaiBot

    Refactoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant

    If you don't have the instant messaging platform you need or you want to develop a new application, you are welcome to contribute to this repository. You can develop a new Controller by using Event.py. Compatibility with multiple LLMs and integration with GPT and third-party systems is handled by our llm-kira project on GitHub. It can accurately limit billing, with limits and ID binding. Supports asynchronous operations and can handle multiple requests simultaneously. Allows for private and...
    Downloads: 3 This Week
  • 12
    NSFWJS

    Client-side indecent content checking powered by TensorFlow.js

    NSFWJS is a simple JavaScript library that can quickly and quite accurately identify NSFW images, all in the client's browser. It is powered by TensorFlow.js and the NSFW detection model, and delivers around 90% accuracy that is improving each time. NSFWJS classifies images with percentages under five categories, namely: drawing and neutral, which are both safe for work; sexy, which includes sexually explicit images; and hentai and porn, which are pornographic drawings and images. NSFWJS...
    Downloads: 3 This Week
  • 13
    Alan AI for iOS

    In-App assistant SDK to build a multimodal conversational UX for iOS

    ...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.
    Downloads: 0 This Week
  • 14
    Alan AI

    In-App assistant SDK to build a multimodal conversational UX for websites

    ...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.
    Downloads: 1 This Week
  • 15
    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    ... of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support: rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc. With powerful real-time processing support, it can be used in various web applications and handles everything from simple recording to complex real-time voice recognition (ASR), and even audio-related games, with ease.
    Downloads: 1 This Week
  • 16
    Leku

    Map location picker component for Android

    Map location picker component for Android. Based on Google Maps. An alternative to Google Place Picker. Component library for Android that uses Google Maps and returns a latitude, longitude and an address based on the location picked with the Activity provided. Note that you have the voice_search_extra_language that is used for the language of the voice recognition. Replace it with the allowed voice recognition locale for your language. We encourage you to add these languages to this component...
    Downloads: 0 This Week
  • 17
    Saber

    The cross-platform open-source app built for handwriting

    Saber is the notes app built for handwriting. It's designed to be as simple and intuitive as possible, while still delivering unique features that you'll actually use. Additionally, Saber is available across all your devices, large and small, and syncs between them seamlessly. Only you can access your notes. You can sync your notes across devices knowing that they are encrypted and stored securely, and not even the server can read them. Notably, it can invert your notes when you're in dark mode...
    Downloads: 2 This Week
  • 18
    D++

    C++ Discord API Bot Library - D++ is Lightweight and scalable

    D++ is a lightweight and simple library for Discord written in modern C++. It is designed to cover as much of the API specification as possible and to have an incredibly small memory footprint, even when caching large amounts of data. It is created by the developer of TriviaBot and contributed to by a dedicated team of developers.
    Downloads: 2 This Week
  • 19
    WPPConnect

    WPPConnect is an open source project

    WPPConnect is an open-source project developed by the JavaScript community with the aim of exporting functions from WhatsApp Web to Node, which can be used to support the creation of any interaction, such as customer service, media sending, recognition of phrases based on artificial intelligence, and many other things. Use your imagination. We are the best WhatsApp automation solution you have been looking for. We are a team that started an open-source project that performs automation...
    Downloads: 2 This Week
  • 20
    NVIDIA NeMo

    Toolkit for conversational AI

    NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI...
    Downloads: 1 This Week
  • 21
    The SpeechBrain Toolkit

    A PyTorch-based Speech Toolkit

    SpeechBrain is an open-source and all-in-one conversational AI toolkit. It is designed to be simple, extremely flexible, and user-friendly. Competitive or state-of-the-art performance is obtained in various domains. SpeechBrain supports state-of-the-art methods for end-to-end speech recognition, including models based on CTC, CTC+attention, transducers, transformers, and neural language models relying on recurrent neural networks and transformers. Speaker recognition is already deployed...
    Downloads: 0 This Week
  • 22
    spaCy

    Industrial-strength Natural Language Processing (NLP)

    ..., with an accuracy within 1% of the best available. It's blazing fast, easy to install and comes with a simple and productive API.
    Downloads: 1 This Week
  • 23
    flair

    A very simple framework for state-of-the-art NLP

    A very simple framework for state-of-the-art NLP. Developed by Humboldt University of Berlin and friends. A powerful NLP library. Flair allows you to apply our state-of-the-art natural language processing (NLP) models to your text, such as named entity recognition (NER), sentiment analysis, part-of-speech tagging (PoS), special support for biomedical texts, sense disambiguation and classification, with support for a rapidly growing number of languages. A text embedding library. Flair has simple...
    Downloads: 0 This Week
  • 24
    Tock

    Tock, the open source conversational AI toolkit

    Complete and autonomous NLU solution leveraging open source libraries such as OpenNLP, Stanford, Duckling and more. Web, mobile, social networks, smart speakers and more. Create your bot once, connect it progressively to multiple channels as you need them. Simple graphical interfaces to build stories and models, manage multilingual and multichannel bots, better understand users with analytics. Program complex stories using Kotlin, Python or Node.js provided components, or integrate with any language...
    Downloads: 0 This Week
  • 25
    spaCy models

    Models for the spaCy Natural Language Processing (NLP) library

    spaCy is designed to help you do real work, to build real products, or gather real insights. The library respects your time, and tries to avoid wasting it. It's easy to install, and its API is simple and productive. spaCy excels at large-scale information extraction tasks. It's written from the ground up in carefully memory-managed Cython. If your application needs to process entire web dumps, spaCy is the library you want to be using. Since its release in 2015, spaCy has become an industry...
    Downloads: 0 This Week