Showing 29 open source projects for "speech"

  • 1
    ESP8266Audio

    Arduino library to play MOD, WAV, FLAC, MIDI, RTTTL, MP3

    Arduino library for parsing and decoding MOD, WAV, MP3, FLAC, MIDI, AAC, and RTTTL files and playing them on an I2S DAC, or even on a software-simulated delta-sigma DAC with dynamic 32x-128x oversampling. The ESP8266 is fully supported and the most mature target; the ESP32 is also mostly supported, with its built-in DAC as well as external ones. For real-time, autonomous speech synthesis, check out ESP8266SAM, a library that uses this one and a port of an ancient formant-based synthesis program to allow your ESP8266 to talk with low memory and no network required. All this code is released under the GPL, and all of it is to be used at your own risk. If you find any bugs, please let me know via the GitHub issue tracker or drop me an email. ...
    Downloads: 1 This Week
  • 2
    A series of open source files and programs available to use for developing programs to work with the WowWee Robotics RSMedia Robot. These include a USB serial console, a cross-compiler, a firmware dump program, text-to-speech and source code.
    Downloads: 7 This Week
  • 3
    Enhanced version of Brill's Parts-of-Speech Tagger with built-in Tokenizer and Lemmatizer.
    Downloads: 32 This Week
  • 4
    Auditory Modeling Toolbox
    The Auditory Modeling Toolbox (AMT) is a Matlab/Octave toolbox for the development and application of auditory computational models. Over 50 auditory models implemented in Matlab, Octave, C, C++, and Python can be run from Matlab and Octave, on Windows and Linux. The AMT provides well-structured in-code documentation and includes the auditory data required to run the models. It integrates functionality to reproduce the model predictions. Model implementations can be evaluated in two stages,...
    Downloads: 44 This Week
  • 5
    Source code from the Research Institute for Signals, Systems and Computational Intelligence http://fich.unl.edu.ar/sinc
    Downloads: 17 This Week
  • 6
    Unitag is a language-independent Unicode-based part-of-speech tagging system. Written entirely in ANSI-compatible C, it should (in theory) compile on any OS, but has been tested on 32-bit Windows.
    Downloads: 0 This Week
  • 7

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech converter for Basque and Spanish. It includes linguistic processing and built voices for both languages. Its acoustic engine is based on hts_engine, and it uses a high-quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 2 This Week
  • 8

    AhoTTS Multilingual, a Multilingual TTS

    Text-to-Speech (TTS) for Basque, Spanish, Catalan, Galician and English

    Text-to-Speech converter for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all of these languages. Its acoustic engine is based on hts_engine, and it uses a high-quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
  • 9
    medSLT - The Medical Speech Translator
    Downloads: 0 This Week
  • 10
    Winboard 4.5 Accessible Chess

    Chess for the Blind for the JAWS or NVDA Screen Readers

    Winboard 4.5 32-bit is a free, accessible chess program for Windows that works automatically with the JAWS screen reader or the free NVDA screen reader. It is for players who are blind, have low vision, or cannot use a mouse. It provides vocal announcements of position changes and other selectable board conditions. Blind players also use a separate "tactile chess board". Winboard 4.5 has full keyboard access to move pieces and run menu items. Partially sighted players use high contrast mode and adjust board, piece,...
    Downloads: 6 This Week
  • 11
    A tool for segmenting, labeling and transcribing speech
    Downloads: 40 This Week
  • 12

    AhoTTS Iparrahotsa

    TTS for Basque Lapurdian dialect

    AhoTTS Iparrahotsa is the TTS developed at the Aholab Signal Processing Laboratory of the University of the Basque Country (UPV/EHU) for the Lapurdian dialect of Basque. This dialect is spoken in the northern part of the Basque-speaking area (French region). This project was funded by the Euroregion Aquitaine-Euskadi under grant EUSKADI-2012-004.
    Downloads: 2 This Week
  • 13
    Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.
    Downloads: 0 This Week
  • 14
    Software for speech research. It includes programs and libraries for signal processing, along with general purpose scientific libraries. Most of the code is in Python, with C/C++ supporting code. Also, contains code releases corresponding to publishe
    Downloads: 0 This Week
  • 15
    SAMI

    Home Automation System for A Typical Apartment

    SAMI is an extensible, voice-controlled home automation system which seamlessly controls all of the pieces of your apartment or house, without installation or monthly fees. For more information about how to use her, see the documentation tab!
    Downloads: 5 This Week
  • 16
    Dhvani is a text-to-speech system for Indic languages. The current C implementation for GNU/Linux supports Hindi, Kannada, Marathi, Malayalam, Gujarati, Bengali, Telugu, Panjabi, Tamil and Oriya.
    Downloads: 6 This Week
  • 17
    The AK toolkit is another kit for building and using Hidden Markov Models (HMMs). Originally developed for handwritten text recognition (HTR) using Bernoulli HMMs, it also implements diagonal Gaussians and can be used for any other purpose.
    Downloads: 0 This Week
  • 18
    The Advanced Speech Signal Analysis library provides a structure for handling various file formats and a variety of analysis functions commonly used in speech processing.
    Downloads: 0 This Week
  • 19
    ESMERALDA is a development environment for statistical recognizers operating on sequential data (speech, handwriting, biological sequences). It supports continuous density Hidden Markov models, Markov chain (N-gram) models, and Gaussian mixture models.
    Downloads: 0 This Week
  • 20
    Java Voice Control System
    Downloads: 0 This Week
  • 21
    This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.
    Downloads: 0 This Week
  • 22
    The NICO Toolkit is an Artificial Neural Network (ANN) toolkit designed and optimized for speech technology applications.
    Downloads: 0 This Week
  • 23
    Scalable Language API (SLAPI): an architecture for conversational natural-language applications, including speech recognition/synthesis, semantics, and machine translation. Used on Android and other mobile app platforms.
    Downloads: 0 This Week
  • 24
    eXtace is a 3D audio visualization tool (or eye candy, depending on how you look at it). eXtace requires ESD (Esound) for its sound input source. It performs an FFT (fast Fourier transform) on audio and displays the result in various graphical modes.
    Downloads: 0 This Week
  • 25
    Augments other natural interfaces, namely handwriting and speech recognition, for entering Indian-language characters into the computer. Speech synthesis is also provided to read out local-language text.
    Downloads: 0 This Week