Showing 65 open source projects for "speech"

View related business solutions
  • Context for your AI agents Icon
    Context for your AI agents

    Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

    Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
    Try for free
  • Lightspeed golf course management software Icon
    Lightspeed golf course management software

    Lightspeed Golf is all-in-one golf course management software to help courses simplify operations, drive revenue and deliver amazing golf experiences.

    From tee sheet management, point of sale and payment processing to marketing, automation, reporting and more—Lightspeed is built for the pro shop, restaurant, back office, beverage cart and beyond.
    Learn More
  • 1
    OpenAI Translator

    OpenAI Translator

    Browser extension and cross-platform desktop app based on ChatGPT API

    ...Our tool allows for mutual translation, polishing and summarization across 55 different languages. Streaming mode is supported! It allows users to customize their translation text. One-click copying, Text-to-Speech (TTS). Available on all platforms (Windows, macOS, and Linux) for both browsers and Desktop.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    ESP8266Audio

    ESP8266Audio

    Arduino library to play MOD, WAV, FLAC, MIDI, RTTTL, MP3

    Arduino library for parsing and decoding MOD, WAV, MP3, FLAC, MIDI, AAC, and RTTL files and playing them on an I2S DAC or even using a software-simulated delta-sigma DAC with dynamic 32x-128x oversampling. ESP8266 is fully supported and most mature, but ESP32 is also mostly there with built-in DAC as well as external ones. For real-time, autonomous speech synthesis, check out ESP8266SAM, a library that uses this one and a port of an ancient format-based synthesis program to allow your ESP8266 to talk with low memory and no network required. All this code is released under the GPL, and all of it is to be used at your own risk. If you find any bugs, please let me know via the GitHub issue tracker or drop me an email. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    SPPAS

    SPPAS

    SPPAS - the automatic annotation and analyses of speech

    SPPAS is a scientific computer software package written and maintained by Brigitte Bigi of the Laboratoire Parole et Langage, in Aix-en-Provence, France. Available for free, with open source code, there is simply no other package for linguists to simple use in the automatic annotations of speech, the analyses of any kind of annotated data and the conversion of annotated files. SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic transcription. SPPAS is helpful for the analysis of any annotated data: estimate statistical distributions, make requests, manage files, visualize annotations. ...
    Downloads: 28 This Week
    Last Update:
    See Project
  • 4
    Auditory Modeling Toolbox
    The auditory modeling toolbox (AMT) is a Matlab/Octave toolbox for the development and application of auditory computational models. Over 50 auditory models implemented in Matlab, Octave, C, C++, and Python can be run from Matlab and Octave, on Windows and Linux. The AMT provides a well-structured in-code documentation, includes auditory data required to run the models. It integrates functionality to reproduce the model predictions. Model implementations can be evaluated in two stages,...
    Leader badge
    Downloads: 49 This Week
    Last Update:
    See Project
  • All-in-one security tool helps you prevent ransomware and breaches. Icon
    All-in-one security tool helps you prevent ransomware and breaches.

    SIEM + Detection and Response for IT Teams

    Blumira’s detection and response platform enables faster resolution of threats to help you stop ransomware attacks and prevent data breaches. We surface real threats, providing meaningful findings so you know what to prioritize. With our 3-step rapid response, you can automatically block known threats, use our playbooks for easy remediation, or contact our security team for additional guidance. Our responsive security team helps with onboarding, triage and ongoing consultations to continuously help your organization improve your security coverage.
    Learn More
  • 5
    Source code from the Research Institute for Signals, Systems and Computational Intelligence http://fich.unl.edu.ar/sinc
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 6
    Unitag is a language-independent Unicode-based part-of-speech tagging system. Written entirely in ANSI-compatible C, it should (in theory) compile on any OS, but has been tested on 32-bit Windows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Live Transcribe Speech Engine

    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    Live Transcribe Speech Engine provides on-device speech recognition components that power real-time transcription for accessibility and everyday voice-first experiences. Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    AhoTTS Multilingual, a Multilingual TTS

    Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English

    Text-to-Speech conversor for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • Powerful cloud-based licensing solution designed for fast-growing software businesses. Icon
    Powerful cloud-based licensing solution designed for fast-growing software businesses.

    A single-point of license control for desktop, SaaS, and mobile applications, APIs, VMs and devices.

    10Duke Enterprise is a cloud-based, scalable and flexible software licensing solution enabling software vendors to easily configure, manage and monetize the licenses they provide to their customers in real-time.
    Learn More
  • 10
    guglinatts-en

    guglinatts-en

    Guglina TTS, special edition: in English (guglinatts-en)

    Guglina TTS, special edition: in English (guglinatts-en), is a voice synthesizer originally designed for Brazilian Portuguese. Uses the Google Translate text-to-speech API. Read screens for the visually impaired. Transforms text into audio, allowing blind or low-vision people to access content displayed on the screen. Although the main target audience for text-to-speech conversion systems - such as Guglina TTS EN - is people with visual impairment, this type of program can be used by people with dyslexia and other reading disabilities, people with severe as well as by pre-literate children. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    medSLT - The Medical Speech Translator
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    ILA - teachable voice assistant

    ILA - teachable voice assistant

    ILA is a fully customizable and teachable voice assistant for Java

    ...It is designed to integrate with your home enviroment and for example build up your own, free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognition CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3 users can also write own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Al-Mintiq: Arabic eSpeak

    Al-Mintiq: Arabic eSpeak

    Arabic voice files for eSpeak system

    Arabic files and voices for eSpeak Text to speech system, المنطيق : ملفات اللغة العربية لبرنامج توليد الكلام من النص إسبيك
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14

    RDRPOSTagger

    A Rule-based Part-of-Speech and Morphological Tagging Toolkit

    RDRPOSTagger is a robust, easy-to-use and language-independent rule-based toolkit for Part-of-Speech (POS) and morphological tagging. RDRPOSTagger obtains fast performance in both learning and tagging process. RDRPOSTagger also achieves a very competitive accuracy in comparison to the state-of-the-art results. RDRPOSTagger now supports pre-trained POS and morphological tagging models for Bulgarian, Czech, Dutch, English, French, German, Hindi, Italian, Portuguese, Spanish, Swedish, Thai and Vietnamese. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Welsh Natural Language Toolkit
    ...The WNLT project delivers four core NLP modules; a) Word Segmentation for separating text into words b) Sentence Boundary Disambiguation for finding sentence boundaries c) Part of Speech Tagger for determining the part of speech of each word d) Morphological Analyser for identifying the root form (lemma) of words. The modules are written in JAVA and ‘wrapped’ for execution under the General Architecture for Text Engineering (GATE) framework. The project also includes CYMRIE an adapted version for Welsh of the GATE - ANNIE Named Entity Recognition (NER) application for a range of entities such as Persons, Organisations, Locations, and date and time expressions. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    a tool for segmenting, labeling and transcribing speech
    Downloads: 35 This Week
    Last Update:
    See Project
  • 17
    srt-translator

    srt-translator

    Subtitle translator from one natural language to other.

    Translating subtitles in format SubRip from one natural language to other. It is based on Google Translate without API and therefore without payment. Translator have automatic and manual spell checkers.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 18

    AhoTTS Iparrahotsa

    TTS for Basque Lapurdian dialect

    AhoTTS Iparrahotsa is the TTS developed at the Aholab Signal Processing Laboratory of the University of the Basque Country (UPV/EHU) for the Lapurdian dialect of Basque. This dialect is spoken at the Northern area of the Basque speaking area (French region). This project was funded by the Euroregion Aquitaine-Euskadi under grant EUSKADI-2012-004.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    Welsh Natural Language Toolkit

    Welsh Natural Language Toolkit

    WNLT is a suite of open source natural language modules for the Welsh

    ...The WNLT project delivers four core NLP modules; a) Word Segmentation for separating text into words b) Sentence Boundary Disambiguation for finding sentence boundaries c) Part of Speech Tagger for determining the part of speech of each word d) Morphological Analyser for identifying the root form (lemma) of words. The modules are written in JAVA and ‘wrapped’ for execution under the General Architecture for Text Engineering (GATE) framework. The project also includes CYMRIE an adapted version for Welsh of the GATE - ANNIE Named Entity Recognition (NER) application for a range of entities such as Persons, Organisations, Locations, and date and time expressions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 21
    ICE Nigeria

    ICE Nigeria

    Nigerian component of the International Corpus of English

    ...This is the first release of the complete corpus. The corpus can be downloaded in several parts. The written part can be downloaded as text files, xml files and xml files with parts of speech tagging, both with or without the raw files. For the spoken part the eaf files (ELAN files in xml format) together with the text files can be downloaded separately from the sound files. In addition, we provide the corpus manual as well as metadata (speaker age, gender, ethnic group and profession) and XML specifications.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 22
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23

    eNTranslator

    To aid translation of satsangs of Paramhamsa Nithyananda

    To aid translation of satsangs of Paramhamsa Nithyananda. Can be used for general purpose by others as well. This translator desktop app uses google translator to translate English text. The auto generated translations are then enriched with human alternation using an easy graphical user interface. Time stamp information may be synched and a subtitle file or a simple textual output may be generated. Additionally it is planned to use google voice tools to also add voice over from these...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    Accelerated Feature Extraction Tool

    A fast GPU accelerated feature extraction software for speech analysis

    A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is a specially designed to process very large audio data sets. It uses GPU acceleration if compatible GPU available (CUDA as weel as OpenCL, NVIDIA, AMD, and Intel GPUs are supported). CPU SSE intrinsic instruction set is used in cases where no compatible GPU present.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Wrapper for vendors to simplify usage of the Java Speech API (JSR 113). Note that the spec is an untested early access and that there may be changes in the API.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next