Showing 24 open source projects for "speech"

  • 1
    sherpa-onnx

    Speech-to-text, text-to-speech, and speaker recognition

    Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.
    Downloads: 145 This Week
  • 2
    whisper.cpp

    Port of OpenAI's Whisper model in C/C++

    whisper.cpp is a lightweight, C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model—designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples. whisper.cpp supports integer quantization of the Whisper ggml models. ...
    Downloads: 642 This Week
  • 3
    AnySoftKeyboard

    Android (f/w 2.1+) on screen keyboard for multiple languages

    The only Android keyboard you'll ever need. Free as in speech and Free as in beer. Android (f/w 4.0.3+, API level 15+) on screen keyboard for multiple languages.
    Downloads: 7 This Week
  • 4
    Mailu

    Insular email distribution - mail server as Docker images

    Mailu is a simple yet full-featured mail server as a set of Docker images. It is free software (both as in free beer and as in free speech), open to suggestions and external contributions. The project aims to provide people with an easy-to-set-up, easy-to-maintain, full-featured mail server without shipping proprietary software or the unrelated features often found in popular groupware. Security, enforced TLS, DANE, MTA-STS, Letsencrypt!, outgoing DKIM, anti-virus scanner, Snuffleupagus, block malicious attachments. ...
    Downloads: 0 This Week
  • 5
    PearlOS

    Latest Debian- and Ubuntu-based repo files and ISOs for PearlLinuxOS

    PearlOS is Pearl Linux; it is just another branding we used early on with the distro. We are hoping to better organize our upcoming releases. All future releases, along with the most recent Scootski (12) and Preslee (13), can be found under pearlos. Our first rolling release, also called Preslee, is no longer based on both bookworm and trixie; it is now strictly trixie, which is our latest Debian release. Scootski is our latest Ubuntu base and is built from 24.04. ISOs are located under...
    Downloads: 382 This Week
  • 6
    Pearl Linux MATE 12

    The perfect desktop for all with Compiz as default Window Manager

    Pearl MATE Desktop 12 is an easy-to-customize 64-bit OS based on a mix of Ubuntu and the latest Linux Mint LTS (24.04). We have included both the fully loaded version and a minimal ISO. Natural-sounding text-to-speech with Pied Piper.
    Downloads: 32 This Week
  • 7
    Pearl MATE Studio 12

    OSX Styled Powerful Audio Workstation

    Pearl MATE Studio 12 runs on the Ubuntu 24.04 base with no snap support. This release does, however, support Flatpak. Text-to-speech on Pearl is managed with Pied, which downloads and selects natural voice models; the keyboard shortcut <Alt>+s plays back the highlighted text. This release of Pearl MATE Studio is a lot lighter on preinstalled software, so users may choose what they want by installing through our software manager, Gdebi, or Synaptic, all of which are preinstalled. ...
    Downloads: 5 This Week
  • 8
    Pearl Desktop (PDE) 12

    The Stable Solid Multimedia Workhorse Powerful OS with Eye Candy

    ...Compiz is the default window manager, and you may switch window managers without rebooting. Tons of new software. Preconfigured Samba shares for network sharing. Pied Piper handles great-sounding, natural text-to-speech voice models.
    Downloads: 19 This Week
  • 9
    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers. It uses a model trained by machine learning techniques based on Baidu's Deep Speech research paper, and it uses Google's TensorFlow to make the implementation easier.
    Downloads: 13 This Week
  • 10
    Free Queue Manager

    Web based python-flask Queue management system

    A web-based management system developed to ease the process of organizing queues and lines. Like many other Queue Management Systems (QMS), FQM provides a basic dashboard that lets users of the system and customers alike interact with it through a simple user interface. A brief user guide can be found at https://fqms.github.io/images/user_guide.pdf
    Downloads: 6 This Week
  • 11
    Ampare Speech To Amaar and Martin
    (Right-click on the form to show the menu.) This program makes your computer talk! It contains text-to-speech, hotkeys, sound, a calculator, elapsed times to know what time they are tired, and more. There are many things to try; they can speak themselv...
    Downloads: 0 This Week
  • 12
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 1 This Week
  • 13
    mp3 library, advanced ID3V1 and ID3V2 tagger, player. Organize a large mp3 library, over 40,000 songs. Speech synthesis and tag backup utilities. Scripts to maintain and organize song files.
    Downloads: 1 This Week
  • 14
    Grasshopper Web App

    Web App to control Bticino MyHome using OpenWebNet

    Grasshopper is an open source and free (speech & beer) responsive-design web application to control Bticino MyHome. Values: (1) Use any device: since Grasshopper is a browser-based application, you can use a browser on any device to access it, and thanks to its responsive-design support, the interface adapts to the screen size of your device. (2) Choice of server platform: Grasshopper is a web application that can run from different web servers.
    Downloads: 1 This Week
  • 15
    ...Although it was originally developed for German, it is mostly language independent. It allows the user to lemmatize words to be indexed, to weight terms by their parts of speech (e.g. weighting nouns more heavily than pronouns), and to add synonyms taken from GermaNet or a list you provide to the search index, thereby increasing the recall of Lucene.
    Downloads: 0 This Week
  • 16
    Farsi for eSpeak

    This project enables eSpeak to convert Farsi text to speech.

    eSpeak-Farsi is part of a larger open source project named eSpeak. eSpeak is cross-platform text-to-speech software, supporting over ninety languages at the time of this writing. It is used on Windows, Linux and Mac, and has been ported to Android as well. This project aims to use crowd-sourcing to improve the pronunciation of Farsi words in eSpeak. It was created by Shadyar Khodayari so that Farsi speakers who use eSpeak can help to develop and enhance its pronunciations (through fa_list and fa_rules). ...
    Downloads: 0 This Week
  • 17
    Dynamic tree of Java objects encapsulates hard-drive and Jar/Zip files (and their inner files) and Java objects all the same way. Create new ways of communication as executable Jar files, like a paint program that creates/uses paint programs as tools
    Downloads: 0 This Week
  • 18
    Collaborative development and distribution of Windows Speech Recognition (WSR) application macros to 1) improve the accessibility of personal computing for impaired users, and 2) improve the efficiency of personal computing for all users.
    Downloads: 0 This Week
  • 19
    Open Path is an open source, browser-based, online RPG. It aims to be the best game engine available, boasting many inbuilt features that most frameworks don't have. It is fast, powerful, robust, and free (speech and beer).
    Downloads: 0 This Week
  • 20

    Audacity-Extra

    dark themed version of free Audacity sound editor

    audacity-extra now provides a sleek dark themed version of the Audacity open source sound editor. The project experiments with Audacity variations. There's a vowel-sound target-practice display for language learners and an analog waveform data logger for embedded systems.
    Downloads: 0 This Week
  • 21
    Project c2h - cetacean to human - building Seadragon, a tool for the scientific research of the acoustic communication of cetaceans, supporting the creation, emission, and recognition of underwater whistles. The blog: http://leafyseadragon.blogspot.com/
    Downloads: 0 This Week
  • 22
    TLDP Audio Conversion: an attempt to convert the entire TLDP project into audio format for expanded use and distribution.
    Downloads: 0 This Week
  • 23
    Web Textual eXtraction Tools: C++ parallel web crawler, noun phrase identification, multilingual part-of-speech tagging, Tarjan's algorithm, co-relationship mappings...
    Downloads: 0 This Week
  • 24
    SpeaR (SpeakingRoomplan): SpeaR, the speaking room plan, reads events from a database and issues corresponding announcements in various rooms. DB <--JDBC--> Server <--JINI--> SpeakingClients
    Downloads: 0 This Week