Showing 189 open source projects for "speech"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 1
    Transcoder

    Transcoder

    Hardware-accelerated video transcoding using Android MediaCodec APIs

    Transcoder by DeepMedia is an AI-powered video-to-video speech translation engine that enables fully automated multilingual dubbing. Unlike traditional speech translation systems that rely on multi-stage pipelines, Transcoder directly translates one speaker’s video into another language while preserving facial expressions, lip-sync, and vocal identity. Designed for real-time use and production-grade pipelines, Transcoder combines advanced deep learning models with GPU acceleration to deliver high-quality translations across languages. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Stanford CoreNLP

    Stanford CoreNLP

    Stanford CoreNLP, a Java suite of core NLP tools

    ...Pipelines produce CoreDocuments, data objects that contain all of the annotation information, accessible with a simple API, and serializable to a Google Protocol Buffer. CoreNLP generates a variety of linguistic annotations, including parts of speech, named entities, dependency parses, and coreference.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3

    speech intonator

    The purpose of the project is to develop audio processing algorithms

    The initial version of the main branch of the project has been completed. The main name of the project is "Java audio mixer Summaha". The second name of the project is "Sound Arithmometer". Main purpose - production of musical sound remixes from a set of samples. The name "Summaha" rhymes well with 'Yamaha' and creates motivation and inspiration to achieve a sound quality comparable to with a well-known brand. Detailed documentation in 'read' signature files. Anyone who is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Apache OpenNLP

    Apache OpenNLP

    Apache OpenNLP

    Apache OpenNLP is a machine learning-based NLP library that provides tools for text-processing tasks such as tokenization, sentence segmentation, and named entity recognition.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Free and Open Source HR Software Icon
    Free and Open Source HR Software

    OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.

    Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
    Learn More
  • 5
    AnySoftKeyboard

    AnySoftKeyboard

    Android (f/w 2.1+) on screen keyboard for multiple languages

    The only Android keyboard you'll ever need. Free as in speech and Free as in beer. Android (f/w 4.0.3+, API level 15+) on screen keyboard for multiple languages.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    elevenlabs-api

    elevenlabs-api

    elevenlabs-api is an open source Java wrapper around the ElevenLabs

    ...For any public repository security, you should store your API key in an environment variable, or external from your source code. The most realistic and versatile AI speech software, ever. Eleven brings the most compelling, rich and lifelike voices to creators and publishers seeking the ultimate tools for storytelling. Generate top-quality spoken audio in any voice and style with the most advanced and multipurpose AI speech tool out there. Our deep learning model renders human intonation and inflections with unprecedented fidelity and adjusts delivery based on context.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    Smile

    Smile

    Statistical machine intelligence and learning engine

    Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers the state-of-art performance. Compared to this third-party benchmark, Smile outperforms R, Python, Spark, H2O, xgboost significantly. Smile is a couple of times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 282 This Week
    Last Update:
    See Project
  • 9
    Google2SRT

    Google2SRT

    Download, save and convert multiple subtitles from YouTube videos

    Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type video's URL, Google2SRT will do the rest.
    Downloads: 71 This Week
    Last Update:
    See Project
  • Rent Manager Software Icon
    Rent Manager Software

    Landlords, multi-family homes, manufactured home communities, single family homes, associations, commercial properties and mixed portfolios.

    Rent Manager is award-winning property management software built for residential, commercial, and short-term-stay portfolios of any size. The program’s fully customizable features include a double-entry accounting system, maintenance management/scheduling, marketing integration, mobile applications, more than 450 insightful reports, and an API that integrates with the best PropTech providers on the market.
    Learn More
  • 10
    A series of open source files and programs available to use for developing programs to work with the WowWee Robotics RSMedia Robot. These include a USB serial console, a cross-compiler, a firmware dump program, text-to-speech and source code.
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Conversations

    Conversations

    App in java for chatting to a generative A.I. (involving tts and stt)

    Java application for chatting to generative AI Llama3. * The user can speak into the microphone (speechToText), edit the recognized text and send it to the AI. * The AI ​​responds and the server returns that response in real time, and the sentences converted to audio (textToSpeech), and the application broadcasts them through the speaker. The application is prepared so that only one user occupies the server's resources, so if the server is busy, in theory it will not let you...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Social World

    a software project that tries to visualize people's social behaviour

    Social World is a software project that tries to visualize people's social behaviour. Therefore the actions, the speech and the physical and mental constitution are part of a simulation process that calculates, how attributes of creatures and characters change and how they act and react.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Intelligent Java

    Intelligent Java

    Integrate with the latest language models, image generation and speech

    ...Generate audio from text; Access DeepMind’s speech models. The only dependencies is GSON. Required to add manually when using IntelliJava jar. However, if you imported this repo through Maven, it will handle the dependencies.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14

    navmol-ch

    A fork of the navmol (https://sourceforge.net/projects/navmol/)

    NavMol with practical improvements, the addition of menus, the support of Mandarin, the addition of the text-to-speech, the implementation of the interrupt function of speech, and the full internationalization of text, easier and more convenient to be used.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    VnCoreNLP

    VnCoreNLP

    A Vietnamese natural language processing toolkit

    VnCoreNLP is a Java-based natural language processing toolkit tailored for Vietnamese. It offers a fast and accurate pipeline for essential NLP tasks, facilitating research and application development in Vietnamese language processing. ​
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Jason is a fully-fledged interpreter for an extended version of AgentSpeak, a BDI agent-oriented logic programming language, and is implemented in Java. Using JADE a multi-agent system can be distributed over a network effortlessly. This project was moved to https://jason-lang.github.io
    Downloads: 17 This Week
    Last Update:
    See Project
  • 17
    MaryTTS

    MaryTTS

    An open-source, multilingual text-to-speech synthesis system

    MaryTTS is an open-source, multilingual Text-to-Speech Synthesis platform written in Java. It was originally developed as a collaborative project of DFKI’s Language Technology Lab and the Institute of Phonetics at Saarland University. It is now maintained by the Multimodal Speech Processing Group in the Cluster of Excellence MMCI and DFKI. As of version 5.2, MaryTTS supports German, British and American English, French, Italian, Luxembourgish, Russian, Swedish, Telugu, and Turkish; more languages are in preparation. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 18
    Live Transcribe Speech Engine

    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    Live Transcribe Speech Engine provides on-device speech recognition components that power real-time transcription for accessibility and everyday voice-first experiences. Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    AzVoice is an Augmentative Communications System designed for use on common off the shelf hardware. It is intended to replace more expensive systems such as DynaVox(R) and the like. The AzVoice talker gives speech to the speech impaired.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    OpenOffice.org Export As DAISY
    odt2daisy is an OpenOffice.org Writer extension, enabling to export in DAISY XML, Full DAISY (xml+audio) and Audiobook format. DAISY is an NISO Z39.86 standard for blind, visual impaired, print-disabled, and learning-disabled people.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Korean Analyzer Rhino

    Korean Analyzer Rhino

    Parsing Korean words by morpheme and part-of-speech

    RHINO parses Korean words by morpheme and part-of-speech. Its dictionaries are based on Korean Modern Tagged Corpus(12 million phrases scale) which was made by Korean government. So it analyses many cases of stems and endings. And the newly developed Dynamic Dictionary Technology can make words to react with their context. That is, a programmed database. For more information see the files in the help folder.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    ASR for Medical Reporting

    Automatic speech recognition system for medical reporting in spanish.

    This is a functional prototype of automatic speech recognition system for medical reporting in Spanish using CMU Sphinx4 ASR toolkit. This ASR use pre-trained acoustic model and context dependent language model in nuclear medicine diagnostics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Cabos

    Cabos

    Cross-platform Gnutella file-sharing servent in Xojo with Java backend

    ...Like the original Cabos, this continuation is licenced under the GNU General Public Licence. Note: Cabos 0.8.2 and earlier for MacOS is 32-bit only. Cabos is free as in speech and free as in beer. We have a Mercurial repository, so you can work on the source any time! Happy sharing
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    XR3Player

    XR3Player

    Dominant JavaFX Advanced Media Player

    This project is on Github now : https://github.com/goxr3plus/XR3Player
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25
    medSLT - The Medical Speech Translator
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next