Showing 57 open source projects for "audio"

  • 1
    Spring AI Alibaba Examples

    Spring AI Alibaba examples for building and testing AI apps

    ...It is designed to help developers understand core concepts, explore practical implementations, and follow best practices when building AI-powered systems using the Spring ecosystem. Each module focuses on a specific use case such as chat, image processing, audio handling, graph workflows, and retrieval-augmented generation. The examples highlight how to integrate AI models, manage prompts, handle memory, and build multi-model or multi-agent workflows. Developers can explore individual project folders for detailed instructions and implementation guidance. Spring AI Alibaba Examples also supports experimentation through playground modules and encourages contributions to expand real-world AI use cases and improve development practices.
    Downloads: 7 This Week
  • 2
    Deep Chat

    Customizable AI chat component for websites with API support

    Deep Chat is a highly customizable web component designed to simplify the integration of AI-powered chat interfaces into websites. It allows developers to embed a fully functional chatbot using minimal setup, while still offering extensive control over behavior, appearance, and integrations. Deep Chat supports connections to a wide range of AI services as well as custom backends, enabling flexible deployment for different use cases. It is built as a framework-agnostic solution, meaning it...
    Downloads: 8 This Week
  • 3
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Downloads: 177 This Week
  • 4
    MyBox

    Easy tools for PDF, image, file, network, data, and media

    Keywords: javafx-desktop-apps, pdf, image, ocr, icc, barcode, color-palette, text, bytes, markdown, html, archive, compress, digest, video, audio, editor, converter, media. Source: https://github.com/Mararsh/MyBox Self-contained packages need neither a Java environment nor installation. Jar packages need Java 16 or higher.
    Downloads: 1 This Week
  • 5
    Conversations

    Java app for chatting with a generative AI (using TTS and STT)

    ... * The user can speak into the microphone (speechToText), edit the recognized text and send it to the AI. * The AI responds, the server returns that response in real time with the sentences converted to audio (textToSpeech), and the application plays them through the speaker. The application is designed so that only one user occupies the server's resources at a time, so if the server is busy, in theory it will not let you connect. There is a demo video that shows how it works: https://frojasg1.com:8443/resource_counter/resourceCounter?...
    Downloads: 0 This Week
  • 6
    elevenlabs-api

    elevenlabs-api is an open source Java wrapper around the ElevenLabs

    ...The most realistic and versatile AI speech software, ever. Eleven brings the most compelling, rich and lifelike voices to creators and publishers seeking the ultimate tools for storytelling. Generate top-quality spoken audio in any voice and style with the most advanced and multipurpose AI speech tool out there. Our deep learning model renders human intonation and inflections with unprecedented fidelity and adjusts delivery based on context.
    Downloads: 1 This Week
  • 7
    Intelligent Java

    Integrate with the latest language models, image generation and speech

    ...Access ChatGPT, GPT3 to generate text and DALL·E to generate images. OpenAI is preferred for quality results without tuning. Generate text; Cohere allows you to generate a language model to suit your specific needs. Generate audio from text; access DeepMind’s speech models. The only dependency is GSON, which must be added manually when using the IntelliJava jar. If you import the project through Maven, the dependency is handled automatically.
    Downloads: 3 This Week
  • 8
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult-to-access resources. It is a command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image metadata, scanned documents, Maven dependencies and web resources. CRGREP will search resources within resources in any combination and to any depth, for example text within a document within a zip archive. Binary downloads and discussion (https://sourceforge.net/p/crgrep/discussion/) are hosted here. Development and issue tracking can be found at: https://bitbucket.org/cryanfuse/crgrep
    Downloads: 1 This Week
  • 9
    AlphaPlayer

    AlphaPlayer is a video animation engine

    AlphaPlayer is positioned as a multimedia or media-player library or application under ByteDance, likely intended to provide video/audio playback functionality, streaming, or media rendering capabilities. It probably serves as a foundation for building media-heavy applications — offering features like playback control, streaming support, adaptive media handling, and possibly integration with custom codecs or streaming protocols. For developers building web, desktop, or mobile applications needing stable audio/video playback and streaming — perhaps for ByteDance’s own media apps — AlphaPlayer could provide a ready-to-use or extendable library to embed media capabilities. ...
    Downloads: 5 This Week
  • 10
    jMIR

    Music research software

    jMIR is an open-source software suite implemented in Java for use in music information retrieval (MIR) research. It can be used to study music in the form of audio recordings, symbolic encodings and lyrical transcriptions, and can also mine cultural information from the Internet. It also includes tools for managing and profiling large music collections and for checking audio for production errors. jMIR includes software for extracting features, applying machine learning algorithms, applying heuristic error checkers, mining metadata and analyzing metadata.
    Downloads: 22 This Week
  • 11
    ILA - teachable voice assistant

    ILA is a fully customizable and teachable voice assistant for Java

    ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system, also known as a voice assistant, very similar to Siri, Google Now and Cortana. ILA is fully customizable, and you can teach her/him/it new things yourself, like executing system commands, opening web pages, programs and apps, or just some basic conversation :-) ILA runs on Java and thus is compatible with Windows, Mac and Linux. It is designed to integrate with your home environment and for example build up your own,...
    Downloads: 3 This Week
  • 12
    H.B.S.N

    Speech Recognition System

    H.B.S.N is simple speech recognition software written in Java. It is a package of many sub-applications: Audio Player, Video Player, Email Client, Weather Application, MP3 Tag Editor, Picture Viewer, Home Automation Application, Alarm / Timer, Folder Locker, Message Encrypt Application, and Income & Expenses Logging Application. Beyond these, voice commands can be used to: open and close applications installed on the computer; open web sites; open folders on the hard disk; control the built-in audio and video player; control the home automation system; read mail; read selected text; speak the time and date; and speak the weather report. Each task has a default system command, and the defaults can be replaced with custom commands.
    Downloads: 2 This Week
  • 13
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 0 This Week
  • 14
    OCR For Visually Challenged Person

    Provides a GUI for Tesseract OCR

    It converts scanned images into text, braille and audio formats. Images should be scanned at 300 dpi or higher for better accuracy.
    Downloads: 4 This Week
  • 15
    J-Syncker
    This application assists in generating pre-compositional material based on a computational interpretation of the 'Schillinger System of Musical Composition' (Schillinger 1946).
    Downloads: 0 This Week
  • 16
    jaivox

    Speech recognition application builder and library

    Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.
    Downloads: 0 This Week
  • 17
    Bermuda Text-to-Speech

    This project includes basic NLP and DSP techniques for Text-to-Speech

    This project is written entirely in Java and includes a set of tools and methods designed to enable multilingual Text-to-Speech (TTS) synthesis. We currently support English and Romanian, but we will soon train more models and make them available for download. If you want to read more about our other NLP and TTS tools, check out http://nlptools.racai.ro. A TTS demo is available at: http://rslp.racai.ro/index.php?page=tts
    Downloads: 1 This Week
  • 18
    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
  • 19
    A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.
    Downloads: 0 This Week
  • 20
    HMM Speech Recognition in Java
    Downloads: 1 This Week
  • 21
    High-order HMM in Java

    A duration high-order hidden Markov model (DHO-HMM) in Java.

    This project provides an implementation of a duration high-order hidden Markov model (DHO-HMM) in Java. It is compatible with JDK 5 and 6. It was used in the author's research on speech recognition of Mandarin digits. The project contains some Chinese text, and I am afraid I do not currently have time to translate it into English.
    Downloads: 0 This Week
  • 22
    DJDarwin

    A Genetic algorithm approach to creating beats.

    The program puts beats through evolution: survival, mutation, reproduction. The purpose, in the words of Afrika Bambaataa: looking for the perfect beat. Using a genetic-algorithm framework, the user's taste defines a beat fitness function: she decides which beats survive and breed, and which join the choir invisible. In addition, the user can easily define an automatic fitness function, put the program in 'autopilot' mode, and let it speed up evolution. Different types of...
    Downloads: 0 This Week
  • 23
    Anthromorphic Scribe

    Provides a speech-to-text GUI for Sphinx4

    It provides an interactive speech-to-text application that uses Sphinx4. With it you can use pre-recorded audio, record your own voice, and convert incompatible audio/video into a format compatible with Sphinx4. It currently supports U.S. English, using the HUB4 acoustic and language models.
    Downloads: 0 This Week
  • 24
    Audivolv

    Artificial intelligence evolves musical instruments played with mouse

    Artificial Intelligence (AI) evolves musical instruments that you play with the mouse. The instruments are built from small blocks of Java code that operate on numbers; each block is designed to keep all numbers in the range -1 to 1 if they started in that range, and combinations of these evolved blocks run 44100 times per second for high-quality 44.1 kHz audio. You can also write Java code directly into its window. It uses the Javassist internal compiler on the new Java code, which is assembled by renaming variables to overlap each other and by reordering, copying, and removing blocks of code. Listen to the recorded music in the files section on SourceForge or try it for yourself. This branched into my other projects (GigaLineCompile, JSelfModify, JSoundCard, and Human AI Net) as I got more into the AI and systems side of things and explored vision research as a better user interface to network minds together through the Internet. ...
    Downloads: 0 This Week
  • 25
    BayesianCortex

    simple algorithm for a realtime interactive visual cortex for painting

    ...In this early version, I'm still working on edge detection and its understanding of the same shapes at different brightnesses. This will be a module of the bigger Human AI Net project and will be used for adding realtime intuitive high dimensional intelligence in audio and visual interactions with the user.
    Downloads: 0 This Week