Showing 1292 open source projects for "java voice chat"

View related business solutions
  • Build AI Apps with Gemini 3 on Vertex AI Icon
    Build AI Apps with Gemini 3 on Vertex AI

    Access Google’s most capable multimodal models. Train, test, and deploy AI with 200+ foundation models on one platform.

    Vertex AI gives developers access to Gemini 3—Google’s most advanced reasoning and coding model—plus 200+ foundation models including Claude, Llama, and Gemma. Build generative AI apps with Vertex AI Studio, customize with fine-tuning, and deploy to production with enterprise-grade MLOps. New customers get $300 in free credits.
    Try Vertex AI Free
  • Go from Data Warehouse to Data and AI platform with BigQuery Icon
    Go from Data Warehouse to Data and AI platform with BigQuery

    Build, train, and run ML models with simple SQL. Automate data prep, analysis, and predictions with built-in AI assistance from Gemini.

    BigQuery is more than a data warehouse—it's an autonomous data-to-AI platform. Use familiar SQL to train ML models, run time-series forecasts, and generate AI-powered insights with native Gemini integration. Built-in agents handle data engineering and data science workflows automatically. Get $300 in free credit, query 1 TB, and store 10 GB free monthly.
    Try BigQuery Free
  • 1
    Fun Audio Chat

    Fun Audio Chat

    Large Audio Language Model built for natural interactions

    Fun Audio Chat is an interactive voice-first conversational AI platform designed to let users engage in natural spoken dialogue with large language models in real time, turning speech into context-aware responses while maintaining a smooth back-and-forth experience. It combines speech recognition, audio processing, and AI generation so users can speak simply and receive spoken replies, enabling applications such as virtual assistants, voice bots, and hands-free chat interfaces. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Gemini Next Chat

    Gemini Next Chat

    Deploy your private Gemini application for free with one click

    Gemini Next Chat is an open-source web application that allows you to deploy your own private chat interface powered by Google’s Gemini models (e.g., Gemini 1.5, Gemini 2.0, etc.). It is built with Next.js/TypeScript and targets developers and hobbyists who want a self-hosted solution for interacting with advanced multimodal models (text, image, voice).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Telegram Desktop

    Telegram Desktop

    Telegram Desktop messaging app

    Telegram Desktop is the official C++/Qt-based cross-platform client for Telegram, implementing the full Telegram API and MTProto protocol for secure messaging, voice/video calls, file sharing, and chat features. It provides message sync across devices, supports themes, stickers, bots, and is actively maintained.
    Downloads: 165 This Week
    Last Update:
    See Project
  • 4
    Mumble

    Mumble

    Mumble is an open-source, low-latency, high quality voice chat

    Mumble is an open-source, low-latency, high-quality voice chat software. There are two modules in Mumble; the client (mumble) and the server (murmur). The client works on Windows, Linux, FreeBSD, OpenBSD, and macOS, while the server should work on anything Qt can be installed on. Low-latency and high-quality voice-chat program written on top of Qt and Opus. Administrators appreciate Mumble for being able to self-host and have control over data security and privacy. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Cut Cloud Costs with Google Compute Engine Icon
    Cut Cloud Costs with Google Compute Engine

    Save up to 91% with Spot VMs and get automatic sustained-use discounts. One free VM per month, plus $300 in credits.

    Save on compute costs with Compute Engine. Reduce your batch jobs and workload bill 60-91% with Spot VMs. Compute Engine's committed use offers customers up to 70% savings through sustained use discounts. Plus, you get one free e2-micro VM monthly and $300 credit to start.
    Try Compute Engine
  • 5
    DiscordBotClient

    DiscordBotClient

    A patched version of discord, with bot login support

    A patched version of Discord, with bot login support. Discord Bot Client allows you to use your bot, just like any other user account, except for Friends and Groups.
    Downloads: 82 This Week
    Last Update:
    See Project
  • 6
    Rasa

    Rasa

    Open source machine learning framework to automate text conversations

    Rasa is an open source machine learning framework to automate text-and voice-based conversations. With Rasa, you can build contextual assistants on Facebook Messenger, Slack, Google Hangouts, Webex Teams, Microsoft Bot Framework, Rocket.Chat, Mattermost, Telegram, and Twilio or on your own custom conversational channels. Rasa helps you build contextual assistants capable of having layered conversations with lots of back-and-forths. In order for a human to have a meaningful exchange with a...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    OpenAI Realtime Agents

    OpenAI Realtime Agents

    This is a simple demonstration of more advanced, agentic patterns

    This repository demonstrates how to build low-latency, streaming “voice + chat” agents using OpenAI’s Realtime API combined with the OpenAI Agents SDK. The demo shows patterns for connecting a realtime voice stream (audio in/out) with agents that can use tools, maintain state, and orchestrate multi-agent workflows. The SDK offers abstractions such as agent orchestration, event handling, handoffs, state management, and guardrails, tailored to support realtime, conversational systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Signal Desktop

    Signal Desktop

    Private messenger for Windows, Mac, and Linux

    Say "hello" to a different messaging experience. An unexpected focus on privacy, combined with all of the features you expect. State-of-the-art end-to-end encryption (powered by the open source Signal Protocol) keeps your conversations secure. We can't read your messages or listen to your calls, and no one else can either. Privacy isn’t an optional mode, it’s just the way that Signal works. Every message, every call, every time. Share text, voice messages, photos, videos, GIFs and files for...
    Downloads: 26 This Week
    Last Update:
    See Project
  • 9
    Qwen2-Audio

    Qwen2-Audio

    Repo of Qwen2-Audio chat & pretrained large audio language model

    Qwen2-Audio is a large audio-language model by Alibaba Cloud, part of the Qwen series. It is trained to accept various audio signal inputs (including speech, sounds, etc.) and perform both voice chat and audio analysis, producing textual responses. It supports two major modes: Voice Chat (interactive voice only input) and Audio Analysis (audio + text instructions), with both base and instruction-tuned models. It is evaluated on many benchmarks (speech recognition, translation, sound classification, emotion, etc.), and offers pretrained models (e.g. 7B) released via ModelScope and Hugging Face. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Fluxer

    Fluxer

    A free and open source instant messaging and VoIP platform

    Fluxer is a free and open-source instant messaging and VoIP platform designed to provide communities, friend groups, and teams with a self-hostable communication alternative. The project combines real-time chat, voice calling, and community features into a unified system that can be deployed independently rather than relying on centralized proprietary services. It is built with a modern TypeScript stack and focuses on delivering a clean user experience alongside flexible infrastructure options for developers who want control over their communication environment. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Tox

    Tox

    A New Kind of Instant Messaging

    Tox is a peer to peer (serverless) instant messenger that focuses on security and privacy. In today's world where digital surveillance is rampant, Tox offers users a communication software alternative that's free from prying eyes and ears, and is, quite literally free and without advertising. Tox comes with all the great features you'd expect from an instant messenger application, including voice calls, video calls, file sharing and screen sharing. Everything done on Tox is encrypted...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 12
    ChatGPT Java API

    ChatGPT Java API

    Java SDK for seamless integration with ChatGPT APIs

    ChatGPT-Java is a Java SDK created by PlexPt that offers a convenient, out-of-the-box integration with the OpenAI ChatGPT API (including models like GPT-4o, GPT-5) for Java applications. The library supports chat completions, token counting, function calls, streaming responses, proxy configuration, custom API host, and integration with frameworks such as Spring (via SseEmitter).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Element

    Element

    A glossy Matrix collaboration client for the web

    Element, formerly known as Vector and Riot, is a glossy Matrix collaboration client built using the Matrix React SDK. It offers teams, friends and organizations a secure, all in one chat app that is protected from pesky ads and data mining methods. All communications are done through the open global Matrix network, secured with end-to-end encryption. Element gives you all the services you need from a chat app: group chat, video calls, file sharing and more-- all done securely and in total...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 14
    OpenAI-Compatible Edge-TTS API

    OpenAI-Compatible Edge-TTS API

    Free, high-quality text-to-speech API endpoint to replace OpenAI

    ...The project emulates the /v1/audio/speech endpoint used by OpenAI, so any client that can talk to the OpenAI TTS API can be redirected to this service with minimal changes. It exposes parameters for input text, voice selection, audio format, and playback speed, mirroring the OpenAI interface while mapping popular OpenAI voice names to equivalent Edge voices. Because it relies on Edge’s TTS, the audio generation itself is free, and the project essentially acts as a smart proxy that handles formatting and streaming. The server supports Server-Sent Events (SSE) for streaming audio, enabling low-latency playback in chat UIs and other interactive tools. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 16
    Jovo Framework

    Jovo Framework

    The React for Voice and Chat, build apps for Alexa, Google Assistant

    The multimodal experience platform enables professional teams to build and run apps that work across smart speakers, the web, mobile, and more. Fully customizable and open source. The Jovo product ecosystem allows you to build, test, and run powerful experiences for voice, chat, and web platforms. From local development to production, Jovo allows you to build robust experiences, faster. Build across devices and platforms and use all supported modalities thanks to the Jovo output template engine. Our component and plugin architecture makes it possible to make Jovo work for your specific use case, across projects. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Qwen-Audio

    Qwen-Audio

    Chat & pretrained large audio language model proposed by Alibaba Cloud

    Qwen-Audio is a large audio-language model developed by Alibaba Cloud, built to accept various types of audio input (speech, natural sounds, music, singing) along with text input, and output text. There is also an instruction-tuned version called Qwen-Audio-Chat which supports conversational interaction (multi-round), audio + text input, creative tasks and reasoning over audio. It uses multi-task training over many different audio tasks (30+), and achieves strong multi-benchmarks performance without task-specific fine‐tuning. It includes features such as flexible multi-run chat, audio understanding/reasoning, music appreciation, and also tool usage (e.g. voice editing).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    DiscordGo

    DiscordGo

    (Golang) Go bindings for Discord

    DiscordGo is a Go package that provides low level bindings to the Discord chat client API. DiscordGo has nearly complete support for all of the Discord API endpoints, websocket interface, and voice interface. If you would like to help the DiscordGo package please use this link to add the official DiscordGo test bot dgo to your server. This provides indispensable help to this project. Construct a new Discord client which can be used to access the variety of Discord API functions and to set callback functions for Discord events. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    chatgpt-on-wechat

    chatgpt-on-wechat

    A chatbot built based on a large model

    chatgpt-on-wechat turns your WeChat client (including personal accounts) into an intelligent chatbot powered by large language models like ChatGPT, enabling automated replies, context-aware conversations, and media handling directly inside chats. It receives text and voice messages from private and group chats, forwards them to an AI model using official APIs, and returns replies that feel natural and contextually relevant, creating more engaging interactions without manual typing. Beyond...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    FastRTC

    FastRTC

    The python library for real-time communication

    ...It abstracts away much of the complexity that typically comes with implementing WebRTC by providing a simple interface — e.g. a Stream class — that can be mounted within a web backend (for example a FastAPI application). This makes it particularly well suited for building real-time voice (or video) interfaces for applications such as AI assistants, live chat, or collaborative audio/video tools. FastRTC also integrates nicely with UI frameworks (e.g. via a web demo using Gradio), so developers can rapidly prototype and deploy real-time streaming applications without deep knowledge of low-level WebRTC internals. Because voice-enabled AI agents often involve many moving parts (speech-to-text, text processing, text-to-speech, streaming, session/chat management), FastRTC helps by handling the streaming aspect, leaving the rest to be plugged in modularly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Bailing

    Bailing

    Bailing is a voice dialogue robot similar to GPT-4o

    Bailing is an open-source voice-dialogue assistant designed to deliver natural voice-based conversations by combining automatic speech recognition (ASR), voice activity detection (VAD), a large language model (LLM), and text-to-speech (TTS) in a single pipeline. Its goal is to offer a “voice-first” chat experience similar to what one might expect from a system like GPT-4o, but fully open and deployable by users.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Happy Coder

    Happy Coder

    Mobile and Web client for Codex and Claude Code, with realtime voice

    Happy is an open-source, cross-platform mobile and web client designed to bring powerful AI coding agents such as Claude Code and Codex to your fingertips no matter where you are. At its core, Happy wraps existing AI coding tools with a unified interface, providing real-time voice interactions, encrypted communication, and seamless device switching between desktop and mobile. You can start a coding session locally through the Happy CLI or connect from a phone or browser, allowing developers...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 23
    Xash3D FWGS Engine

    Xash3D FWGS Engine

    Xash3D FWGS engine

    ...The engine supports multiple rendering backends (OpenGL, GLES, software fallback), modern platforms (desktop OSes, mobile, etc.), and updated features such as better multiplayer, voice chat, IPv6 support, improved virtual filesystem, and support for modern asset formats. For modders or fans of classic games, Xash3D FWGS offers a way to play or build games using the familiar GoldSrc workflow but with updated portability and features. Legacy mods and maps for supported games can often be used unchanged (or with minimal modifications), preserving the classic experience. ...
    Downloads: 121 This Week
    Last Update:
    See Project
  • 24
    CosyVoice

    CosyVoice

    Multi-lingual large voice generation model, providing inference

    CosyVoice is a multilingual large voice generation model that offers a full-stack solution for training, inference, and deployment of high-quality TTS systems. The model supports multiple languages, including Chinese, English, Japanese, Korean, and a range of Chinese dialects such as Cantonese, Sichuanese, Shanghainese, Tianjinese, and Wuhanese. It is designed for zero-shot voice cloning and cross-lingual or mix-lingual scenarios, so a single reference voice can be used to synthesize speech...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    SafeClaw

    SafeClaw

    Chat with it via text and voice

    SafeClaw is an open-source, entirely local alternative to cloud-based AI assistants like OpenClaw, enabling users to build a personal assistant that runs on their own machine without incurring API usage charges or exposing data to third-party services. It emphasizes privacy and predictability by using traditional programming, rule-based intent parsing, and established machine learning tools rather than large language models, meaning there are no per-token API costs and deterministic...
    Downloads: 13 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB
Gen AI apps are built with MongoDB Atlas
Atlas offers built-in vector search and global availability across 125+ regions. Start building AI apps faster, all in one place.
Try Free →