Showing 201 open source projects for "speak"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    pyttsx3

    pyttsx3

    Offline Text To Speech synthesis for python

    ...The library exposes a simple but flexible API for controlling voice selection, speaking rate, volume, and other synthesis parameters from Python code. It supports both a high-level speak convenience function and a lower-level engine object with event hooks, queuing, and saving output to audio files. The repository includes examples and documentation that show how to adjust properties dynamically, persist synthesized output, and integrate pyttsx3 into GUIs or background services.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 2
    TimescaleDB

    TimescaleDB

    An open-source time-series SQL database optimized for fast ingest

    TimescaleDB is the open-source relational database for time-series and analytics. Build powerful data-intensive applications. Become instantly productive with full SQL. Rely on the same PostgreSQL you know, love, and trust. Hyperfunctions make time series easier. Achieve 10-100x faster queries than with vanilla PostgreSQL, InfluxDB, MongoDB. Write millions of data points per second per node. Horizontally scale to petabytes. Don’t worry about cardinality. Simplify your stack, ask more complex...
    Downloads: 68 This Week
    Last Update:
    See Project
  • 3
    Textream

    Textream

    Textream is a free macOS teleprompter app for streamers, interviewers

    Textream is an open-source, free macOS teleprompter application designed for streamers, podcasters, presenters, and interviewers who want a smooth, distraction-free way to stay on script. It runs natively on macOS and leverages on-device speech recognition to highlight each word in real time as you speak, keeping your focus where it belongs — on delivery rather than memorization. The interface supports multiple modes of use, such as classic constant-scroll auto-scrolling, voice-activated scrolling that pauses when you’re silent, and direct word tracking that syncs the displayed script to your spoken pace. Textream also includes flexible display modes, from a sleek notch-anchored overlay on laptops to an invisible-to-the-audience floating window or fullscreen experience.
    Downloads: 37 This Week
    Last Update:
    See Project
  • 4
    TURF

    TURF

    A modular geospatial engine written in JavaScript

    ...It includes traditional spatial operations, helper functions for creating GeoJSON data, and data classification and statistics tools. Turf can be added to your website as a client-side plugin, or you can run Turf server-side with Node.js. Modular, simple-to-understand JavaScript functions that speak GeoJSON. Turf is a collection of small modules, you only need to take what you want to use. Takes advantage of the newest algorithms and doesn't require you to send data to a server. TypeScript definitions are packaged with each module. No DefinitelyTyped packages required. Turf uses GeoJSON for all geographic data. Turf expects the data to be standard WGS84 longitude, latitude coordinates. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    OpenMAIC

    OpenMAIC

    Open Multi-Agent Interactive Classroom

    ...The platform generates multiple learning scenes rather than a single static output, including slides, quizzes, interactive simulations, and project-based activities, which makes it feel closer to a guided lesson than a simple content generator. It also supports whiteboard-style visual explanation and text-to-speech delivery, allowing agents to draw, explain formulas, and speak aloud during instruction. OpenMAIC is built for flexible deployment, with support for direct use through its web experience as well as integration with OpenClaw so classrooms can be generated from messaging platforms such as Slack, Telegram, and Feishu.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 6
    OptiKey

    OptiKey

    Open source assistive on-screen keyboard that runs on Windows

    OptiKey is an assistive on-screen keyboard designed to bring keyboard control, mouse control and speech to people with motor and speech limitations. Completely free, open source and compatible with low cost eye-tracking devices, it is a great alternative to often expensive and complicated AAC (alternative and augmentative communication) products. It can also be used as an alternative to a physical keyboard or mouse. OptiKey runs on Windows and works right out of the box once an...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    AndBible

    AndBible

    AndBible: Bible Study

    AndBible: Bible Study" is a powerful, yet easy to use, offline Bible study application for Android. The app does not aim to be simply a Bible reader, but focuses on being an advanced tool to do in-depth personal Bible study. This application is developed by Bible readers, for Bible readers. It aims at helping you make your Bible study convenient, deep and fun. The best part about this non-profit community project is that it is open-source, completely free, and contains no advertisements. A...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8
    iroh

    iroh

    IPFS reimagined

    Iroh takes IPFS beyond the theoretical, to offer developers efficient infrastructure that scales data distribution past 10M devices. We have been working on and around IPFS nearly since its inception, and are starting out on a new implementation because we have unfinished business with IPFS. We believe the best days of the protocol are ahead, but to get there we need to ship an order-of-magnitude improvement that unlocks new platforms and use cases. We're building iroh to see just how far we...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Open Source Social Network

    Open Source Social Network

    Open Source Social Network (OSSN) is a social networking software

    ...For most Linux servers running Apache v2, making the parent directory writable is very simple. If you are not running Apache v2 or are using a different type of webserver, you'll need to read your system's documentation or speak to your server administrator before proceeding.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    Open-LLM-VTuber

    Open-LLM-VTuber

    Open source AI VTuber platform with voice chat and Live2D avatars

    ...It enables hands-free conversations with large language models by combining speech recognition, language processing, and text-to-speech synthesis into a single system. Users can speak directly to the AI character, and the system can respond with a generated voice while animating a Live2D avatar to simulate a talking virtual personality. Open-LLM-VTuber is modular, allowing developers to swap or configure different language models, speech recognition engines, and voice synthesis systems depending on their needs. ...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 11
    OpenVoice

    OpenVoice

    Instant voice cloning by MIT and MyShell. Audio foundation model

    ...It is designed not only to match the timbre of the reference voice, but also to give granular control over style parameters such as emotion, accent, rhythm, pauses, and intonation. The model supports cross-lingual and even zero-shot cross-lingual voice cloning, so a speaker recorded in one language can be made to speak naturally in others. Architecturally, OpenVoice separates “tone color” cloning from style control, which makes it easier to keep a consistent identity while flexibly changing prosody or language. The project provides open-weight models, inference code, and examples, making it suitable both for research and for building production voice experiences. ...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 12
    ElatoAI

    ElatoAI

    Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP

    ...The system integrates voice synthesis and recognition by connecting an ESP32 device through secure WebSockets to edge server functions written in Deno, allowing users to speak naturally with AI agents hosted through cloud APIs including OpenAI’s Realtime API, Gemini’s Live API, xAI’s Grok Voice Agent API, and others. It includes a web client (built with Next.js) for managing devices, controlling volume, and viewing conversation transcripts, while the hardware runs optimized firmware to deliver responses in near real time — even supporting >15-minute uninterrupted conversations.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    GetProfile

    GetProfile

    User profile and long-term memory for your AI agent

    ...The goal is to make “memory” and user understanding an infrastructure concern rather than an app-by-app feature, so teams can add continuity with minimal code changes. Because it behaves like an OpenAI-compatible gateway, it can work with multiple providers and tools that already speak that API shape.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Obot MCP Gateway

    Obot MCP Gateway

    Hosting, Registry, Gateway, and Chat Client

    ...The platform is designed to work with a variety of workflows and clients, so MCP servers managed inside Obot can be used by automation/agent frameworks as well as popular chat clients that speak MCP.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Neorg

    Neorg

    Modernity meets insane extensibility

    ...The premise is that all of these features are built on top of a single base file format (.norg), which the user only has to learn once to gain access to all of Neorg's functionality. Not only does this yield a low barrier for entry for new users it also ensures that all features are integrated with each other and speak the same underlying language. The file format is built to be expressive and easy to parse, which also makes .norg files easily usable anywhere outside of Neorg itself. A good way of thinking about Neorg is as a plaintext environment that can be adapted to a variety of use cases. If a problem can be represented using raw text, it can be solved using Neorg.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Polyglot

    Polyglot

    Cross-platform AI language practice app

    ...Users can define custom AI personas, choose languages, and configure their own OpenAI and Azure keys so they retain control over which backends they use. The app supports speech recognition with quick keyboard shortcuts, allowing learners to hold down a key to speak and release it to submit for recognition and response. It includes translation features, dark mode, playback of the user’s own recorded speech, and word highlighting that tracks the progress of synthesized audio to make following along easier. Polyglot also integrates additional AI providers, supports configurable conversation scenarios, and lets users personalize avatars, making the experience more engaging and flexible.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    Smile

    Smile

    Statistical machine intelligence and learning engine

    ...If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM languages. Data scientists and developers can speak the same language now! Smile provides hundreds advanced algorithms with clean interface. Scala API also offers high-level operators that make it easy to build machine learning apps. And you can use it interactively from the shell, embedded in Scala. The most complete machine learning engine. Smile covers every aspect of machine learning.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    Mockito

    Mockito

    Most popular Mocking framework for unit tests written in Java

    ...In particular anyone using Kotlin (which demands using mockito-inline) and PowerMock (which exacerbates the problem even more) will want to add this to all of their test classes to avoid a large memory leak. Fancy getting world-wide visibility and building up an eternal fame of an OSS contributor? Use the latest version! Hack and experiment. Speak up at the mailing list. Mockito is a mocking framework that tastes really good. It lets you write beautiful tests with a clean & simple API. Mockito doesn’t give you hangover because the tests are very readable and they produce clean verification errors.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    annyang!

    annyang!

    Speech recognition for your site

    annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Fun Audio Chat

    Fun Audio Chat

    Large Audio Language Model built for natural interactions

    Fun Audio Chat is an interactive voice-first conversational AI platform designed to let users engage in natural spoken dialogue with large language models in real time, turning speech into context-aware responses while maintaining a smooth back-and-forth experience. It combines speech recognition, audio processing, and AI generation so users can speak simply and receive spoken replies, enabling applications such as virtual assistants, voice bots, and hands-free chat interfaces. The system supports dynamic audio input and output, meaning it can handle different voices, tones, and conversational contexts without forcing users into typed interactions. With real-time streaming, it minimizes latency and delivers responses quickly, making it suitable for applications where responsiveness matters, such as interactive demos, accessibility tools, and conversational games.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Seamless Communication

    Seamless Communication

    Foundational Models for State-of-the-Art Speech and Text Translation

    ...The motivation is to move beyond “text in, text out” and enable direct, live, multi-turn exchange involving language, gesture, gaze, vision, and modality switching without user friction. The system architecture includes a real-time multimodal signal pipeline for audio, video, and sensor data, a dialog manager that can decide when to act (speak, gesture, point) or query, and a cross-modal reasoning layer that fuses perception with semantic context. The research prototype includes components for visual grounding (understanding when a user references something in view), gesture recognition and synthesis, and turn-taking mechanisms that mirror human conversational timing. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Leaf 3

    Leaf 3

    The easiest way to create clean, simple but powerful web apps and APIs

    ...Builds on top of standard PHP with a flexible learning curve + ZERO config. A rich, incrementally adoptable ecosystem of modules to build powerful apps with. Want a Leaf core team member to speak at your next event? Invite us to create a memorable and engaging experience for your attendees.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Actionhero

    Actionhero

    Actionhero is a realtime multi-transport nodejs API server

    ...Actionhero can work in a cluster to handle all the clients you can throw at it. Actionhero was built to serve the same APIs across multiple protocols. Do your games speak both HTTP and Websockets? Actionhero has got you covered. Actionhero was built from the ground up to include all the features you expect from a modern API framework.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Bot Framework Web Chat

    Bot Framework Web Chat

    A highly-customizable web-based client for Azure Bot Services

    ...This repo is part of the Microsoft Bot Framework, a comprehensive framework for building enterprise-grade conversational AI experiences. Create a bot with the ability to speak, listen, understand, and learn from your users with Azure Cognitive Services. Benefit from open source SDK and tools to build, test, and connect bots that interact naturally with users, wherever they are. Build secure, global, scalable solutions that integrate with your existing IT ecosystem. Create an AI experience that can extend your brand and keep you in control of your own data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    TEN

    TEN

    Open-source framework for conversational voice AI agents

    ...It includes a full ecosystem, TEN Turn Detection, TEN Agent, and TMAN Designer, allowing developers to rapidly assemble human-like, responsive agents that can see, speak, hear, and interact. With support for languages like Python, C++, and Go, it offers flexible deployment on both edge and cloud environments. Using components like graph-based workflow design, drag-and-drop UI (via TMAN Designer), and reusable extensions such as real-time avatars, RAG (Retrieval-Augmented Generation), and image generation, TEN enables highly customizable, scalable agent development with minimal code.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB