Showing 31 open source projects for "text voice"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Atera - an All-in-one platform for IT management Icon
    Atera - an All-in-one platform for IT management

    Ideal for IT departments and MSPs (managed service providers)

    Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!
    Try Atera now
  • 1
    sherpa-onnx

    sherpa-onnx

    Speech-to-text, text-to-speech, and speaker recognition

    Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.
    Downloads: 118 This Week
    Last Update:
    See Project
  • 2
    RunAnywhere

    RunAnywhere

    Production ready toolkit to run AI locally

    ...It also includes integrated pipelines that combine speech-to-text, large language models, and text-to-speech into a complete conversational system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    RHVoice

    RHVoice

    Free open source speech synthesizer for Russian and other languages

    RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both...
    Downloads: 105 This Week
    Last Update:
    See Project
  • 4
    FAY

    FAY

    Framework for building AI-powered interactive digital humans and agent

    ...Fay supports various types of digital humans, including 2.5D and 3D avatars, and can be integrated with applications running on mobile devices, PCs, web platforms, and embedded systems. Its architecture allows developers to combine different AI components such as speech recognition, text-to-speech, and large language models to create conversational digital agents. Fay provides multiple interfaces for text, voice, and digital human control, enabling developers to build interactive assistants, virtual presenters, or automated service agents. It also supports custom knowledge bases and configurable behaviors so developers can tailor the personality and responses of the digital human.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    PaddleSpeech

    PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model

    PaddleSpeech is an open-source toolkit on PaddlePaddle platform for a variety of critical tasks in speech and audio, with state-of-art and influential models. Via the easy-to-use, efficient, flexible and scalable implementation, our vision is to empower both industrial application and academic research, including training, inference & testing modules, and deployment process. Low barriers to install, CLI, Server, and Streaming Server is available to quick-start your journey. We provide...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    nheko

    nheko

    Desktop client for Matrix using Qt and C++20

    Desktop client for Matrix using Qt and C++20. The motivation behind the project is to provide a native desktop app for Matrix that feels more like a mainstream chat app (Element, Telegram etc) and less like an IRC client.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Operit AI

    Operit AI

    Powerful Android AI agent with tools, automation, and Linux shell

    Operit is a full-featured AI assistant and agent platform designed specifically for Android devices, aiming to go far beyond traditional chat-based interfaces. It integrates deep system-level capabilities with a wide range of tools, allowing the AI to perform real tasks such as file management, automation, and system control directly on the device. A standout aspect of the project is its built-in Ubuntu 24 environment, which enables users to run Linux commands, scripts, and development tools...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 8
    YuE

    YuE

    Open source AI model for generating full songs from lyrics prompts

    YuE is an open source project that provides a foundation model designed for full-song music generation using artificial intelligence. It focuses on transforming text inputs such as lyrics and genre prompts into complete musical compositions that include both vocal and instrumental tracks. Unlike many shorter audio generators, the model is capable of producing songs that last several minutes while maintaining coherent musical structure and alignment with the provided lyrics. YuE introduces a...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 9
    KoboldCpp

    KoboldCpp

    Run GGUF models easily with a UI or API. One File. Zero Install.

    KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models, inspired by the original KoboldAI. It's a single self-contained distributable that builds off llama.cpp and adds many additional powerful features.
    Leader badge
    Downloads: 319 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 120 This Week
    Last Update:
    See Project
  • 11
    ViaVoip

    ViaVoip

    A portable peer to peer voice-chat/walkie-talkie.

    ViaVoip is a simple Voice Over IP application that can be used when you need to talk, chat, or send files through the internet, but you can't or don't want to make use of any third party services. Its peer to peer design allows the two end points to connect directly to each other, without any central server nor account registration. It runs on Windows, Linux, Mac OS X and Android, and is portable, that is you don't need any setup, just get a copy and run it from any storage...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    Trigger Rally

    Trigger Rally

    A fast-paced free software rally racing game

    A 3D rally simulation with a great physics engine for drifting, over 200 maps, different terrain materials like dirt, asphalt, sand, ice etc. and various weather, light and fog conditions give this rally simulation the edge over many other free games. You need to make it through the maps in often tight time limits and can further improve by beating the recorded high scores. All attached single races must be finished in time in order to win an event, unlocking additional events and cars. Most...
    Leader badge
    Downloads: 59 This Week
    Last Update:
    See Project
  • 13
    Linux Air Combat

    Linux Air Combat

    Combat Flight Simulator for LINUX. WW2. Network and VOIP. Open-source.

    WW2 Combat Flight Simulator. Free. Current Version: 9.93. This is now the world's leading open-source combat flight sim for LINUX. Linux Air Combat ("LAC") runs on almost any desktop LINUX system (even Steam Deck & Raspberry Pi) and generates smooth, high-performance flight. If it runs TOO fast on your hardware (framerates beyond 100 FPS), you should activate LAC's built-in FrameRate Limiter as documented...
    Leader badge
    Downloads: 49 This Week
    Last Update:
    See Project
  • 14
    Piper TTS

    Piper TTS

    A fast, local neural text to speech system

    ...Piper is particularly suited for offline voice assistants and embedded systems.
    Downloads: 367 This Week
    Last Update:
    See Project
  • 15
    RemoteTTS

    RemoteTTS

    Tool to remotely activate Text-To-Speech (TTS) on a server

    The tool provides a simple TCP/UDP interface to let a remote machine perform TTS outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Coqui STT

    Coqui STT

    The deep learning toolkit for speech-to-text

    ...With Coqui, dubbing is a delight. Effortlessly clone the voice of your talent into another language and let the clone do the dub. With text-to-speech, experience the immediacy of script-to-performance. Cast from a wide selection of high-quality, directable, emotive voices or clone a voice to suit your needs. With Coqui text-to-speech, production times go from months to minutes.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    qTox

    qTox

    qTox is a chat, voice, video, and file transfer IM client

    qTox is a cross-platform, secure messaging and video calling client for the Tox peer-to-peer protocol. It offers end-to-end encryption, zero metadata leakage, and no central servers, making it a privacy-focused alternative to mainstream messaging apps. qTox includes rich features like file transfers, screen sharing, and group chats, all built with a user-friendly interface powered by Qt. As a decentralized app, qTox ensures that user communication is secure, private, and independent of...
    Downloads: 22 This Week
    Last Update:
    See Project
  • 18
    Speech Recognition in English & Polish

    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Oasi -  Open Document Speaker

    Oasi - Open Document Speaker

    A simple Text2Audio

    Document Speaker - A simple Editor to give VOICE on Your Documents, save your doc as AudioBook or other format this app recognizes the language of the documents and converts them into audiobooks by recognizing texts in nearly 200 languages ... Open RTF & RTFD (mac format/inode directory) ODT,EPUB (unstable), PDF as plain Text to convert as MP4 or AudioBook. Convert Text to Voice Format: 3gp2 3GPP-2 Audio (.3g2) [Qclp,aac,aace,aacf,aach,aacl,aacp] 3gpp 3GP Audio (.3gp) [Qclp,aac,aace,aacf,aach,aacl,aacp] AIFC AIFC (.aifc,.aiff,.aif) [lpcm,ulaw,alaw,ima4,Qclp] AIFF AIFF (.aiff,.aif) [lpcm] NeXT NeXT/Sun (.snd,.au) [lpcm,ulaw] Sd2f Sound Designer II (.sd2) [lpcm] WAVE WAVE (.wav) [lpcm,ulaw,alaw] adts AAC ADTS (.aac,.adts) caff CAF (.caf) [Qclp,aac,aace,aacf,aach,aacl,aacp,alac,alaw,ilbc,ima4,lpcm,ulaw] m4af Apple MPEG-4 Audio (.m4a,.m4r) [aac,aace,aacf,aach,aacl,aacp,alac] m4bf Apple MPEG-4 AudioBooks (.m4b) [aac,aace,aacf,aach,aacl,aacp] mp4f MPEG-4 Audio (.mp4
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    IMGDissolver
    A simple to use, cross platform slideshow/divx/dvd maker for your photos and videos. Among various features: transition effects, background music, text captions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21

    VS2D

    Vietnamese Word recorder and Test To Speech Kit

    -Using: +Qt API, Qt library, QtMobility -Main Purpose: +Make Vietnamese Dub for Science Film, Video. (Do-able) +Read text file, make Audio Books. (Do-able) +Speech synthesis (! Nature Voice Undo-able, Need Instruction). +Free, and OS. -Deployment: +Currently work with Linux. +Test compile with Window, Mac (QMobility not so working, need instruction.) +The Word Recorder will be solely for Computer to make voice Data base. (In the future, it will be deploy for Android and iOS if demand). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    VoiceChatter is a free, open source, and cross platform voice chat application. It is built for gaming, so it is light weight, yet delivers excellent voice quality over minimal bandwidth and latency. For more, visit http://www.voicechatter.net
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    FLXLab is a program for running psychology experiments. Capabilities include presenting text and graphics, playing and recording sounds, and recording reaction times via the keyboard, joystick/button box, or a voice key. Windows/Mac/Linux.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    This project is a Linux client for gtalk. It supports voice and text based chat.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Virtual Reality Chat (VRC) is developed using a framework (Yaf3D) which combines robust software libraries such as OpenScenegraph, Lua, ReplicaNet, and more. VRC provides text and voice chat in a 3D environment. YAG 2002 project is their home.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo