Showing 20 open source projects for "python voice synthesis"

  • 1
    Rasa

    Open source machine learning framework to automate text conversations

    ...Rasa uses Poetry for packaging and dependency management. If you want to build it from the source, you have to install Poetry first. By default, Poetry will try to use the currently activated Python version to create the virtual environment for the current project automatically.
    Downloads: 12 This Week
  • 2
    OpenAI Agents JS

    A lightweight, powerful framework for multi-agent workflows

    ...The repo includes examples showing how to build agents that call local functions, chain between agents, validate input/output, stream responses, and interact in real time (e.g. voice agents via WebRTC). It also has tracing and debugging support so you can introspect how agents executed their workflows. Because it aligns closely with the Python Agents SDK, it aims for cross-language parity so that JS/TS devs can adopt similar agent architectures.
    Downloads: 8 This Week
  • 3
    FuseSoC

    Package manager and build abstraction tool for FPGA/ASIC development

    FuseSoC is a package manager and build abstraction tool for hardware description language (HDL) code, aimed at simplifying the development and reuse of IP cores. It provides a standardized way to describe, manage, and build hardware projects, facilitating collaboration and reducing duplication of effort in FPGA and ASIC development.
    Downloads: 0 This Week
  • 4
    XLS

    XLS: Accelerated HW Synthesis

    XLS is an open-source toolkit for building high-level hardware with a modern compiler stack that spans from a functional DSL to optimized IR and hardware generation. At the front end, DSLX lets you describe algorithms with strong typing and familiar control flow while remaining synthesis-friendly. The compiler lowers DSLX into a rich intermediate representation, applies aggressive optimization and scheduling passes, and can either JIT the design for software simulation or emit Verilog for...
    Downloads: 0 This Week
  • 5
    Imagen - Pytorch

    Implementation of Imagen, Google's Text-to-Image Neural Network

    Implementation of Imagen, Google's Text-to-Image Neural Network that beats DALL-E2, in PyTorch. It is the new SOTA for text-to-image synthesis. Architecturally, it is actually much simpler than DALL-E2. It consists of a cascading DDPM conditioned on text embeddings from a large pre-trained T5 model (attention network). It also contains dynamic clipping for improved classifier-free guidance, noise level conditioning, and a memory-efficient U-Net design. It appears neither CLIP nor prior...
    Downloads: 6 This Week
  • 6
    Eris

    A NodeJS Discord library

    A Node.js wrapper for interfacing with Discord. You will need Node.js 10.4+. If you need voice support, you will also need Python 2.7 and a C++ compiler. Create a directory for your bot, and change to that directory in your command line. If you want more recent updates (at the expense of stability), you can install the beta builds instead. Eris supports a few optional libraries that could potentially improve bot performance but may require additional dependencies.
    Downloads: 2 This Week
  • 7
    CO3D (Common Objects in 3D)

    Tooling for the Common Objects In 3D dataset

    CO3Dv2 (Common Objects in 3D, version 2) is a large-scale 3D computer vision dataset and toolkit from Facebook Research designed for training and evaluating category-level 3D reconstruction methods using real-world data. It builds upon the original CO3Dv1 dataset, expanding both scale and quality—featuring 2× more sequences and 4× more frames, with improved image fidelity, more accurate segmentation masks, and enhanced annotations for object-centric 3D reconstruction. CO3Dv2 enables research...
    Downloads: 0 This Week
  • 8
    DeepMind Research

    Implementations and code to accompany DeepMind publications

    ...The codebase is primarily Jupyter Notebooks and Python, reflecting an emphasis on experimentation and pedagogy rather than production packaging.
    Downloads: 0 This Week
  • 9
    NeuMan

    Neural Human Radiance Field from a Single Video (ECCV 2022)

    NeuMan is a reference implementation that reconstructs both an animatable human and its background scene from a single monocular video using neural radiance fields. It supports novel view and novel pose synthesis, enabling compositional results like transferring reconstructed humans into new scenes. The pipeline separates human/body and environment, learning consistent geometry and appearance to support animation. Demos showcase sequences such as dance and handshake, and the code provides...
    Downloads: 0 This Week
  • 10
    SVoice (Speech Voice Separation)

    We provide a PyTorch implementation of the paper Voice Separation

    SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple...
    Downloads: 0 This Week
  • 11
    Triton

    Triton is a dynamic binary analysis library

    Triton is a dynamic binary analysis library. It provides internal components that allow you to build your own program analysis tools, automate reverse engineering, perform software verification, or just emulate code. The library is compatible with Linux, Windows, and OS X. You can compile the source code directly from our GitHub repository. One way to get started is to look at our Python examples, which are already bootstrapped.
    Downloads: 0 This Week
  • 12
    Nerfies

    This is the code for Deformable Neural Radiance Fields

    Nerfies demonstrates deformation-aware neural radiance fields that reconstruct and render dynamic, real-world scenes from casual video. Instead of assuming a static world, the method learns a canonical space plus a deformation field that maps changing poses or expressions back to that space during training. This lets the system generate photorealistic novel views of nonrigid subjects—faces, bodies, cloth—while preserving fine detail and consistent lighting. The training pipeline handles...
    Downloads: 0 This Week
  • 13
    SkyWater PDK

    Open source process design kit for usage with SkyWater Technology

    The SkyWater PDK is the first broadly available open-source process design kit for a commercial-grade CMOS node, enabling researchers, startups, and students to design real ASICs without proprietary NDAs. It provides the essential artifacts for digital and analog flows: SPICE models, DRC/LVS rules, extraction decks, and technology files for open tools like Magic and KLayout. Standard-cell libraries and IO pads are included so digital designers can use open synthesis and place-and-route to...
    Downloads: 5 This Week
  • 14
    Rasa Core

    Rasa Core is now part of the Rasa repo

    Rasa is an open source machine learning framework to automate text and voice-based conversations. Rasa helps you build contextual assistants capable of having layered conversations with lots of back-and-forth. In order for a human to have a meaningful exchange with a contextual assistant, the assistant needs to be able to use context to build on things that were previously discussed – Rasa enables you to build assistants that can do this in a...
    Downloads: 1 This Week
  • 15
    Marsyas (Music Analysis, Retrieval and Synthesis for Audio Signals) is a framework for developing systems for audio processing. It provides a general architecture for connecting audio, sound files, signal processing blocks, and machine learning. The source code at SF is outdated! Marsyas is now hosted at GitHub: https://github.com/marsyas/marsyas. Downloads are now provided at Bintray: https://bintray.com/marsyas
    Downloads: 0 This Week
  • 16
    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
  • 17
    VoiceCode is an open source initiative started by the National Research Council of Canada to develop a programming-by-voice toolbox. The aim of the project is to make programming through voice input as easy and productive as with mouse and keyboard. To install, use Subversion as described in the installation manual (http://sourceforge.net/apps/mediawiki/voicecode/index.php?title=VCode_1_Doc/InstallationManual).
    Downloads: 0 This Week
  • 18
    Virtual News Reader is a desktop application that converts text (ideally online news) into a virtual voice that can be saved on the computer. The project uses Java (JSAPI), FreeTTS (text-to-speech synthesis), DJProject, Substance, and other libraries.
    Downloads: 0 This Week
  • 19
    Talking Computer Control Center is a speaking interface for web browsers, file downloads, and speech synthesis.
    Downloads: 0 This Week
  • 20
    A modular audio programming language, designed for writing applications quickly. Its main goal is real-time audio processing, but it can be used for any kind of development.
    Downloads: 0 This Week