Showing 31 open source projects for "python voice synthesis"

View related business solutions
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    Rasa

    Rasa

    Open source machine learning framework to automate text conversations

    ...Rasa uses Poetry for packaging and dependency management. If you want to build it from the source, you have to install Poetry first. By default, Poetry will try to use the currently activated Python version to create the virtual environment for the current project automatically.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 2
    OpenAI Agents JS

    OpenAI Agents JS

    A lightweight, powerful framework for multi-agent workflows

    ...The repo includes examples showing how to build agents that call local functions, chain between agents, validate input/output, stream responses, and interact in real time (e.g. voice agents via WebRTC). It also has tracing and debugging support so you can introspect how agents executed their workflows. Because it aligns closely with the Python Agents SDK, it aims for cross-language parity so that JS/TS devs can adopt similar agent architectures.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    XLS

    XLS

    XLS: Accelerated HW Synthesis

    XLS is an open-source toolkit for building high-level hardware with a modern compiler stack that spans from a functional DSL to optimized IR and hardware generation. At the front end, DSLX lets you describe algorithms with strong typing and familiar control flow while remaining synthesis-friendly. The compiler lowers DSLX into a rich intermediate representation, applies aggressive optimization and scheduling passes, and can either JIT the design for software simulation or emit Verilog for...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    FuseSoC

    FuseSoC

    Package manager and build abstraction tool for FPGA/ASIC development

    FuseSoC is a package manager and build abstraction tool for hardware description language (HDL) code, aimed at simplifying the development and reuse of IP cores. It provides a standardized way to describe, manage, and build hardware projects, facilitating collaboration and reducing duplication of effort in FPGA and ASIC development. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    Imagen - Pytorch

    Imagen - Pytorch

    Implementation of Imagen, Google's Text-to-Image Neural Network

    Implementation of Imagen, Google's Text-to-Image Neural Network that beats DALL-E2, in Pytorch. It is the new SOTA for text-to-image synthesis. Architecturally, it is actually much simpler than DALL-E2. It consists of a cascading DDPM conditioned on text embeddings from a large pre-trained T5 model (attention network). It also contains dynamic clipping for improved classifier-free guidance, noise level conditioning, and a memory-efficient unit design. It appears neither CLIP nor prior...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    Eris

    Eris

    A NodeJS Discord library

    A Node.js wrapper for interfacing with Discord. You will need NodeJS 10.4+. If you need voice support you will also need Python 2.7 and a C++ compiler. Create a directory for your bot, and change to that directory in your command line. If you want to be more updated (at the expense of stability), you can install the beta builds instead. Eris supports a few optional libraries that could potentially improve bot performance but may require additional dependencies.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    CO3D (Common Objects in 3D)

    CO3D (Common Objects in 3D)

    Tooling for the Common Objects In 3D dataset

    CO3Dv2 (Common Objects in 3D, version 2) is a large-scale 3D computer vision dataset and toolkit from Facebook Research designed for training and evaluating category-level 3D reconstruction methods using real-world data. It builds upon the original CO3Dv1 dataset, expanding both scale and quality—featuring 2× more sequences and 4× more frames, with improved image fidelity, more accurate segmentation masks, and enhanced annotations for object-centric 3D reconstruction. CO3Dv2 enables research...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    VoiceClip

    VoiceClip

    VoiceClip es una aplicación de asistencia a usuarios

    VoiceClip es una aplicación de asistencia a usuarios diseñada para integrarse de manera fluida en su entorno de trabajo, proporcionando un acceso rápido y eficiente a diversas funcionalidades mediante comandos de voz y texto. Presentada como una barra de herramientas que permanece siempre visible en primer plano, VoiceClip busca simplificar tareas comunes, mejorar la productividad y facilitar la interacción con su sistema operativo y con tecnologías avanzadas de inteligencia artificial
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    DeepMind Research

    DeepMind Research

    Implementations and code to accompany DeepMind publications

    ...The codebase is primarily Jupyter Notebooks and Python, reflecting an emphasis on experimentation and pedagogy rather than production packaging.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 10
    NeuMan

    NeuMan

    Neural Human Radiance Field from a Single Video (ECCV 2022)

    NeuMan is a reference implementation that reconstructs both an animatable human and its background scene from a single monocular video using neural radiance fields. It supports novel view and novel pose synthesis, enabling compositional results like transferring reconstructed humans into new scenes. The pipeline separates human/body and environment, learning consistent geometry and appearance to support animation. Demos showcase sequences such as dance and handshake, and the code provides...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Kalliope

    Kalliope

    Kalliope is a framework to create your own personal assistant

    Kalliope is a modular always-on voice-controlled personal assistant designed for home automation. Kalliope is a framework that will help you to create your own personal assistant. The concept is to create the brain of your assistant by attaching an input signal (vocal order, scheduled event, MQTT message, GPIO event, etc..) to one or multiple actions called neurons. You can create your own Kalliope bot, by simply choosing and composing the existing neurons without writing any code. But, if...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    tom_core

    tom_core

    tom_core - a tool for automating events on a computer

    tom_core is a software tool used for the automation of everything that happens on your computer. By using this application, you can easily record your activity on your computer, starting the recording at any moment that you choose. The application repeats all your clicks or drags, keystrokes, hotkeys, etc. All in exactly the timing and number of repetitions you need. The toolbox such as the optical recognition and voice control enables to branch out the recordings into complex forms, with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SVoice (Speech Voice Separation)

    SVoice (Speech Voice Separation)

    We provide a PyTorch implementation of the paper Voice Separation

    SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Triton

    Triton

    Triton is a dynamic binary analysis library

    Triton is a dynamic binary analysis library. It provides internal components that allow you to build your program analysis tools, automate reverse engineering, perform software verification or just emulate code. Linux, Windows and OS X compatible. You can directly compile source code from our Github repository. The Triton library is Linux, Windows and OS X compatible. A potential way to getting started is to see our Python examples already bootstrapped.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Spleeter

    Spleeter

    Deezer source separation library including pretrained models

    Spleeter is the Deezer source separation library with pretrained models written in Python and using Tensorflow. It makes it easy to train music source separation models (assuming you have a dataset of isolated sources), and provides already trained state of the art models for performing various flavours of separation. 2 stems and 4 stems models have state of the art performances on the musdb dataset. Spleeter is also very fast as it can perform separation of audio files to 4 stems 100x...
    Downloads: 40 This Week
    Last Update:
    See Project
  • 16
    Nerfies

    Nerfies

    This is the code for Deformable Neural Radiance Fields

    Nerfies demonstrates deformation-aware neural radiance fields that reconstruct and render dynamic, real-world scenes from casual video. Instead of assuming a static world, the method learns a canonical space plus a deformation field that maps changing poses or expressions back to that space during training. This lets the system generate photorealistic novel views of nonrigid subjects—faces, bodies, cloth—while preserving fine detail and consistent lighting. The training pipeline handles...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    SkyWater PDK

    SkyWater PDK

    Open source process design kit for usage with SkyWater Technology

    The SkyWater PDK is the first broadly available open-source process design kit for a commercial-grade CMOS node, enabling researchers, startups, and students to design real ASICs without proprietary NDAs. It provides the essential artifacts for digital and analog flows: SPICE models, DRC/LVS rules, extraction decks, and technology files for open tools like Magic and KLayout. Standard-cell libraries and IO pads are included so digital designers can use open synthesis and place-and-route to...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    captcha_break

    captcha_break

    Identification codes

    This project will use Keras to build a deep convolutional neural network to identify the captcha verification code. It is recommended to use a graphics card to run the project. The following visualization codes are jupyter notebookall done in . If you want to write a python script, you can run it normally with a little modification. Of course, you can also remove these visualization codes. captcha is a library written in python to generate verification codes. It supports image verification codes and voice verification codes. We use its function of generating image verification codes. First, we set our verification code format to numbers and capital letters, and generate a string of verification codes. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    Rasa Core

    Rasa Core

    Rasa Core is now part of the Rasa repo

    Rasa is an open source machine learning framework to automate text and voice-based conversations. With Rasa, you can build contextual assistants. Rasa helps you build contextual assistants capable of having layered conversations with lots of back-and-forth. In order for a human to have a meaningful exchange with a contextual assistant, the assistant needs to be able to use context to build on things that were previously discussed – Rasa enables you to build assistants that can do this in a...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Marsyas (Music Analysis, Retrieval and Synthesis for Audio Signals) is a framework for developing systems for audio processing. It provides an general architecture for connecting audio, soundfiles, signal processing blocks and machine learning. Source code at SF is outdated! Marsyas is now hosted at GitHub: https://github.com/marsyas/marsyas Downloads are now provided at Bintray: https://bintray.com/marsyas
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    InproTK

    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The aim of the project is to make programming through voice input as easy and productive as with mouse and keyboard. For install, Use subversion, as described in this page: http://sourceforge.net/apps/mediawiki/voicecode/index.php?title=VCode_1_Doc/InstallationManual.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Lioness (Languages Interop Framework)
    Framework for making Windows applications that are one .exe file in AutoHotKey_L,C++,C#, VB.NET,Java,Groovy,Common Lisp,Nemerle,Ruby,Python,PHP,Lua,Tcl,Perl,Jint,S#,WSH VBScript,HTML/JavaScript/CSS,COM, PowerShell without compiling . For .NET 4.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Synthesia is an open hardware/software platform intended for creating standalone audio devices such as synthesizers on embedded processors.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25
    Virtual News Reader is a computer desktop application that can convert text (idealy for online news) into Virtual Voice and can be saved on computer. The project is using Java, (JSAPI), FreeTTS (Text-To-Speech synthesis), DJProject, Substance, & other.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB