voice-pro free download

Showing 2214 open source projects for "voice-pro"

1

Voice-Pro

Comprehensive Gradio WebUI for audio processing

Voice-Pro is the best Gradio WebUI for transcription, translation, and text-to-speech. It can be installed with one click. It creates a virtual environment using Miniconda that runs completely separate from the Windows system (fully portable). It supports real-time transcription and translation, as well as batch mode.

1 Review

Downloads: 16 This Week

Last Update: 2025-12-05
See Project
2

Voice

Minimalistic audiobook player

This is my digital playground where I am learning. I'm integrating and validating new technologies and ideas here, playing around with new UI / UX components, and developing with the best coding standard I have come up with. At the same time, I want to provide an audiobook player that is really easy to use and a joy to work with.

Downloads: 0 This Week

Last Update: 2026-02-04
See Project
3

clone-voice

A sound cloning tool with a web interface, using your voice

Clone-voice is a local voice-cloning tool that lets you synthesize speech in any target voice or convert one recording into another voice using the same timbre. It is built around Coqui’s XTTS-v2 model, so it inherits multilingual support and modern neural TTS quality while wrapping it in a user-friendly desktop workflow. The app is designed to be very easy to use: you download a precompiled package, double-click app.exe, and it launches a browser-based web interface where you control cloning and synthesis. ...

Downloads: 6 This Week

Last Update: 2025-11-28
See Project
4

Moonshine Voice

Fast and accurate automatic speech recognition (ASR) for edge devices

Moonshine is an open-source automatic speech recognition toolkit optimized for fast and accurate transcription on edge devices and local environments. The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows. Moonshine supports multiple platforms including mobile, desktop, and embedded systems, and provides example projects to accelerate integration into real-world products. ...

Downloads: 4 This Week

Last Update: 2026-03-16
See Project
5

Authenticator Pro

Two-Factor Authentication (2FA) client for Android + Wear OS

Authenticator Pro is a free and open-source two-factor authentication app for Android. It features encrypted backups, icons, categories, a high level of customization, and even a Wear OS app. Unlike some alternative apps, Authenticator Pro gives you control over your data. You can import from other apps and export at any time. Authenticator Pro is a community-backed open-source project which aims to provide privacy and security to users.

Downloads: 6 This Week

Last Update: 2026-03-24
See Project
6

Pro Git

Pro Git 2nd Edition

Welcome to the second edition of the Pro Git book. The entire Pro Git book, written by Scott Chacon and Ben Straub and published by Apress, is available here. All content is licensed under the Creative Commons Attribution Non Commercial Share Alike 3.0 license. Print versions of the book are available on Amazon.com. What is “version control”, and why should you care? Version control is a system that records changes to a file or set of files over time so that you can recall specific versions later. ...

Downloads: 4 This Week

Last Update: 2025-12-12
See Project
7

Depth Pro

Sharp Monocular Metric Depth in Less Than a Second

...Community integrations (e.g., inference wrappers and UI nodes) have sprung up around the model, reflecting practical interest in video, AR, and generative pipelines. As a general-purpose monocular depth backbone, Depth Pro slots into 3D reconstruction, relighting, and scene understanding workflows that benefit from metric predictions.

Downloads: 3 This Week

Last Update: 2025-10-08
See Project
8

TTS Voice Wizard

Speech to Text to Speech, sends text as OSC messages

...The app can translate your speech from one language to over 20 other supported languages. There are 100+ different voices with various customization options so you can pick the voice that best suits you. Display the current song you are listening to on Spotify or via your browser. Display tracker and controller battery life in conjunction with XSOverlay. Use it in conjunction with HRtoVRChat_OSC to display your heart rate in VRChat's Chatbox.

Downloads: 8 This Week

Last Update: 2025-11-02
See Project
9

Monokai Pro

Monokai Pro theme for Neovim written in Lua, with multiple filters

Beautiful functionality for professional developers.

Downloads: 1 This Week

Last Update: 2026-02-02
See Project
10

GLM-4-Voice

GLM-4-Voice | End-to-End Chinese-English Conversational Model

GLM-4-Voice is an open-source speech-enabled model from ZhipuAI, extending the GLM-4 family into the audio domain. It integrates advanced voice recognition and generation with the multimodal reasoning capabilities of GLM-4, enabling smooth natural interaction via spoken input and output. The model supports real-time speech-to-text transcription, spoken dialogue understanding, and text-to-speech synthesis, making it suitable for conversational AI, virtual assistants, and accessibility applications. ...

Downloads: 3 This Week

Last Update: 6 days ago
See Project
11

RustDesk Server PRO

Some scripts for RustDesk Server Pro are hosted here

This project provides the enterprise-grade backend for RustDesk deployments that need centralized management, stronger policy controls, and advanced operational tooling. It encompasses the components required to broker, relay, and supervise connections while adding administrative layers suitable for regulated or large environments. Organizations can standardize how clients authenticate, how sessions are permitted and audited, and how data paths are routed under diverse network constraints. ...

Downloads: 10 This Week

Last Update: 2026-03-06
See Project
12

Voicebox

The open-source voice synthesis studio powered by Qwen3-TTS

Voicebox is a local-first voice synthesis studio that aims to bring professional, DAW-like voice generation workflows to a desktop app while keeping models and voice data entirely on your machine. It positions itself as an open-source alternative to cloud voice platforms by emphasizing privacy, offline use, and freedom from subscriptions or usage caps. The tool supports downloading voice models, cloning voices from short audio samples, and generating speech locally, then organizing the results using studio-oriented editing concepts. ...

Downloads: 56 This Week

Last Update: 2026-03-17
See Project
13

IDA Pro MCP

MCP Server for IDA Pro

The IDA Pro MCP Server is a Model Context Protocol (MCP) server designed to integrate with IDA Pro, a popular disassembler and debugger. It enables AI assistants to interact with IDA Pro, facilitating tasks such as code analysis and reverse engineering.

Downloads: 6 This Week

Last Update: 2025-10-06
See Project
14

Real-Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Real-Time Voice Cloning is an influential deep-learning repository that demonstrates how to clone a voice from just a few seconds of audio and then generate arbitrary speech in that voice in near real time. It implements the SV2TTS pipeline (“Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis”) in three stages: a speaker encoder, a synthesizer, and a vocoder.

Downloads: 5 This Week

Last Update: 2026-03-09
See Project
15

Applio

A simple, high-quality voice conversion tool focused on ease of use

Applio is a high-quality voice conversion toolkit designed to make modern RVC/VITS-based voice cloning accessible to non-experts. It focuses strongly on ease of use: installation scripts for Windows, Linux, and macOS set up dependencies and then launch a browser-based Gradio interface. Within that interface, users can train and run voice conversion models for tasks like singing conversion, speech-to-speech transformation, and voice cloning.

Downloads: 88 This Week

Last Update: 2026-02-18
See Project
16

GPT-SoVITS

1 min voice data can also be used to train a good TTS model

GPT‑SoVITS is a state-of-the-art voice conversion and TTS system that enables zero‑shot and few‑shot synthesis based on a short vocal sample (e.g., 5 seconds). It supports cross‑lingual speech synthesis across English, Chinese, Japanese, Korean, Cantonese, and more. It is powered by a VITS architecture enhanced for few‑sample adaptation and real‑time usability.

Downloads: 36 This Week

Last Update: 2025-07-29
See Project
17

OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model

OpenVoice is a versatile instant voice cloning system that can replicate a speaker’s tone color from just a short audio clip and then generate speech in multiple languages. It is designed not only to match the timbre of the reference voice, but also to give granular control over style parameters such as emotion, accent, rhythm, pauses, and intonation. The model supports cross-lingual and even zero-shot cross-lingual voice cloning, so a speaker recorded in one language can be made to speak naturally in others. ...

Downloads: 16 This Week

Last Update: 2025-11-28
See Project
18

Alan AI

In-app assistant SDK to build a multimodal conversational UX for websites

Quickly add voice to your app with the Alan Platform. Create an in-app voice assistant to enable human-like conversations and provide a personalized voice experience for every user. Alan is a conversational voice AI platform that lets you create an intelligent voice assistant for your app. It offers all the necessary tools to design, embed, and host your voice solutions.

Downloads: 3 This Week

Last Update: 2026-03-24
See Project
19

Ruoyi-Vue-Pro

RuoYi-Vue-Pro is an enterprise-level, full-stack admin management system based on Spring Boot and Vue 3. It features a modular architecture with RBAC permissions, multi-tenant support, workflow engine integration, monitoring, logging, and microservice compatibility. Built for scalability and maintainability, it’s designed for Chinese-speaking developers building robust backend management systems, and includes tools for user, role, and permission management, data visualization, and more.

Downloads: 0 This Week

Last Update: 2026-01-31
See Project
20

XiaoMi Pro Hackintosh

XiaoMi NoteBook Pro Hackintosh

XiaoMi NoteBook Pro Hackintosh. If you are using a XiaoMi-Pro with an 8th Gen CPU, it is a KBL (Kaby Lake, strictly Kaby Lake Refresh) machine. If you are using a XiaoMi-Pro with a 10th Gen CPU, it is a CML (Comet Lake) machine.

Downloads: 0 This Week

Last Update: 2025-06-15
See Project
21

Gemini Next Chat

Deploy your private Gemini application for free with one click

...It is built with Next.js/TypeScript and targets developers and hobbyists who want a self-hosted solution for interacting with advanced multimodal models (text, image, voice). It supports features like image recognition, voice-based conversation, plugins (web search, ArXiv search, weather, etc.), and client apps (tray app) for greater convenience. The project emphasizes “one-click” deployment, aiming to make it easy to spin up a custom chat front end without deep infra-setup. It’s licensed under MIT and has an active community of contributors; documentation and release notes note support for newer features like mixed image+text generation. ...

Downloads: 2 This Week

Last Update: 2025-11-24
See Project
22

Telegram Desktop

Telegram Desktop messaging app

Telegram Desktop is the official C++/Qt-based cross-platform client for Telegram, implementing the full Telegram API and MTProto protocol for secure messaging, voice/video calls, file sharing, and chat features. It provides message sync across devices, supports themes, stickers, bots, and is actively maintained.

Downloads: 798 This Week

Last Update: 23 hours ago
See Project
23

UI UX Pro Max

AI skill that provides design intelligence

UI UX Pro Max is an open-source AI skill designed to provide intelligent design assistance for professional user interfaces and user experiences across web, mobile, and cross-platform frameworks. It uses an AI reasoning engine to generate complete design systems tailored to project requirements, recommending layouts, typography, colors, spacing, and component structures automatically based on natural language prompts.

Downloads: 7 This Week

Last Update: 2026-03-10
See Project
24

Vocode

Build voice-based LLM agents. Modular + open source

Vocode is an open source library that makes it easy to build voice-based LLM apps. Using Vocode, you can build real-time streaming conversations with LLMs and deploy them to phone calls, Zoom meetings, and more. You can also build personal assistants or apps like voice-based chess. Vocode provides easy abstractions and integrations so that everything you need is in a single library.

Downloads: 2 This Week

Last Update: 2025-02-05
See Project
25

Alan AI for Android

Assistant SDK to build a multimodal conversational UX for Android

Quickly add voice to your app with the Alan Platform. Create an in-app voice assistant to enable human-like conversations and provide a personalized voice experience for every user. Alan is a conversational voice AI platform that lets you create an intelligent voice assistant for your app. It offers all the necessary tools to design, embed, and host your voice solutions.

Downloads: 1 This Week

Last Update: 2024-07-01
See Project