Search Results for "opencl voice integration"

Showing 121 open source projects for "opencl voice integration"

1

Moonshine Voice

Fast and accurate automatic speech recognition (ASR) for edge devices

Moonshine is an open-source automatic speech recognition toolkit optimized for fast and accurate transcription on edge devices and local environments. The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows. Moonshine supports multiple platforms including mobile, desktop, and embedded systems, and provides example projects to accelerate integration into real-world products. ...

Downloads: 13 This Week

Last Update: 7 days ago
2

PyOpenCL

OpenCL integration for Python, plus shiny features

PyOpenCL is a Python wrapper for the OpenCL framework, providing seamless access to parallel computing on CPUs, GPUs, and other accelerators. It enables developers to harness the full power of heterogeneous computing directly from Python, combining Python’s ease of use with the performance benefits of OpenCL. PyOpenCL also includes convenient features for managing memory, compiling kernels, and interfacing with NumPy, making it a preferred choice in scientific computing, data analysis, and...

Downloads: 0 This Week

Last Update: 2026-01-09
3

Applio

A simple, high-quality voice conversion tool focused on ease of use

Applio is a high-quality voice conversion toolkit designed to make modern RVC/VITS-based voice cloning accessible to non-experts. It focuses strongly on ease of use: installation scripts for Windows, Linux, and macOS set up dependencies and then launch a browser-based Gradio interface. Within that interface, users can train and run voice conversion models for tasks like singing conversion, speech-to-speech transformation, and voice cloning. The project is structured to be flexible through...

Downloads: 103 This Week

Last Update: 2026-02-18
4

Telegram Desktop

Telegram Desktop messaging app

Telegram Desktop is the official C++/Qt-based cross-platform client for Telegram, implementing the full Telegram API and MTProto protocol for secure messaging, voice/video calls, file sharing, and chat features. It syncs messages across devices, supports themes, stickers, and bots, and is actively maintained.

Downloads: 401 This Week

Last Update: 3 days ago
5

CosyVoice

Multi-lingual large voice generation model, providing inference

CosyVoice is a multilingual large voice generation model that offers a full-stack solution for training, inference, and deployment of high-quality TTS systems. The model supports multiple languages, including Chinese, English, Japanese, Korean, and a range of Chinese dialects such as Cantonese, Sichuanese, Shanghainese, Tianjinese, and Wuhanese. It is designed for zero-shot voice cloning and cross-lingual or mix-lingual scenarios, so a single reference voice can be used to synthesize speech across languages and in code-switching contexts. ...

Downloads: 5 This Week

Last Update: 2025-11-30
6

Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models

Qwen3-TTS is an open-source text-to-speech (TTS) project built around the Qwen3 large language model family, focused on generating high-quality, natural-sounding speech from plain text input. It provides researchers and developers with tools to transform text into expressive, intelligible audio, supporting multiple languages and voice characteristics tuned for clarity and fluidity. The project includes pre-trained models and inference scripts that let users synthesize speech locally or...

Downloads: 31 This Week

Last Update: 2026-03-17
7

VibeVoice ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

VibeVoice ComfyUI is a comprehensive wrapper that integrates Microsoft’s VibeVoice text-to-speech models directly into ComfyUI workflows. It exposes VibeVoice as a set of custom nodes so you can build single-speaker and multi-speaker voice generation pipelines visually, combining TTS with other audio or generative components. The integration supports high-quality single-speaker synthesis as well as scripted multi-speaker conversations, with optional voice cloning from audio samples for each speaker. It includes advanced control over generation parameters like attention backend, diffusion steps, sampling temperature, guidance scale, and quantization settings, allowing users to tune the trade-offs between quality, VRAM usage, and speed. ...

Downloads: 7 This Week

Last Update: 2025-11-28
8

ElevenLabs UI

Component library and custom registry built on top of shadcn/ui

...It is tightly aligned with ElevenLabs’ ecosystem, allowing seamless integration with their voice synthesis and conversational AI tools. The project also includes a CLI that simplifies the process of adding components directly into existing projects, streamlining development workflows.

Downloads: 0 This Week

Last Update: 2026-03-19
9

Open-LLM-VTuber

Open source AI VTuber platform with voice chat and Live2D avatars

Open-LLM-VTuber is an open source platform designed to create AI-powered VTuber characters that can interact with users through voice and animated avatars. It enables hands-free conversations with large language models by combining speech recognition, language processing, and text-to-speech synthesis into a single system. Users can speak directly to the AI character, and the system can respond with a generated voice while animating a Live2D avatar to simulate a talking virtual personality....

Downloads: 22 This Week

Last Update: 2026-03-17
10

Bytecoder

Framework to interpret and transpile JVM bytecode to JavaScript

Bytecoder is a rich domain model for Java bytecode and a framework to interpret and transpile it to other languages such as JavaScript, OpenCL, or WebAssembly. The JVM bytecode is parsed and transformed into an intermediate representation, which is passed through optimizer stages and sent to a backend implementation for target code generation. The JavaScript backend transforms the intermediate representation into JavaScript.

Downloads: 0 This Week

Last Update: 2024-05-08
11

TEN Framework

TEN, a voice agent framework to create conversational AI.

TEN (Transformative Extensions Network) is a voice agent framework for creating conversational AI applications, focusing on high performance and modularity.

Downloads: 0 This Week

Last Update: 2026-04-02
12

LuxTTS

A high-quality rapid TTS voice cloning model

LuxTTS is an open-source text-to-speech (TTS) system focused on delivering high-quality, rapid voice synthesis and voice cloning that runs extremely fast and efficiently on consumer hardware. It implements a lightweight architecture based on ZipVoice and optimized sampling techniques so that it can generate speech at speeds up to roughly 150 times real-time on a single GPU and faster than real-time on CPU, all while producing audio at high fidelity with 48 kHz quality. The project supports...

Downloads: 3 This Week

Last Update: 2026-03-12
13

Ultravox

Fast multimodal LLM for real-time voice interaction and AI apps

...Internally, it leverages pretrained language models and speech encoders, with a multimodal adapter that integrates both modalities for inference and training. Ultravox is optimized for low latency, achieving fast response times suitable for interactive voice agents and real-time applications. It supports use cases such as conversational AI agents, speech-to-speech translation, and analysis of spoken audio content. Ultravox also includes tooling and configuration systems for training, evaluation, and dataset integration.

Downloads: 1 This Week

Last Update: 2026-03-18
14

Pipecat

Framework for building real-time voice and multimodal AI agents

Pipecat is an open source Python framework designed for building real-time voice and multimodal conversational AI agents. It provides developers with tools to orchestrate complex pipelines that combine speech recognition, language models, audio processing, and speech synthesis into a cohesive conversational system. Pipecat focuses on low-latency interactions so voice conversations with AI feel natural and responsive during live use. Pipecat allows applications to integrate multiple AI...

Downloads: 1 This Week

Last Update: 2026-04-14
15

MegaTTS 3

Official PyTorch Implementation

MegaTTS3 is an open-source text-to-speech (TTS) and voice-cloning system from ByteDance that aims to deliver high-quality, expressive speech synthesis, including zero-shot voice cloning of previously unseen speakers. Its backbone is a lightweight diffusion-transformer (roughly 0.45B parameters), which enables efficient inference while still producing high-fidelity audio. Given a reference audio sample (and corresponding latent representation), MegaTTS3 can generate speech in the...

Downloads: 0 This Week

Last Update: 2025-12-02
16

VoxCPM

TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

VoxCPM is a tokenizer-free text-to-speech system that models speech in a continuous space, aiming for extremely realistic, context-aware synthesis and true-to-life zero-shot voice cloning. Instead of converting speech into discrete tokens, it uses an end-to-end diffusion-autoregressive architecture built on the MiniCPM-4 backbone, combining hierarchical language modeling, finite scalar quantization (FSQ), and local Diffusion Transformers. This design helps decouple semantic and acoustic...

Downloads: 40 This Week

Last Update: 2026-04-08
17

Fun Audio Chat

Large Audio Language Model built for natural interactions

Fun Audio Chat is an interactive voice-first conversational AI platform designed to let users engage in natural spoken dialogue with large language models in real time, turning speech into context-aware responses while maintaining a smooth back-and-forth experience. It combines speech recognition, audio processing, and AI generation so users can speak simply and receive spoken replies, enabling applications such as virtual assistants, voice bots, and hands-free chat interfaces. The system...

Downloads: 1 This Week

Last Update: 2026-02-27
18

ChatTTS_colab

One-click deployment (including offline integration package)

ChatTTS_colab is a wrapper project around the ChatTTS model that focuses on “one-click” deployment, especially in Google Colab. It provides an integrated offline bundle and scripts for Windows and macOS so users can run ChatTTS locally without wrestling with complex environment setup. The repository includes Colab notebooks that launch a Gradio-based web UI and expose streaming TTS, making it possible to listen to generated audio as it is produced. A distinctive feature is the “voice gacha”...

Downloads: 0 This Week

Last Update: 2025-11-28
19

XiaoZhi AI Chatbot

Build your own AI friend

xiaozhi-esp32 is an open-source project that guides users in building their own AI-powered conversational companion using the ESP32 microcontroller. The project provides detailed instructions on assembling the hardware, setting up the software, and integrating AI models to enable natural language interactions. This DIY approach offers an accessible entry point into AI and hardware development.

Downloads: 221 This Week

Last Update: 2026-04-19
20

RunAnywhere

Production ready toolkit to run AI locally

RunAnywhere SDKs are a set of cross-platform development tools that enable applications to run artificial intelligence models directly on user devices instead of relying on cloud infrastructure. The toolkit allows developers to integrate language models, speech recognition, and voice synthesis capabilities into mobile or desktop applications while keeping all computation local. By running models entirely on device, the platform eliminates network latency and protects user data because...

Downloads: 3 This Week

Last Update: 2026-04-20
21

OpenCV

Open Source Computer Vision Library

OpenCV (Open Source Computer Vision Library) is a comprehensive open-source library for computer vision, machine learning, and image processing. It enables developers to build real-time vision applications ranging from facial recognition to object tracking. OpenCV supports a wide range of programming languages including C++, Python, and Java, and is optimized for both CPU and GPU operations.

Downloads: 40 This Week

Last Update: 2025-12-31
22

Kitten TTS

State-of-the-art TTS model under 25MB

KittenTTS is an open-source, ultra-lightweight, and high-quality text-to-speech model featuring just 15 million parameters and a binary size under 25 MB. It is designed for real-time CPU-based deployment across diverse platforms. Key features: ultra-lightweight, with a model size under 25 MB; CPU-optimized, running without a GPU on any device; high-quality voices, with several premium voice options available; and fast inference, optimized for real-time speech synthesis.

Downloads: 12 This Week

Last Update: 2026-02-24
23

CallMe

Minimal plugin that lets Claude Code call you on the phone

CallMe is a minimalist plugin for Claude Code that bridges computational tasks with real-world alerts by letting the AI call your phone, smartwatch, or even a landline when an operation finishes, gets stuck, or needs a user decision. It is designed to let users start a long-running task, leave their device, and then be notified in a natural way with a phone call instead of second-guessing progress in the console. The plugin uses a local MCP server alongside a webhook tunnel (typically via...

Downloads: 1 This Week

Last Update: 2026-04-07
24

ChatOllama

ChatOllama is an open-source AI chatbot

...It goes beyond a basic chat UI by supporting a broad model ecosystem that includes OpenAI, Azure OpenAI, Anthropic, Google Gemini, Groq, Moonshot, Ollama, and other OpenAI-compatible services. The platform also includes higher-level capabilities such as AI agents, document-backed knowledge bases, real-time voice chat, and Model Context Protocol integration for external tools. Its RAG functionality allows document upload and knowledge-base-driven interaction, while vector database support adds more scalable retrieval options. Deployment is streamlined with Docker Compose, and the project also includes internationalization and modular feature toggles for controlling what parts of the system are enabled. ...

Downloads: 4 This Week

Last Update: 7 days ago
25

WhisperX

Automatic Speech Recognition with Word-level Timestamps

WhisperX is an advanced speech recognition system built on top of OpenAI’s Whisper model, designed to improve transcription accuracy and timing precision for long-form audio. It addresses key limitations of standard Whisper implementations by introducing voice activity detection and forced alignment techniques to produce word-level timestamps. The system enables batched inference, significantly increasing transcription speed while maintaining high accuracy. It is particularly effective for...

Downloads: 16 This Week

Last Update: 2026-04-06


SourceForge

© 2026 Slashdot Media. All Rights Reserved.
