Showing 22 open source projects for "ai voice generator"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    Operit AI

    Operit AI

    Powerful Android AI agent with tools, automation, and Linux shell

    Operit is a full-featured AI assistant and agent platform designed specifically for Android devices, aiming to go far beyond traditional chat-based interfaces. It integrates deep system-level capabilities with a wide range of tools, allowing the AI to perform real tasks such as file management, automation, and system control directly on the device. A standout aspect of the project is its built-in Ubuntu 24 environment, which enables users to run Linux commands, scripts, and development tools...
    Downloads: 29 This Week
    Last Update:
    See Project
  • 2
    AIGCPanel

    AIGCPanel

    One-stop AI digital human system with video voice synthesis tools

    AIGCPanel is an open source desktop application designed as a comprehensive, all-in-one platform for creating AI-powered digital humans and media content. It integrates multiple capabilities such as video synthesis, voice synthesis, and voice cloning into a unified interface, allowing users to generate realistic audiovisual outputs with minimal setup. AIGCPanel focuses heavily on simplifying the management of local AI models by providing streamlined workflows for importing, configuring, and running different models with minimal manual effort. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Open-LLM-VTuber

    Open-LLM-VTuber

    Open source AI VTuber platform with voice chat and Live2D avatars

    Open-LLM-VTuber is an open source platform designed to create AI-powered VTuber characters that can interact with users through voice and animated avatars. It enables hands-free conversations with large language models by combining speech recognition, language processing, and text-to-speech synthesis into a single system. Users can speak directly to the AI character, and the system can respond with a generated voice while animating a Live2D avatar to simulate a talking virtual personality. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    FastRTC

    FastRTC

    The python library for real-time communication

    ...It abstracts away much of the complexity that typically comes with implementing WebRTC by providing a simple interface — e.g. a Stream class — that can be mounted within a web backend (for example a FastAPI application). This makes it particularly well suited for building real-time voice (or video) interfaces for applications such as AI assistants, live chat, or collaborative audio/video tools. FastRTC also integrates nicely with UI frameworks (e.g. via a web demo using Gradio), so developers can rapidly prototype and deploy real-time streaming applications without deep knowledge of low-level WebRTC internals. Because voice-enabled AI agents often involve many moving parts (speech-to-text, text processing, text-to-speech, streaming, session/chat management), FastRTC helps by handling the streaming aspect, leaving the rest to be plugged in modularly.
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Short Video Factory

    Short Video Factory

    AI tool for automatic batch short video creation and editing

    Short Video Factory is an open source desktop application designed to simplify the creation of short-form videos using AI-driven automation. It enables users to generate product marketing clips and general content videos by combining simple prompt-based input with pre-prepared media assets. Short Video Factory integrates multiple stages of video production, including script generation, voice synthesis, video editing, and subtitle effects, into a single streamlined workflow. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    myGPTReader

    myGPTReader

    AI Slack bot for reading, summarizing, and chatting with content

    myGPTReader is an AI-powered Slack bot designed to help users read, summarize, and interact with various types of digital content through conversational interfaces. It enables users to quickly understand web pages, documents, and even video content by transforming them into interactive discussions rather than static reading experiences. myGPTReader supports a wide range of file formats, including eBooks, PDFs, and text-based documents, making it flexible for both casual and professional use cases. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    FAY

    FAY

    Framework for building AI-powered interactive digital humans and agent

    Fay is an open source framework designed to build and deploy interactive digital humans powered by large language models. It acts as a middleware layer that connects digital character technologies with conversational AI systems and business applications. Fay supports various types of digital humans, including 2.5D and 3D avatars, and can be integrated with applications running on mobile devices, PCs, web platforms, and embedded systems. Its architecture allows developers to combine different AI components such as speech recognition, text-to-speech, and large language models to create conversational digital agents. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    App Store Screenshot Generator

    App Store Screenshot Generator

    Create screenshots for the iOS App Store

    AppScreen is a free and open-source utility designed to help developers and designers produce polished App Store screenshots for iOS apps without needing expensive graphic design software or manual layout work. It runs entirely in a browser or local environment and includes intuitive controls for backgrounds, text overlays, and device mockups, letting users compose compelling marketing visuals that meet Apple’s official size guidelines. Users can toggle between multiple device models and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    PyGPT

    PyGPT

    Open source personal AI Assistant for Linux, Windows and Mac

    PyGPT is a desktop application that allows you to talk to OpenAI's LLM models such as GPT4 and GPT3 using your own computer and OpenAI API. It allows you to talk in chat mode and in completion mode, as well as generate images using DALL-E 2. PyGPT also adds access to the Internet for GPT via Google Custom Search API and Wikipedia API and includes voice synthesis using Microsoft Azure Text-to-Speech API. Moreover, the application has implemented context memory support, context storage,...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 10
    WanGP

    WanGP

    AI video generator optimized for low VRAM and older GPUs use

    Wan2GP is an open source AI video generation toolkit designed to make modern generative models accessible on consumer-grade hardware with limited GPU memory. It acts as a unified interface for running multiple video, image, and audio generation models, including Wan-based models as well as other systems like Hunyuan Video, Flux, and Qwen. A key focus of the project is reducing VRAM requirements, enabling some workflows to run on as little as 6 GB while still supporting older Nvidia and...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 11
    Voice Accounting For Blind & Mute People

    Voice Accounting For Blind & Mute People

    Free & Easy AI Voice Accounting Software For Blind & Speechless People

    Just download the above zip file, extract it and then open the index.html file on internet browsers like Firefox ( preferable ) or Google Chrome. Also, please view and download my full collection of softwares for people with disabilities, here : https://sourceforge.net/projects/softwares-for-disabled-people/ This full collection also includes the Voice Accounting Software as well.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Harbor LLM

    Harbor LLM

    Run a full local LLM stack with one command using Docker

    ...Harbor supports multiple inference engines, including llama.cpp and vLLM, and connects them seamlessly to user interfaces. It also includes tools for web retrieval, image generation, voice interaction, and workflow automation. Built on Docker, Harbor allows services to run in isolated containers while communicating over a local network. It is intended for local development and experimentation rather than production deployment, giving developers a flexible way to explore AI systems, test configurations, and manage complex LLM stacks without manual wiring or setup overhead.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    comfyui-mixlab-nodes

    comfyui-mixlab-nodes

    Workflow and speech recognition app

    ...The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that mix live screen content, generative models, and visual effects. For audio and speech, it provides nodes for SpeechRecognition and SpeechSynthesis, plus workflows that combine voice generation with real-time face swapping and other audio-visual effects. On the AI side, it integrates multiple LLM providers (cloud and local), supports OpenAI-compatible endpoints, Siliconflow models, and includes prompt-focused utilities for random prompt generation, Chinese prompts, clip interrogation.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14

    PowerPoint Generator

    PowerPoint Generator: Your Gateway to Effortless Presentations

    Dive into the ease of presentation creation with PowerPoint Generator, a smart web application crafted to automate PowerPoint presentations! 🌟 This project is ripe for growth and eager for your contributions. Let's enhance its capability together!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    KoboldCpp

    KoboldCpp

    Run GGUF models easily with a UI or API. One File. Zero Install.

    KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models, inspired by the original KoboldAI. It's a single self-contained distributable that builds off llama.cpp and adds many additional powerful features.
    Leader badge
    Downloads: 390 This Week
    Last Update:
    See Project
  • 16
    Softwares For Blind, Deaf, Handicap

    Softwares For Blind, Deaf, Handicap

    Easy AI Softwares for Blind, Deaf, Handicapped, Disabled People

    Just download the above zip file, extract it first and then open the index.html file on internet browsers like Firefox ( preferable ) or Google Chrome. Also, keep NumLock ON while using the Numeric Keypad of any Keyboard. Can also attach an external USB keyboard, with seperate Numeric Keypad, if required. I have added some general guidelines for students, using these softwares, on the Wiki Page of this website. Please refer them for more instructions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    SEO Tool

    Seo Tool: AI Autoblogger for ArtikelSchreiber.com and UNAIQUE.net

    Seo Tool: AI Autoblogger for https://www.artikelschreiber.com/ and https://www.unaique.net/ Function: Create Mini site based on Config File on your server https://linktr.ee/textgenerator Use a Shared Hosting Server Change the "seo-marketing-tool.conf" config to fit your needs (eg: Create a Mini Site about "Cars" then change the AI Software API Keywords to "car") Start the Tool "python3 seo-marketing-tool.py" Your mini site will be created on your Shared Hosting Server HTML5...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    A Netflix film cover generator Nuxt.js

    A Netflix film cover generator Nuxt.js

    A tool for generating Netflix show image

    We love Netflix, but we love memes even more. We thought that helping Netflix on their UI/UX testing with a tool that can create show images easily with an export function to png. A tool for generating Netflix shows an image. You can visit the demo website hosted on Netlify. This is an open-source tool and it is available on Github. On this tool you have a full editable canvas where you can edit content, text position, text dimension, gradient position and change the background image. In...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    StarkOX

    StarkOX

    Open Source Robot Operating System

    StarkOX, the first universal Open Source AI Robot Operating System. It provides easy to use hardware abstraction and commonly-used functionality. Furthermore, strong cognitive functions, voice and pattern detection together with a new powerful scripting language provide possibilities to create and share abstract behavior patterns.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    MultiPlink

    MultiPlink

    [UPDATE 28-10-2015] FINAL MULTIPLINK SSH LIFETIME v2.0 UP-5 "FAREWELL"

    Download Link Utama: http://sourceforge.net/projects/multiplink/files/Final_MultiPlink_SSHLIFETIME_2.0_UP-5_FAREWELL.rar/download Jika di Halaman Download, Link belum tersedia, klik link alternatif : https://www.dropbox.com/s/znptxp2gxhihpax/Final_MultiPlink_SSHLIFETIME_2.0_UP-5_FAREWELL.rar?dl=0 Lalu Download & Install juga Minor...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Asterisk AD integration \ VQ monitoring
    The project is a preconfigured VoIP PBX VM Image based on Asterisk. But provides more advance features, such as LDAP integration, VQ monitoring via RTCP XR reports, LDAP user login, integrated billing system and telephone directory PDF generator.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Text Image Creator Kovalenko

    Text Image creator mobile app

    Text Image Creator Kovalenko is the ultimate text generation mobile app, designed to streamline your content creation process and inspire your creativity. Whether you're a writer, marketer, student, or simply someone looking to spice up your social media posts, Text Image Creator Kovalenko is your go-to tool for generating high-quality and engaging text content on the fly. Content Marketing: Craft compelling blog posts, social media updates, and email newsletters effortlessly. Academic...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB