25940 projects for "java-tts" with 1 filter applied:

  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • 1
    DragonianVoice

    DragonianVoice

    C++ inference library for multiple SVC/TTS

    ...It focuses on being a reusable native library rather than a full UI product, with bindings for C, C++, and C# so it can be embedded into other applications or engines. The project supports a wide range of model families: TTS models such as Tacotron2, VITS, EmotionalVITS, BERTVits2, GPT-SoVITS, SVC systems like SoVitsSvc (v2/v3/v4), RVC, DiffSvc, DiffusionSvc, FishDiffusion, ReflowSvc, and even singing systems like DiffSinger and related pitch/feature extractors like FCPE and RMVPE. It uses ONNX Runtime and other backends to accelerate inference, with notes on how different execution providers such as CUDA or DirectML affect operator support and numerical stability. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    MARS5

    MARS5

    MARS5 speech model (TTS) from CAMB.AI

    MARS5-TTS is CAMB.AI’s open-source English speech model designed for high-quality text-to-speech and voice emulation. It uses a two-stage architecture that combines an autoregressive (AR) model with a non-autoregressive (NAR) model, giving it both expressiveness and speed. The model is built to handle prosodically challenging content such as sports commentary, anime dialogue, and other high-energy or highly varied speech patterns with realistic rhythm and intonation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    StyleTTS 2

    StyleTTS 2

    Towards Human-Level Text-to-Speech through Style Diffusion

    ...StyleTTS2 supports both single-speaker and multi-speaker configurations, with the ability to sample or transfer styles from reference audio, making it powerful for expressive TTS and character voices. The repository includes training scripts, configuration files, and pre-trained auxiliary modules such as a text aligner, pitch extractor, and PL-BERT-based linguistic encoder.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    DSA Bootcamp Java

    DSA Bootcamp Java

    This repository consists of the code samples, assignments, and notes

    DSA Bootcamp Java is an open source educational repository created by Kunal Kushwaha to teach Data Structures and Algorithms (DSA) using Java. It is designed as a structured bootcamp, covering fundamental concepts to advanced problem-solving techniques. The project provides explanations, exercises, assignments, and practice problems, making it useful for both beginners and intermediate learners who want to strengthen their Java and algorithmic skills.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Incredable is the first DLT-secured platform that allows you to save time, eliminate errors, and ensure your organization is compliant all in one place. Icon
    Incredable is the first DLT-secured platform that allows you to save time, eliminate errors, and ensure your organization is compliant all in one place.

    For healthcare Providers and Facilities

    Incredable streamlines and simplifies the complex process of medical credentialing for hospitals and medical facilities, helping you save valuable time, reduce costs, and minimize risks. With Incredable, you can effortlessly manage all your healthcare providers and their credentials within a single, unified platform. Our state-of-the-art technology ensures top-notch data security, giving you peace of mind.
    Learn More
  • 5
    simple-java-maven-app

    simple-java-maven-app

    For an introductory tutorial on how to use Jenkins

    simple-java-maven-app is a reference repository created for the Jenkins tutorial on building a Java application with Maven. It contains a minimal Java program that prints “Hello world!” and includes unit tests to validate functionality, with test results exported in JUnit XML format. The repository provides an example that demonstrates how to configure a Jenkins Pipeline for building and delivering applications.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    gTTS

    gTTS

    Python library and CLI tool to interface with Google Translate

    gTTS (Google Text-to-Speech) is a Python library and command-line tool that wraps the speech functionality of Google Translate. It lets you send text to the Google Translate TTS endpoint and receive spoken audio back as MP3 data, either written to a file, a file-like object, or standard output. The library is designed to handle long texts, using a speech-specific sentence tokenizer that keeps intonation and punctuation natural while splitting requests into acceptable chunks. It supports customizable text pre-processors, which can correct pronunciations, tweak formatting, or handle domain-specific vocabulary before sending it to the API. gTTS is primarily aimed at developers who want a quick way to add cloud-backed speech to scripts, apps, or pipelines without managing any model weights locally. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    VoxCPM

    VoxCPM

    TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

    VoxCPM is a tokenizer-free text-to-speech system that models speech in a continuous space, aiming for extremely realistic, context-aware synthesis and true-to-life zero-shot voice cloning. Instead of converting speech into discrete tokens, it uses an end-to-end diffusion-autoregressive architecture built on the MiniCPM-4 backbone, combining hierarchical language modeling, finite scalar quantization (FSQ), and local Diffusion Transformers. This design helps decouple semantic and acoustic...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8
    EasyVoice

    EasyVoice

    Open source text-to-speech tool, supports extra-long text

    ...It offers streaming playback so audio starts almost immediately, even for very long inputs, and automatically generates subtitle files suitable for video production or translation workflows. Under the hood, easyVoice uses a modern stack with Vue 3 and Element Plus on the front end, Node.js and Express on the back end, and TTS engines such as Microsoft Azure TTS and OpenAI-compatible APIs, orchestrated through ffmpeg.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    CosyVoice

    CosyVoice

    Multi-lingual large voice generation model, providing inference

    CosyVoice is a multilingual large voice generation model that offers a full-stack solution for training, inference, and deployment of high-quality TTS systems. The model supports multiple languages, including Chinese, English, Japanese, Korean, and a range of Chinese dialects such as Cantonese, Sichuanese, Shanghainese, Tianjinese, and Wuhanese. It is designed for zero-shot voice cloning and cross-lingual or mix-lingual scenarios, so a single reference voice can be used to synthesize speech across languages and in code-switching contexts. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • G-P - Global EOR Solution Icon
    G-P - Global EOR Solution

    Companies searching for an Employer of Record solution to mitigate risk and manage compliance, taxes, benefits, and payroll anywhere in the world

    With G-P's industry-leading Employer of Record (EOR) and Contractor solutions, you can hire, onboard and manage teams in 180+ countries — quickly and compliantly — without setting up entities.
    Learn More
  • 10
    30 Seconds of Java

    30 Seconds of Java

    Collection of reusable tested Java 21 compatible code snippets

    30 Seconds of Java is a curated collection of short, focused Java code snippets intended to teach small concepts quickly, similar to the popular “30 seconds of code” format in other languages. Each snippet is designed to be read and understood in under a minute, showing a single idea such as working with streams, handling optionals, manipulating collections, or doing common utility tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Serverless Java container

    Serverless Java container

    A Java wrapper to run Spring, Spring Boot, Jersey, and other apps

    The AWS Serverless Java Container library is a framework that allows developers to run existing or new Java web applications—built with frameworks such as Spring, Jersey, Spark, Struts—inside AWS Lambda with minimal modifications. It bridges the gap between traditional servlet or web-framework models and serverless functions by mapping HTTP events from API Gateway into requests your framework understands and routing responses back appropriately.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Dia

    Dia

    A TTS model capable of generating ultra-realistic dialogue

    Dia is a neural text-to-speech model designed specifically for generating ultra-realistic dialogue in a single pass. Instead of focusing on isolated sentences or flat narration, it is optimized for conversational audio, complete with natural turn-taking, prosody, and pacing. The model can be conditioned on a reference audio sample, allowing you to control emotion, tone, and other stylistic aspects of the speech. It can also produce nonverbal vocalizations like laughter, coughs, clearing the...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    comfyui-mixlab-nodes

    comfyui-mixlab-nodes

    Workflow and speech recognition app

    comfyui-mixlab-nodes is a large collection of custom nodes for ComfyUI that turns workflows into interactive apps and adds real-time multimedia, LLM, and TTS capabilities. It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that mix live screen content, generative models, and visual effects. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Google Cloud Java Client Libraries

    Google Cloud Java Client Libraries

    Google Cloud Client Library for Java

    The Cloud Client Libraries are the recommended way to access Google Cloud APIs programmatically. The Cloud Client Libraries support accessing Google Cloud services in a way that significantly reduces the boilerplate code you have to write. The libraries provide high-level API abstractions so they're easier to understand. They embrace idioms of the language, work well with the standard library, and integrate better with your codebase.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    AudioLM - Pytorch

    AudioLM - Pytorch

    Implementation of AudioLM audio generation model in Pytorch

    Implementation of AudioLM, a Language Modeling Approach to Audio Generation out of Google Research, in Pytorch It also extends the work for conditioning with classifier free guidance with T5. This allows for one to do text-to-audio or TTS, not offered in the paper. Yes, this means VALL-E can be trained from this repository. It is essentially the same. This repository now also contains a MIT licensed version of SoundStream. It is also compatible with EnCodec, however, be aware that it has a more restrictive non-commercial license, if you choose to use it.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    SoniTranslate

    SoniTranslate

    Synchronized Translation for Videos

    SoniTranslate is a video translation and dubbing system that produces synchronized target-language audio tracks for existing video content. It provides a web UI built with Gradio, allowing users to upload a video, choose source and target languages, and then run a pipeline that handles transcription, translation and re-synthesis of speech. Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes and respect subtitle timing, which lets...
    Downloads: 51 This Week
    Last Update:
    See Project
  • 17
    Kubernetes & OpenShift Java Client

    Kubernetes & OpenShift Java Client

    Java client for Kubernetes & OpenShift

    This client provides access to the full Kubernetes & OpenShift REST APIs via a fluent DSL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Apache NetBeans

    Apache NetBeans

    Apache NetBeans

    ...It highlights source code syntactically and semantically, lets you easily refactor code, with a range of handy and powerful tools. Apache NetBeans provides editors, wizards, and templates to help you create applications in Java, PHP and many other languages. Apache NetBeans can be installed on all operating systems that support Java, i.e, Windows, Linux, Mac OSX and BSD. Write Once, Run Anywhere, applies to NetBeans too.
    Downloads: 172 This Week
    Last Update:
    See Project
  • 19
    LINE Messaging API SDK for Java

    LINE Messaging API SDK for Java

    LINE Messaging API SDK for Java

    The LINE Messaging API SDK for Java makes it easy to develop bots using LINE Messaging API, and you can create a sample bot within minutes.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    rtmp-rtsp-stream-client-java

    rtmp-rtsp-stream-client-java

    Library to stream in rtmp and rtsp for Android. All code in Java

    Library for streaming in RTMP and RTSP. All code in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    EssentialsX

    EssentialsX

    The modern Essentials suite for Spigot and Paper

    EssentialsX is a continuation of the Essentials plugin suite, updated to support modern Minecraft and Spigot versions. It provides countless new features, performance enhancements and fixes that are not available in the original Essentials or Spigot-Essentials. If you're coming from the original Essentials plugin, EssentialsX is a drop-in replacement for Essentials.
    Downloads: 96 This Week
    Last Update:
    See Project
  • 22
    VisualVM

    VisualVM

    VisualVM is an All-in-One Java Troubleshooting Tool

    ...Designed for both development and production time use. Use Apache Ant 1.9.9 or above and Oracle JDK 8 to build VisualVM from this repository. VisualVM monitors and troubleshoots applications running on Java 1.4+ from many vendors using various technologies including jvmstat, JMX, Serviceability Agent (SA) and Attach API. VisualVM perfectly fits all the requirements of application developers, system administrators, quality engineers and end users. VisualVM automatically detects and lists locally and remotely running Java applications (jstatd must be running on the remote host). ...
    Downloads: 39 This Week
    Last Update:
    See Project
  • 23
    LWJGL

    LWJGL

    Java library that enables cross-platform access to popular native APIs

    LWJGL is a Java library that enables cross-platform access to popular native APIs useful in the development of graphics (OpenGL, Vulkan), audio (OpenAL) and parallel computing (OpenCL) applications. This access is direct and high-performance, yet also wrapped in a type-safe and user-friendly layer, appropriate for the Java ecosystem. LWJGL is an enabling technology and provides low-level access.
    Downloads: 34 This Week
    Last Update:
    See Project
  • 24
    Aliucord

    Aliucord

    A modification for the Discord Android App

    Aliucord is a modification for the Android Discord app.
    Downloads: 85 This Week
    Last Update:
    See Project
  • 25
    Gson

    Gson

    A Java serialization/deserialization library to convert Java Objects

    Gson is a Java library developed by Google that allows conversion between Java objects and JSON. It enables serialization and deserialization of Java classes to and from JSON format, handling complex and generic types, nulls, custom naming policies, and more. Gson is lightweight, easy to use, and does not require annotation-based configuration, making it a popular choice for JSON processing in Java applications.
    Downloads: 9 This Week
    Last Update:
    See Project