  • 1
    NeuTTS Air

    NeuTTS model built from small LLM backbones

    ...Its LLM-based architecture is intended to bring more expressive and flexible speech generation to local applications. NeuTTS is especially useful for embedded voice agents, private assistants, toys, accessibility tools, and compliance-sensitive apps. Its main value is making modern voice AI more portable, private, and practical for local deployment.
    Downloads: 0 This Week
  • 2
    NeuTTS Nano

    On-device TTS model by Neuphonic

    ...Its LLM-based architecture is intended to bring more expressive and flexible speech generation to local applications. NeuTTS is especially useful for embedded voice agents, private assistants, toys, accessibility tools, and compliance-sensitive apps. Its main value is making modern voice AI more portable, private, and practical for local deployment.
    Downloads: 0 This Week
  • 3
    Dia2

    TTS model capable of streaming conversational audio in real time

    ...Dia2 provides 1B and 2B model checkpoints along with inference code for research and experimentation. It currently focuses on English generation and supports up to two minutes of generated audio. Its main value is enabling low-latency, dialogue-oriented TTS workflows where timing, turn-taking, and natural conversation matter.
    Downloads: 0 This Week
  • 4
    Fish Audio Python SDK

    The official Python library for the Fish Audio API

    ...It supports synchronous usage for simple scripts and provides utilities that make generated audio easier to play or export. The SDK is useful for developers building voice products, prototypes, assistants, content tools, or automated audio pipelines. Its main value is simplifying integration with Fish Audio services through a clean Python interface.
    Downloads: 0 This Week
  • 5
    MARS5

    MARS5 speech model (TTS) from CAMB.AI

    ...To control speaker identity, MARS5 uses a short reference audio clip, typically between 2 and 12 seconds, from which it learns the voice characteristics. It supports two main inference modes: shallow clone, which is faster and only needs the reference audio, and deep clone, which additionally uses the transcript of the reference audio to increase similarity and naturalness at the cost of more computation.
    Downloads: 0 This Week
  • 6
    WhisperSpeech

    An open-source text-to-speech system built by inverting Whisper

    ...The repository includes notebooks and scripts for inference, long-form synthesis, and finetuning, as well as pre-trained models and converted datasets hosted on Hugging Face. Performance optimizations like torch.compile, KV-caching, and architectural tweaks allow the main model to reach up to 12× real-time speed on a consumer RTX 4090.
    Downloads: 0 This Week
  • 7
    Resemble Enhance

    AI-powered speech denoising and enhancement

    ...It is useful for voice datasets, podcasts, narration, generated speech, and other workflows where speech clarity matters. The models are trained on high-quality speech data, which helps the tool produce cleaner output than basic filtering alone. Its main value is giving developers and audio creators an open tool for upgrading imperfect speech recordings.
    Downloads: 1 This Week
  • 8
    Resemblyzer

    A Python package to analyze and compare voices with deep learning

    ...The project is useful for researchers and developers who need a practical way to reason about speaker identity without building a voice encoder from scratch. It can help identify whether two recordings sound like the same speaker or visualize voice relationships across many samples. Its main value is making speaker representation accessible through a simple Python workflow.
    Downloads: 3 This Week
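
    The speaker comparison described for Resemblyzer can be illustrated with a rough sketch. It assumes each recording has already been turned into a fixed-length embedding vector and that two clips are compared by cosine similarity. The vectors, the `same_speaker` helper, and the 0.75 threshold are hypothetical stand-ins chosen for illustration; they are not taken from the Resemblyzer package, and a real threshold would need tuning on actual recordings.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def same_speaker(a, b, threshold=0.75):
    """Hypothetical decision rule: treat clips as the same speaker
    when their embeddings are similar enough. The threshold is an
    assumption, not a value from any library."""
    return cosine_similarity(a, b) >= threshold


# Toy 3-dimensional vectors standing in for real voice embeddings,
# which would normally come from a voice encoder and be much longer.
clip_a = [0.9, 0.1, 0.4]
clip_b = [0.8, 0.2, 0.5]
clip_c = [-0.3, 0.9, -0.2]

print(same_speaker(clip_a, clip_b))  # similar vectors
print(same_speaker(clip_a, clip_c))  # dissimilar vectors
```

    With real embeddings, the same pairwise similarity could also feed a clustering or 2D projection step to visualize voice relationships across many samples.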