Showing 7 open source projects for "format"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    MLX-Audio

    MLX-Audio

    A text-to-speech, speech-to-text and speech-to-speech library

    ...The project provides a straightforward CLI (mlx_audio.tts.generate) as well as a Python API for programmatic generation of audio, including parameters for voice choice, speed, language hints, output format, and sample rate. It includes examples such as audiobook generation to demonstrate long-form synthesis and joined audio segments. On top of that, MLX-Audio offers a modern web interface powered by FastAPI, with real-time waveform and 3D visualizations, file upload, and audio management.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    OpenAI-Compatible Edge-TTS API

    OpenAI-Compatible Edge-TTS API

    Free, high-quality text-to-speech API endpoint to replace OpenAI

    ...A Docker image is provided for one-command deployment, and environment variables can be used to configure default voice, language, response format, authentication, and logging options.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    sag

    sag

    Like the macOS say command, but with a modern voice

    ...The project allows users to stream synthesized speech directly to speakers, save audio files, or list and manage available voices through a lightweight terminal interface. Designed for speed and convenience, sag supports voice selection, playback rate adjustments, output format inference, and configurable API endpoints for flexible deployment. It integrates naturally into shell scripts, automation pipelines, and AI workflows where high-quality voice synthesis is required without complex setup. The tool emphasizes a familiar UNIX-style experience while significantly improving realism and expressiveness compared to traditional system speech engines. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    abogen

    abogen

    Generate audiobooks from EPUBs, PDFs and text with captions

    ...In other words, it automates the pipeline of reading a digital book (or document), converting its text into speech via a TTS engine, and packaging the result into an audiobook format — likely along with timestamped captions or subtitles that align with the spoken audio. This can be very useful for accessibility, content consumption on the go, or for users who prefer audio over reading. The repository supports handling common ebook formats and generating outputs that combine audio plus caption metadata. By automating text-to-speech for arbitrary documents, abogen reduces the friction of producing audiobooks and could be integrated into larger workflows (e.g., batch converting a library of texts).
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    Orpheus TTS

    Orpheus TTS

    Towards Human-Sounding Speech

    ...Inference is provided through a Python package that uses vLLM under the hood for high-throughput, low-latency generation, including streaming examples that show how to generate audio chunks in real time. The maintainers provide Colab notebooks, a standardized prompting format, and one-click deployment via Baseten for production-grade, FP8/FP16 optimized inference with ~200 ms streaming latency.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    htmid

    htmid

    Generative Music For Beginners and Everyone Else

    Generative music is a fascinating and innovative approach to music creation that involves creating procedurally generated music that evolves and changes over time. Whether you're a beginner or a seasoned musician, this guide will introduce you to the world of generative music and show you how to create your own live music performances. Generative music is music that is ever-changing and created in real-time. It can be created by anyone, with or without musical experience. Learn how to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Coder Revolt - Text to Speech is a completely Free to use software for commercial/non-commercial rights from business to personal projects. The software itself is designed to be very simple but yet loaded with incredible features that are targeted at solving your problems. Open Source files will be released this week (6/30/2016) and they will be in the full version of Visual Basic and converted version in CSharp C#.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo