Search Results for "converting transcript to audio"

37 projects for "converting transcript to audio" with 1 filter applied:

  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    MARS5

    MARS5

    MARS5 speech model (TTS) from CAMB.AI

    ...The model is built to handle prosodically challenging content such as sports commentary, anime dialogue, and other high-energy or highly varied speech patterns with realistic rhythm and intonation. To control speaker identity, MARS5 uses a short reference audio clip, typically between 2 and 12 seconds, from which it learns the voice characteristics. It supports two main inference modes: shallow clone, which is faster and only needs the reference audio, and deep clone, which additionally uses the transcript of the reference audio to increase similarity and naturalness at the cost of more computation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Claude Code Video Vision

    Claude Code Video Vision

    Give Claude the ability to watch and understand videos

    ...It supports multiple backends for audio processing, including local and cloud-based options, enabling flexible deployment depending on privacy or performance requirements.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    EasyVoice

    EasyVoice

    Open source text-to-speech tool, supports extra-long text

    easyVoice is an open-source text-to-speech platform aimed at turning long-form text and novels into high-quality audio, with a strong focus on usability and scalability. It provides a web interface where users can paste or upload large texts and generate speech and subtitles in a single workflow, even for works exceeding 100,000 characters. The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Agili Hacker Podcast

    Agili Hacker Podcast

    AI tool that turns Hacker News posts into daily podcast updates

    Hacker Podcast is an AI-powered project that turns top Hacker News stories into a Chinese podcast. It automatically fetches trending posts each day, processes the content with AI, and generates concise summaries before converting them into audio. This creates a hands-free way to stay updated on tech, startups, and developer discussions without reading long threads. Hacker Podcast combines content aggregation, natural language processing, and text-to-speech to deliver clear and digestible updates. Users can listen through web interfaces or podcast platforms, while also accessing written summaries for deeper reading. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    VoxCPM

    VoxCPM

    TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

    ...Trained on a large 1.8-million-hour bilingual corpus, VoxCPM can infer appropriate speaking style from context, dynamically adjusting intonation, rhythm, and emotional tone. It supports zero-shot voice cloning from a short reference audio clip, capturing timbre, accent, and pacing to closely mimic a target speaker without per-speaker fine-tuning.
    Downloads: 37 This Week
    Last Update:
    See Project
  • 6
    AI-Media2Doc

    AI-Media2Doc

    AI tool converting video/audio into structured documents instantly

    AI-Media2Doc is a web-based application that uses large language models to convert video and audio content into structured, readable documents in a single workflow. It is designed to transform multimedia inputs into formats such as knowledge notes, summaries, mind maps, and social-style articles, making content easier to review and reuse. AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    VERT.sh

    VERT.sh

    The next-generation file converter

    VERT is a modern, privacy-focused file conversion platform that leverages WebAssembly to perform conversions entirely on the user’s device rather than relying on cloud-based processing. Built with Svelte and TypeScript, it provides a clean and responsive interface for converting a wide variety of file types, including images, audio, video, and documents. One of its defining characteristics is its local-first approach, which eliminates the need to upload files to external servers, thereby improving both privacy and performance. The system supports over 250 file formats and includes customizable conversion settings, allowing users to fine-tune output parameters. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Streamer-Sales

    Streamer-Sales

    LLM Large Model of Selling Anchor

    Streamer-Sales is an open-source large language model system designed specifically for e-commerce live streaming and automated product promotion. The project focuses on generating persuasive product descriptions and live presentation scripts that mimic the style of professional online sales hosts. By analyzing product characteristics and marketing information, the model can produce engaging explanations that emphasize benefits, features, and emotional appeal to encourage viewers to make...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Perl Audio Converter

    Perl Audio Converter

    Linux Audio Converter / Tagger / CD Ripper

    A Linux CLI tool for converting multiple audio types from one format to another. It supports the following audio formats: 3G2, 3GP, 8SVX, AAC, AC3, ADTS, AIFF, AL, AMB, AMR, APE, AU, AVR, BONK, CAF, CDR, CVU, DAT, DTS, DVMS, F32, F64, FAP, FLA, FLAC, FSSD, GSRT, HCOM, IMA, IRCAM, LA, MAT, AUD, MAT4, MAT5, M4A, M4R, MP2, MP3, MP4, MP4A, MPC, MPP, NIST, OFF, OFR, OFS, OPUS, OGA,OGG, PAF, PRC, PVF, RA, RAM, RAW, RF64, SD2, SF, SHN, SMP, SND,SOU, SPX, SRN, TAK, TTA, TXW, VOC, VMS, VQF, W64, WAV, WMA, and WV. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 10
    Linux-Intelligent-Ocr-Solution

    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    Linux-intelligent-ocr-solution Lios is a free and open source software for converting print in to text using either scanner or a camera, It can also produce text out of scanned images from other sources such as Pdf, Image, Folder containing Images or screenshot. Program is given total accessibility for visually impaired. A Tesseract Trainer GUI is also shipped with this package. Forum : https://groups.google.com/forum/#!forum/lios Video Tutorial :...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 11
    Subsonic
    Subsonic is a web-based media streamer, providing ubiquitous access to your music and video collection. More than 20 apps are available for Android, iPhone, Windows Phone, BlackBerry, Roku, Chumby, Sonos etc. Supports virtually all media formats, converting files on the fly. Also includes a Podcast receiver and jukebox feature allowing you to control what's playing on your computer from your mobile phone.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 12
    EXP Soundboard

    EXP Soundboard

    Simple soundboard app with hotkeys

    A soundboard that supports almost all MP3s and WAVs. Sounds can be triggered with custom keyboard hot-keys and played through up to 2 outputs. i.e. Your speakers and a virtual audio cable. Also allows for your mic to pass into the virtual audio cable when enabling Mic Injector. This soundboard also incorporates a save feature. REQUIREMENTS: - Java 7 If you want sounds to be played through voice chat you'll need a virtual audio cable. (For Windows users I recommend the...
    Leader badge
    Downloads: 1,179 This Week
    Last Update:
    See Project
  • 13
    Graphical Youtube Downloader

    Graphical Youtube Downloader

    GYD is a youtube-dl GUI based on QT

    GYD - Graphical Youtube Downloader is a GUI for youtube-dl. It is easy to use and it supports most of the youtube-dl features and some extra features like converting files and a "youtube to MP3 / OGG" (video to audio) function since Version 0.3a. When you like to use the extra features you must install ffmpeg. Youtube-dl - http://rg3.github.com/youtube-dl/ NOTE: GYD 0.3.x is the last version using QT4 (does not work with QT5).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    MediaEncodingCluster is an Enterprise Class, Video Cluster Environment with a Plattform Independent Client - Server Architecture for distributed video/audio converting/encoding tool over a grid Computing Network Design. more on http://docs.codergrid.de
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    MobileMate

    MobileMate

    A video and audio converting tool customized on Tinycore Linux.

    MobileMate is an open source video and audio converting tool customized on Tinycore Linux. It use Bash to glue open source tools such as Mplayer, FFmpeg(Libav), Zenity, Grep, Sed, etc.. It features as a self-booting tiny linux, can easy expand to your language, small size, easy to custom as your needs, etc..
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    transqript

    a program to transcript audio files

    transqript can be used to transcribe audio files of interviews etc. to text files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Set of python scripts for bulk conversion of media files. The scripts scan directory trees for video and audio files and converting them to avi, ogg, or mp3 appropriately. Relies on mencoder, lame, mplayer, and oggenc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    A Framework for mp3 content. Base is a database containing your music. On top there may be plugins for ripping, playing and converting music content.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    The Qt Audio Processor is an ultimate audio files processing software, including ripping, converting, tagging and burning to, from and between every available audio codec.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    MediaDropBox
    A GUI for audio and video encoding and playing for portable devices using ffmpeg and mplayer.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Tinger Converter
    This soft can convert all kinds of audios and videos to MP3 formate. She can run on any kinds of platform, like windows, linux, MAC, etc. Especially she is better at multi-tasks. The audio like mp3, wma, etc. The video like rmvb, rmvb, avi, mkv, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Speech Made Visible
    Speech Made Visible is an experiment in showing some of the qualities of speech in printed text. Analyze a recording for attributes like pitch, intensity (loudness), and speed; then style the words in a transcript to suggest those characteristics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    LEG is the Linux Encoder Gui. It exist in order to make life easier for users to do file conversions, whether its converting plain avi to mpeg or performing dvd rips and converting to different media types like IPODs, SmartPhones, IPAQs, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Lossless2lossy is a conversion script for mass converting your ENTIRE music collection (or just one album) from one format to another whilst mirroring the directory structure and tags of the original format. Supports ape,flac,wavpack(& hybrid),ogg,mp3.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ProteinMusic is a Java program converting DNA sequences into music. The original idea for this project came from R. D. King at the University of Wales, Aberystwyth and C. G Angus from the Shamen (King, R.D. & Angus, C.G. (1996)).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB