Showing 52 open source projects for "sound to text"

  • 1
    Sound to Text

    Convert a sound file to text to analyze it

    Convert a sound file to text you can analyze to spot hidden words for psychological or universal-philosophical interpretation. The program will convert a sound file to text, with the purpose of analyzing it to spot intelligible words. Use the program with songs, movie or TV dialogue, private recordings, to reveal the hidden text messages of the sound. The resulting text will be a long string of mostly repeating characters but, every so often, you will notice an intelligible word, either...
    Downloads: 3 This Week
  • 2
    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...
    Downloads: 11 This Week
  • 3
    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization...
    Downloads: 5 This Week
  • 4
    Amiga Memories

    A walk along memory lane

    Amiga Memories is a project (started & released in 2013) that aims to make video programmes that can be published on the internet. The images and sound produced by Amiga Memories are 100% automatically generated. The generator itself is implemented in Squirrel, the 3D rendering is done on GameStart 3D. An Amiga Memories video is mostly based on a narrative. The purpose of the script is to define the spoken and written content. The spoken text will be read by a voice synthesizer (Text To Speech...
    Downloads: 0 This Week
  • 5
    Text to Chord

    Turn words into chords

    Convert words and sentences to 5 note chords you can use to inspire music creation. Have fun turning your name, your city name, your friends' names, your team's name, your pet's name into wild and original harmonies that go beyond serialism and classic jazz.
    Downloads: 5 This Week
  • 6
    Text to Waveform

    Create synth presets from words

    Convert words to waveforms you can load into a synthesizer oscillator to create synth presets. Have fun turning your name, your friends' names, your city name, your pet's name, your team's name into synth presets you can use to produce a track.
    Downloads: 1 This Week
  • 7
    SpeeD ReaD ("Speedy Read-y")

    SpeeD ReaD is a little program to help you read faster.

    SpeeD ReaD helps you to read faster and more efficiently. By minimizing subvocalization and saccades, you can process and comprehend the text you read much faster than with normal reading. First, subvocalization is the natural tendency for all of us to "hear" the words in our brains as we read. Think of it as reading out loud inside your head. But our minds do not need us to sound out the words we read - even inside our heads - in order to understand them. The words can be processed...
    Downloads: 0 This Week
  • 8
    Hannes File Cleaner

    Automated file organization in a user-friendly GUI

    Automates file organization by moving files from a source folder to specific destination folders based on their types, such as images, text, sound, and video files. It provides a user-friendly interface.
    Downloads: 0 This Week
  • 9

    psgdump

    Dump psg/ym chip tune files to txt and midi format

    The PSGDump tool is a parser and converter for chip tune files. It supports PSG and YM input file formats, focusing on AY/YM chip tunes from ZX Spectrum and Atari ST. The tool produces text output of the notes played and creates a multi-track MIDI file.
    Downloads: 0 This Week
  • 10
    18k-youtube-download

    ❤️ 18k-youtube-download with python and kivy Dev.Wk-18k

    18k-youtube-download is a simple project that combines a Kivy GUI with downloading music from YouTube using the youtube_dl package.
    Downloads: 0 This Week
  • 11
    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following...
    Downloads: 32 This Week
  • 12

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud. Features: cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian); GPU/CPU encode/decode/post-processing; stream statistics; CCTV; adaptive HLS streams; load balancing; temporary URLs; HLS push; EPG scanning; subtitles-to-text conversion; ad insertion; logo overlay; video effects; relays; timeshifts; catchups; playlists; restream/transcode from online streaming services like YouTube, Twitch...
    Downloads: 1 This Week
  • 13
    Defox text to speech and downloader

    Read written or imported text aloud offline, or download it as speech online.

    This software is designed to convert text to speech and download the converted speech. Description: • Installation setup in two languages (English, French) • Two areas, called text reading and speech downloading • Many languages supported in the download center. Note 1: I'm still a student and not in the software design industry, so I may lack software-making skills. I'm worried about that! Note 2: When you double-click on the software, it may take some seconds...
    Downloads: 0 This Week
  • 14
    oscp

    multiplatform, small and handy audio/video player with network remote

    ... (fc, bp, ay, dl, tfmx, hip, mon, sa, ahx, aon, cust, etc.) feats: - small and fast - zip/rar support - ncurses ui with easy navigation - gtk or classic winamp skins mode (with the ability to load random one) - network remotes (crossplatform linux/windows, written in pygtk, protocol is plain text, ie. easy to write own client) - scrobble (last.fm) - built-in equalizer and effects - lirc (remote) - slave libs statically compiled in (no deps) -- read more in wiki above
    Downloads: 0 This Week
  • 15
    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, a MIDI playback of notes, chord markings, lyrics, import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond
    Downloads: 21 This Week
  • 16
    AngelReader

    An E-book, Audio-book, & Library Loader in One Application

    AngelReader: A minimalist but powerful GUI application that has the capacity to load [1] E-books in plain text format with the least use of both software and hardware resources. It can also load [2] Audio-books with the basic functions of play, stop, pause, and resume with the same minimalist economy that doesn't hog computer resources. When used in integration with the AngelReader Library Selector, it can function as a mini library management system for books in electronic formats. It's...
    Downloads: 0 This Week
  • 17
    FM2TXT

    Listens to radio via RTL-SDR, recognizes speech, and writes a text log file

    Log speech from your favorite FM station to a text file using an rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation: https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer the source on GitHub rather than SourceForge: https://github.com/randaller/fm2txt For those who want to recognize from a sound card instead of rtl-sdr (this allows transcribing NFM, etc.): https://github.com/randaller/souncard2txt
    Downloads: 0 This Week
  • 18

    puddletag

    Simple, powerful audio tagger for GNU/Linux

    puddletag is an audio tag editor (primarily created) for GNU/Linux, similar to the Windows program Mp3tag. Unlike most taggers for GNU/Linux, it uses a spreadsheet-like layout so that all the tags you want to edit by hand are visible and easily editable. The usual tag editor features are supported, like extracting tag information from filenames, renaming files based on their tags by using patterns, and basic tag editing. There are also Functions, which can do things like replace text, trim...
    Downloads: 9 This Week
  • 19

    Analyzer for Files

    A tool to look into file contents

    Analyzer for Files (AoF) is a tool to look into file contents, analyze the structure with installed plug-ins, and show the results in several split windows, including converted data and a tree if successful. It was designed as a workbench with a core and plug-in extensions. It can handle normal plain-text files and data, as well as complex binaries supported by the corresponding plug-ins. What's more, developers can deploy and release their own plug-ins according to the plug-in developing...
    Downloads: 0 This Week
  • 20
    A python module that provides algorithms for advanced search - basically all you need to build a search engine.
    Downloads: 0 This Week
  • 21

    Steel TTS

    A cross-platform wrapper for common text-to-speech engines in Python

    Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python. Steel currently supports the following TTS software: - Microsoft Speech API 5 (SAPI5) - eSpeak - NS Speech Synthesis - FreeTTS Documentation: http://sourceforge.net/p/steeltts/wiki/ Bug Tracker: http://sourceforge.net/p/steeltts/tickets/ If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead...
    Downloads: 0 This Week
  • 22
    AarTon
    AarTon is an automated text-to-speech application. It lets users enter text in a web-based front end and renders that text via a multi-channel sound card.
    Downloads: 0 This Week
  • 23
    VoiceCode is an open source initiative started by the National Research Council of Canada to develop a programming-by-voice toolbox. The aim of the project is to make programming through voice input as easy and productive as with mouse and keyboard. To install, use Subversion, as described on this page: http://sourceforge.net/apps/mediawiki/voicecode/index.php?title=VCode_1_Doc/InstallationManual.
    Downloads: 0 This Week
  • 24
    Speect
    Speect is a multilingual TTS system. It offers a full text-to-speech system with various APIs, as well as an environment for research and development of TTS systems and voices. It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas. These bindings are derived from SWIG interface files and can easily be extended to other languages supported by SWIG. Speect is free and open source...
    Downloads: 0 This Week
  • 25

    PhonoRipper

    A GUI tool for creating audio CDs from analog sources.

    PhonoRipper is an easy to use, all-in-one GUI tool for creating audio CDs from analog sources like vinyl records. It uses Cdrdao for CD writing and Normalize for volume normalization. Features include a fully buffered WAV file recorder with mixer and VU meter (supporting both ALSA and OSS drivers), automatic detection of track borders, an intuitive drag'n'drop CD writing interface with CD-text support, CDDB queries of CD text data, and a WYSIWYG CD cover editor.
    Downloads: 1 This Week
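
The Hannes File Cleaner entry above describes moving files from a source folder into destination folders chosen by file type. The listing does not document how the tool does this, so the following is only a minimal Python sketch of that general idea, using the standard library. The category names, the extension sets, and the function names `category_for` and `organize` are all assumptions made for illustration, not the project's actual code.

```python
import shutil
from pathlib import Path
from typing import Dict, List, Optional

# Hypothetical mapping from category folder to file extensions.
# The real tool's categories are not documented in the listing.
CATEGORIES = {
    "images": {".jpg", ".jpeg", ".png", ".gif"},
    "text": {".txt", ".md", ".csv"},
    "sound": {".mp3", ".wav", ".ogg", ".flac"},
    "video": {".mp4", ".mkv", ".avi"},
}


def category_for(path: Path) -> Optional[str]:
    """Return the category name for a file, or None if the type is unknown."""
    suffix = path.suffix.lower()
    for name, extensions in CATEGORIES.items():
        if suffix in extensions:
            return name
    return None


def organize(source: Path, destination: Path) -> Dict[str, List[str]]:
    """Move recognized files from source into destination/<category>/.

    Files with unrecognized extensions and subdirectories are left in place.
    Returns a dict of category name to the list of moved file names.
    """
    moved: Dict[str, List[str]] = {}
    # sorted() takes a snapshot of the directory before anything is moved.
    for item in sorted(source.iterdir()):
        if not item.is_file():
            continue
        category = category_for(item)
        if category is None:
            continue
        target_dir = destination / category
        target_dir.mkdir(parents=True, exist_ok=True)
        shutil.move(str(item), str(target_dir / item.name))
        moved.setdefault(category, []).append(item.name)
    return moved
```

Calling `organize(Path("Downloads"), Path("Sorted"))` would move, for example, `song.wav` into `Sorted/sound/`. A real tool would also need to decide what to do when a file with the same name already exists at the destination, which this sketch does not handle.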