Showing 31 open source projects for "audio"

  • 1
    pyVideoTrans

    Translate the video from one language to another and embed dubbing

    pyVideoTrans is an ambitious open-source multimedia processing project that assembles speech recognition, subtitle generation, AI translation, voice synthesis, and video assembly into a unified pipeline for converting videos from one language to another with embedded dubbing and captions. At its core it runs speech-to-text models to transcribe audio tracks, translates the resulting text into a target language using local or cloud-based translation engines, synthesizes new speech to match the translated subtitles, and then merges that speech back into the video, creating a fully localized media file. The tool supports both command-line and GUI modes, making it accessible to developers and creatives needing batch or automated processing.
    Downloads: 29 This Week
  • 2
    SPPAS

    SPPAS - the automatic annotation and analyses of speech

    SPPAS is a scientific computer software package written and maintained by Brigitte Bigi of the Laboratoire Parole et Langage, in Aix-en-Provence, France. Available for free, with open source code, there is simply no other package for linguists to simply use for the automatic annotation of speech, the analysis of any kind of annotated data, and the conversion of annotated files. SPPAS is able to automatically produce speech annotations from a recorded speech sound and its orthographic...
    Downloads: 20 This Week
  • 3

    pysoundanalyser

    A Python program to generate, visualize, and manipulate short sounds

    pysoundanalyser is a Python application that can be used to generate, visualize, and manipulate short sounds through a graphical user interface. Visualization functions cover the power spectrum, the spectrogram, the autocorrelation, and the autocorrelogram of a sound. Manipulation functions include filtering, concatenating, cutting, and scaling the level of a sound. Several types of sounds can also be generated, including pure tones, harmonic complex tones, noise of...
    Downloads: 0 This Week
  • 4
    GeigerLog

    Python program for Geiger counters and Environmental Sensors

    ...Initially developed for Geiger counters, it is now a more universal tool, which handles data like temperature, air pressure, humidity, and CO2 equally well. It continues to support Geiger counters like GQ Electronic's GMC-300/500/600, RadMon, RadPro, Gamma-Scout, audio-click and other low-cost Geiger counters, IoT devices, I2C-based sensors, and GQ EMF devices. GeigerLog reads data from devices, saves them to databases, prints to screen, and plots them as auto-updated, customizable graphs. Quality control uses Statistics, Scatter, Poisson, FFT & Autocorrelation, and more. Gamma spectra can be displayed, calibrated, and evaluated in GeigerLog! ...
    Downloads: 71 This Week
  • 5
    Signal Ranger DSP Linux Support Tools
    This project delivers Linux support (USB driver and tools) for Digital Signal Processor (DSP) cards of the Signal Ranger series (see http://www.softdb.com/product_dsp.shtml for details) connected to a Linux host via USB.
    Downloads: 0 This Week
  • 6

    pydatascope

    Software oscilloscope using Python and tkinter

    Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually.
    Downloads: 4 This Week
  • 7
    OpenDAFF

    Directional Audio File Format

    OpenDAFF is a free, open-source software package for directional audio data, such as the directivity of microphones and speakers, as well as head-related transfer functions (HRTFs).
    Downloads: 4 This Week
  • 8
    Antenna DTV
    Antenna is a free and open-source DVB-T (terrestrial digital TV) tuner for Linux desktops. It is a full-featured tool to see what's going on in the sky. It enables you to scan, watch, and record television, radio, and services broadcast on air.
    Downloads: 2 This Week
  • 9

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation, and echo cancellation algorithms. The Millennium ASR provides C++ and Python libraries for automatic speech recognition. It implements a weighted finite state transducer (WFST) decoder, along with training and adaptation methods. These toolkits are meant for facilitating research and...
    Downloads: 0 This Week
  • 10
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.
    Downloads: 0 This Week
  • 11
    PyRadmon Reborn

    PyRadmon, a Python script to use on both Linux and Windows.

    PyRadmon Reborn. This project started after the original programmers quit supporting it. The project now contains support for audio*, has a version that supports two Geiger counters**, and has cleaner, more stable code. * = PyAudio is used as the library for the audio support. ** = The two counters each run in their own thread. UPDATE-2015-2: * Added timeouts to sockets to prevent crashing of the script. * Updated some comments. UPDATE-2015-1: Since some time has passed, I decided to pick up the keyboard to write for PyRadmon again, hence the new updates. ...
    Downloads: 0 This Week
  • 12
    A collection of software made by Milos Rancic.
    Downloads: 2 This Week
  • 13

    pyespeak

    Python to eSpeak speech synthesis

    ctypes Python module for eSpeak (http://espeak.sf.net) speech synthesis
    Downloads: 0 This Week
  • 14
    Voice keyboard/dictation. Aims to be a total substitute for a keyboard. Spell out words letter by letter (using code: alpha, bravo, ..). Arrow keys, modifiers work. Speak whole words (but whole word accuracy is not good). Attach commands to some word
    Downloads: 4 This Week
  • 15
    LibreEngineering
    LibreEngineering - suite of instrumentation, electrical, mechanical, process engineering calculation and design programs and other tools. Licensed under GPL3. Written in Python with Qt toolkit.
    Downloads: 0 This Week
  • 16

    pyscope

    Software oscilloscope using Python and tkinter

    Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually. It has been renamed "pydatascope" to avoid a name clash with Pyscope, a scoping package on PyPI. See https://sourceforge.net/p/pydatascope/ for the latest code.
    Downloads: 0 This Week
  • 17
    A Python-based calculator that uses the Sabine, Norris-Eyring, and Fitzroy formulae to calculate the RT60 time of a room with given specifications. Material list from SAE Institute reference.
    Downloads: 1 This Week
  • 18
    This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.
    Downloads: 1 This Week
  • 19
    DTMF detector library and/or application that reads in the specified audio file and returns/outputs whether DTMF was detected and/or a list of detected digits. May have GUI and command line interfaces.
    Downloads: 2 This Week
  • 20
    Transana is a cross-platform tool for the transcription and qualitative analysis of audio and video data. It also provides the ability to identify and organize analytically interesting portions of videos, as well as attach keywords to those video clips.
    Downloads: 0 This Week
  • 21
    Ouroborus is an artificial life framework for mobile agents on a background of cellular automata. It can be used to teach and research topics such as population genetics, ecology and evolution. The demo includes a curses view and live Csound audio.
    Downloads: 0 This Week
  • 22
    A.L.V.I. was created to be a simple but modular bot, able to interact with humans through natural language and perform a variety of tasks, such as reading mail, news, and feeds aloud. All in Italian!
    Downloads: 0 This Week
  • 23
    AudioSavant is music visualization software that aims to give students a better understanding of music. It is based on the OLPC's TamTam application, with a set of modifications drawn from the information visualization world.
    Downloads: 0 This Week
  • 24
    ASR-Builder provides an easy-to-use interface to the HTK toolkit, that allows users to build ASR systems. ASR-Builder provides a platform that performs house-keeping tasks when using HTK and also provides default training/testing/recognition scripts.
    Downloads: 0 This Week
  • 25
    The Carolina Computer Assistive Technology group at UNC-Chapel Hill focuses on the development of enabling technologies. We are currently maintaining the pyTTS package here, but our older work is still available for (unsupported) use.
    Downloads: 2 This Week
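
Entry 17 above mentions the Sabine and Norris-Eyring formulae for RT60. As a rough illustration of what such a calculator computes, here is a minimal Python sketch under stated assumptions: it uses the standard metric forms RT60 = 0.161 V / A (Sabine) and RT60 = 0.161 V / (-S ln(1 - mean absorption)) (Norris-Eyring). The function names, the example room, and the absorption coefficients are hypothetical and are not taken from the listed project or from the SAE material list; the Fitzroy formula is left out.

```python
import math

# Metric Sabine constant (seconds per metre); assumes dimensions in metres.
SABINE_CONSTANT = 0.161


def sabine_rt60(volume_m3, surfaces):
    """Sabine estimate. `surfaces` is a list of (area_m2, absorption_coefficient)."""
    total_absorption = sum(area * alpha for area, alpha in surfaces)
    return SABINE_CONSTANT * volume_m3 / total_absorption


def eyring_rt60(volume_m3, surfaces):
    """Norris-Eyring estimate using the area-weighted mean absorption coefficient."""
    total_area = sum(area for area, _ in surfaces)
    mean_alpha = sum(area * alpha for area, alpha in surfaces) / total_area
    return SABINE_CONSTANT * volume_m3 / (-total_area * math.log(1.0 - mean_alpha))


# Hypothetical 5 m x 4 m x 3 m room with illustrative coefficients.
room_volume = 5 * 4 * 3
room_surfaces = [
    (20.0, 0.10),  # floor
    (20.0, 0.50),  # ceiling
    (54.0, 0.05),  # four walls combined
]

if __name__ == "__main__":
    print(f"Sabine RT60:        {sabine_rt60(room_volume, room_surfaces):.2f} s")
    print(f"Norris-Eyring RT60: {eyring_rt60(room_volume, room_surfaces):.2f} s")
```

For the same room, the Norris-Eyring estimate is always shorter than the Sabine estimate, because -ln(1 - a) exceeds a for any mean absorption a between 0 and 1.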