Showing 304 open source projects for "audio processing"

View related business solutions
  • Build IoT Your Way Icon
    Build IoT Your Way

    Quickly build, secure, and manage connected devices from anywhere with the Golioth IoT Cloud.

    Connect and secure devices, send sensor data to the web, update firmware over the air, and scale your fleet with our instant IoT cloud.
  • The Apple Device Management and Security Platform Icon
    The Apple Device Management and Security Platform

    For IT teams at organizations that run on Apple

    Achieve harmony across your Apple device fleet with Kandji's unmatched management and security capabilities.
  • 1
    VideoSrt

    VideoSrt

    Windows-GUI

    ... to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. Video recognition does not need to upload the original video, which is convenient, fast and time-saving.
    Downloads: 44 This Week
    Last Update:
    See Project
  • 2
    Whisper

    Whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

    Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...
    Downloads: 37 This Week
    Last Update:
    See Project
  • 3
    Sneedacity

    Sneedacity

    Audio Editor

    Sneedacity (formerly Audacity) is an easy-to-use, multi-track audio editor and recorder for Windows, Mac OS X, GNU/Linux and other operating systems. Sneedacity aims to provide a telemetry-free experience while otherwise being up-to-date with Audacity's original codebase, with minor improvements being added as (s)needed by contributors and volunteers like you at large. Sneedacity is free software licensed under GPL, version 2. Macros for chaining commands and batch processing. Scripting...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 4
    Faust

    Faust

    Functional programming language for signal processing

    Faust (Functional Audio Stream) is a functional programming language for sound synthesis and audio processing with a strong focus on the design of synthesizers, musical instruments, audio effects, etc. Faust targets high-performance signal processing applications and audio plug-ins for a variety of platforms and standards. The core component of Faust is its compiler. It allows to "translate" any Faust digital signal processing (DSP) specification to a wide range of non-domain specific languages...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Contractor Foreman is the most affordable all-in-one construction management software for contractors and is trusted by contractors in more than 75 countries. Icon
    Starting at $49/m for the WHOLE company, Contractor Foreman is the most affordable all-in-one construction management system for contractors. Our customers in 75+ countries and industry awards back it up. And it's all backed by a 100 day guarantee.
  • 5
    AudioKit

    AudioKit

    Swift audio synthesis, processing, & analysis platform

    AudioKit is an entire audio development ecosystem of code repositories, packages, libraries, algorithms, applications, playgorunds, tests, and scripts, built and used by a community of audio programmers, app developers, engineers, researchers, scientists, musicians, gamers, and people new to programming. An important goal for AudioKit is to allow it to grow and be maintainable by a handful of volunteers. For this reason we have extensive tests that are run whenever changes are made to any...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    Overtone

    Overtone

    Collaborative programmable music

    Overtone is an open-source audio environment designed to explore new musical ideas from synthesis and sampling to instrument building, live coding and collaborative jamming. We combine the powerful SuperCollider audio engine, with Clojure, a state-of-the-art lisp, to create an intoxicating interactive sonic experience. Synchronize your visuals and noise with ease. Overtone features seamless integration with both Quil, a Clojure front-end to Processing and ShaderTone, a Clojure version...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    BotSharp

    BotSharp

    Open source AI chatbot platform builder in 100% C#

    Conversation as a platform (CaaP) is the future, so it's perfect that we're already offering the whole toolkits to our .NET developers using the BotSharp AI BOT Platform Builder to build a CaaP. It opens up as much learning power as possible for your own robots and precisely control every step of the AI processing pipeline. BotSharp is an open source machine learning framework for AI Bot platform builder. This project involves natural language understanding, computer vision and audio processing...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    LittleJS

    LittleJS

    The Tiny JavaScript Game Engine That Can!

    LittleJS is a super lightweight 2D JavaScript game engine with fast WebGL rendering. It is designed to be small, simple, and easy to use for various applications, from game jams to commercial releases. This engine has everything necessary to make high-quality games, including fast rendering, physics, particles, sound effects, music, keyboard/mouse/gamepad input handling, update/render loop, and debug tools. It is recommended that you start by copying the LittleJS Starter Project This file is...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Recorder

    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    ... of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support. Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: from simple recording to complex real-time voice Recognition (ASR), and even audio-related games, are handled with ease.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Software for modern PR Icon
    Software for modern PR

    Prezly serves brands and communications agencies.

    Newsrooms, campaigns and contact management in one powerful tool With Prezly, you can save time on juggling multiple outdated tools like Word docs, plain text email with heavy attachments, and Excel spreadsheets that keep getting longer.
  • 10
    Datasets

    Datasets

    Hub of ready-to-use datasets for ML models

    Datasets is a library for easily accessing and sharing datasets, and evaluation metrics for Natural Language Processing (NLP), computer vision, and audio tasks. Load a dataset in a single line of code, and use our powerful data processing methods to quickly get your dataset ready for training in a deep learning model. Backed by the Apache Arrow format, process large datasets with zero-copy reads without any memory constraints for optimal speed and efficiency. We also feature a deep integration...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    SuperCollider

    SuperCollider

    Audio server, programming language, and IDE for sound synthesis

    SuperCollider is a platform for audio synthesis and algorithmic composition, used by musicians, artists, and researchers working with sound. It is free and open source software available for Windows, macOS, and Linux. scsynth, a real-time audio server, forms the core of the platform. It features 400+ unit generators (“UGens”) for analysis, synthesis, and processing. Its granularity allows the fluid combination of many known and unknown audio techniques, moving between additive and subtractive...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DALI

    DALI

    A GPU-accelerated library containing highly optimized building blocks

    The NVIDIA Data Loading Library (DALI) is a library for data loading and pre-processing to accelerate deep learning applications. It provides a collection of highly optimized building blocks for loading and processing image, video and audio data. It can be used as a portable drop-in replacement for built-in data loaders and data iterators in popular deep learning frameworks. Deep learning applications require complex, multi-stage data processing pipelines that include loading, decoding...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    T81 558

    T81 558

    Applications of Deep Neural Networks

    Deep learning is a group of exciting new technologies for neural networks. Through a combination of advanced training techniques and neural network architectural components, it is now possible to create neural networks that can handle tabular data, images, text, and audio as both input and output. Deep learning allows a neural network to learn hierarchies of information in a way that is like the function of the human brain. This course will introduce the student to classic neural network...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    txtai

    txtai

    Build AI-powered semantic search applications

    ..., models can understand concepts in documents, audio, images and more. Machine-learning pipelines to run extractive question-answering, zero-shot labeling, transcription, translation, summarization and text extraction. Cloud-native architecture that scales out with container orchestration systems (e.g. Kubernetes). Applications range from similarity search to complex NLP-driven data extractions to generate structured databases. The following applications are powered by txtai.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Triton Inference Server

    Triton Inference Server

    The Triton Inference Server provides an optimized cloud

    ... real-time, batched, ensembles, and audio/video streaming. Provides Backend API that allows adding custom backends and pre/post-processing operations. Model pipelines using Ensembling or Business Logic Scripting (BLS). HTTP/REST and GRPC inference protocols based on the community-developed KServe protocol. A C API and Java API allow Triton to link directly into your application for edge and other in-process use cases.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Competent Audio

    Competent Audio

    Machine graph audio engine for computer games

    Competent Audio (CA) is an audio engine suitable for video games. It is written in C, but is designed for interoperability with other languages. Windows and Linux binaries for x86 and amd64 are available. CA uses a machine graph model with support for arbitrary numbers of machines, limited only by the available system resources: - Samplers play back audio clips. - Mixers combine audio signals and optionally perform signal processing. - Sinks send audio signals to an output device. Stereo...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Beep

    Beep

    A little package that brings sound to any Go application

    A little package that brings sound to any Go application. Suitable for playback and audio processing. Beep is built on top of its Streamer interface, which is like io.Reader, but for audio. It was one of the best design decisions I've ever made and it enabled all the rest of the features to naturally come together with not much code. Decode and play WAV, MP3, OGG, and FLAC. Encode and save WAV. Very simple API. Limiting the support to stereo (two channel) audio made it possible to simplify...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    SPTK is a suite of speech signal processing tools for UNIX environments, e.g., LPC analysis, PARCOR analysis, LSP analysis, PARCOR synthesis filter, LSP synthesis filter, vector quantization techniques, and other extended versions of them.
    Leader badge
    Downloads: 48 This Week
    Last Update:
    See Project
  • 20
    Data augmentation

    Data augmentation

    List of useful data augmentation resources

    List of useful data augmentation resources. You will find here some links to more or less popular github repos, libraries, papers, and other information. Data augmentation can be simply described as any method that makes our dataset larger. To create more images for example, we could zoom in and save a result, we could change the brightness of the image or rotate it. To get a bigger sound dataset we could try to raise or lower the pitch of the audio sample or slow down/speed up. Keypoints...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Equalizer APO

    A system-wide equalizer for Windows 7 / 8 / 8.1 / 10 / 11

    Equalizer APO is a parametric / graphic equalizer for Windows. It is implemented as an Audio Processing Object (APO) for the system effect infrastructure introduced with Windows Vista. Features: - virtually unlimited number of filters - works on any number of channels - very low latency, which makes it suited for interactive applications - low CPU usage - modular graphical user interface - VST plugin support - integrates into Voicemeeter (http://www.voicemeeter.com/) Requirements: - Windows...
    Leader badge
    Downloads: 53,071 This Week
    Last Update:
    See Project
  • 22
    PC Equalizer - GUI

    PC Equalizer - GUI

    GUI for Equalizer APO, Instantly Boost Your Computer's Audio.

    .... It offers control over all audio channels, including stereo, mono, swap, invert, balance, and more. The Pan\Expand processing feature allows for the adjustment of stereo effects. Additionally, the Multi-Delays feature enables users to create a variety of effects like echoes, delays, reverb, and more. It also allows for assigning presets to specific programs, such as the Rock preset to Winamp, Classic preset to Windows Media Player, Pop preset to PotPlayer, and so on.
    Leader badge
    Downloads: 2,772 This Week
    Last Update:
    See Project
  • 23
    FFmpeg Batch AV Converter

    FFmpeg Batch AV Converter

    Free all in one audio/video ffmpeg batch encoder

    FFmpeg Batch AV Converter is a free universal audio and video encoder, that allows to use the full potential of ffmpeg command line with a few mouse clicks in a convenient GUI with drag and drop, progress information. Some fancy wizards make things easy for non-experts. Thanks to its multi-file encoding feature, it may be the fastest a/v batch encoder available, since it maximizes system resources usage by launching as many simultaneous processes up to user cpu thread count. You can change...
    Leader badge
    Downloads: 2,407 This Week
    Last Update:
    See Project
  • 24
    LameXP

    LameXP

    Multi-Format Audio-Encoder Front-end

    LameXP is a free multi-format audio file converter that supports a variety of output formats, including MP3, AAC/MP4, Ogg Vorbis, Opus, as well as FLAC, and an even higher number of input formats. It also supports batch processing and can utilize multiple processor cores.
    Leader badge
    Downloads: 215 This Week
    Last Update:
    See Project
  • 25
    Nyquist

    Nyquist

    Nyquist is a language for sound synthesis and music composition.

    Nyquist is a language for sound synthesis and music composition. It is implemented in C and C++ and runs on Win32, OSX, and Linux. Nyquist combines a powerful functional programming style with efficient signal-processing primitives. Nyquist is also embedded as a scripting language in Audacity.
    Leader badge
    Downloads: 86 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next