audio detection free download

Showing 101 open source projects for "audio detection"

1

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 47 This Week

Last Update: 2023-12-07
See Project
2

FFsubsync

Automagically synchronize subtitles with video

... language. In this case, you can use the correctly synchronized srt file directly as a reference for synchronization, instead of using the video as the reference. ffsubsync uses the file extension to decide whether to perform voice activity detection on the audio or to directly extract speech from an srt file. ffsubsync usually finishes in 20 to 30 seconds, depending on the length of the video.

Downloads: 15 This Week

Last Update: 2022-12-31
See Project
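The extension-based choice described in the FFsubsync entry can be pictured with a minimal sketch. This is not ffsubsync's actual code: the function name `choose_reference_mode`, the `SUBTITLE_EXTENSIONS` set, and the returned labels are hypothetical and exist only to illustrate the idea that an srt reference skips voice activity detection.

```python
from pathlib import Path

# Hypothetical names for illustration only; not ffsubsync's real internals.
SUBTITLE_EXTENSIONS = {".srt"}


def choose_reference_mode(reference_path: str) -> str:
    """Return 'subtitle' when the reference is an srt file, otherwise 'audio'."""
    suffix = Path(reference_path).suffix.lower()
    if suffix in SUBTITLE_EXTENSIONS:
        # Speech intervals can be read straight from the srt timings.
        return "subtitle"
    # Anything else is treated as media that needs voice activity detection.
    return "audio"


if __name__ == "__main__":
    for name in ("movie.mkv", "reference.srt", "REFERENCE.SRT"):
        print(name, "->", choose_reference_mode(name))
```

Lower-casing the suffix is a design choice of this sketch so that `REFERENCE.SRT` is handled like `reference.srt`.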
3

Label Studio

Label Studio is a multi-type data labeling and annotation tool

The most flexible data annotation tool. Quickly installable. Build custom UIs or use pre-built labeling templates. Detect objects in images, with bounding boxes, polygons, circles, and keypoints supported. Partition an image into multiple segments. Use ML models to pre-label and optimize the process. Label Studio is an open-source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats. It can...

Downloads: 10 This Week

Last Update: 2024-08-20
See Project
4

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.

Downloads: 9 This Week

Last Update: 1 day ago
See Project
5

Basic Pitch

A lightweight audio-to-MIDI converter with pitch bend detection

Basic Pitch is a Python library for Automatic Music Transcription (AMT), using a lightweight neural network developed by Spotify's Audio Intelligence Lab. It's small, easy-to-use, pip install-able and npm install-able via its sibling repo. Basic Pitch may be simple, but it's far from "basic"! basic-pitch is efficient and easy to use, and its multi-pitch support, its ability to generalize across instruments, and its note accuracy compete with much larger and more resource-hungry AMT systems...

Downloads: 4 This Week

Last Update: 2024-08-16
See Project
6

Homebridge UniFi Protect

Complete HomeKit integration for UniFi Protect with full support

Complete HomeKit integration for UniFi Protect with full support for most features including autoconfiguration, motion detection, and multiple controllers. homebridge-unifi-protect is a Homebridge plugin that provides HomeKit support to the UniFi Protect device ecosystem. UniFi Protect is Ubiquiti's video security platform, with rich camera, doorbell, and NVR controller hardware options for you to choose from, as well as an app which you can use to view, configure and manage your video camera...

Downloads: 2 This Week

Last Update: 2024-06-16
See Project
7

BizHawk

BizHawk is a multi-system emulator written in C#

A multi-system emulator written in C#. As well as quality-of-life features for casual players, it also has recording/playback and debugging tools, making it the first choice for TASers (Tool-Assisted Speedrunners). Screenshotting and recording audio + video to file. Firmware management, input, framerate, and more in a HUD over the game. Rebindable hotkeys for controlling the frontend (keyboard+mouse+gamepad). A comprehensive input mapper for the emulated gamepads and other peripherals...

Downloads: 2 This Week

Last Update: 2023-05-03
See Project
8

Transformers

State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX

... classification, object detection, and segmentation. Audio, for tasks like speech recognition and audio classification. Transformers provides APIs to quickly download and use those pretrained models on a given text, fine-tune them on your own datasets, and then share them with the community on our model hub. At the same time, each Python module defining an architecture is fully standalone and can be modified to enable quick research experiments.

Downloads: 1 This Week

Last Update: 2024-08-22
See Project
9

Lumos Engine

Cross-Platform C++ 2D/3D game engine

Cross-platform 2D and 3D game engine written in C++ that supports both OpenGL and Vulkan. Support for Windows, Linux, macOS. 3D audio using OpenAL. Rendering 3D models with PBR shading. Debug GUI using ImGui. 3D collision detection - cuboid/sphere/pyramid. 2D collision detection - Box2D. Basic Lua scripting support.

Downloads: 1 This Week

Last Update: 2023-07-06
See Project
10

DeepDetect

Deep Learning API and Server in C++14 support for Caffe, PyTorch

... of image tagging, object detection, segmentation, OCR, Audio, Video, Text classification, CSV for tabular data and time series. Neural network templates for the most effective architectures for GPU, CPU, and Embedded devices. Training in a few hours and with small data thanks to 25+ pre-trained models. Full Open Source, with an ecosystem of tools (API clients, video, annotation, ...) Fast Server written in pure C++, a single codebase for Cloud, Desktop & Embedded.

Downloads: 0 This Week

Last Update: 2024-01-10
See Project
11

hfapigo

Unofficial (Golang) Go bindings for the Hugging Face Inference API

(Golang) Go bindings for the Hugging Face Inference API. Directly call any model available in the Model Hub. An API key is required for authorized access. To get one, create a Hugging Face profile.

Downloads: 0 This Week

Last Update: 2024-07-11
See Project
12

Mixxx

Mixxx is Free DJ software that gives you everything you need

Free and open source DJ software for Windows, macOS, and Linux. Mixxx integrates the tools DJs need to perform creative live mixes with digital music files. Whether you are a new DJ with just a laptop or an experienced turntablist, Mixxx can support your style and techniques of mixing. BPM and musical key detection help you find the perfect next track from your library. Use Sync Lock to match the tempo and beats of four songs for seamless mixing. Built-in mappings for DJ controller hardware...

Downloads: 0 This Week

Last Update: 2024-06-22
See Project
13

Adversarial Robustness Toolbox

Adversarial Robustness Toolbox (ART) - Python Library for ML security

Adversarial Robustness Toolbox (ART) is a Python library for Machine Learning Security. ART provides tools that enable developers and researchers to evaluate, defend, certify and verify Machine Learning models and applications against the adversarial threats of Evasion, Poisoning, Extraction, and Inference. ART supports all popular machine learning frameworks (TensorFlow, Keras, PyTorch, MXNet, scikit-learn, XGBoost, LightGBM, CatBoost, GPy, etc.), all data types (images, tables, audio, video...

Downloads: 0 This Week

Last Update: 2024-08-03
See Project
14

ml5.js

Friendly machine learning for the web

A neighborly approach to creating and exploring artificial intelligence in the browser. ml5.js aims to make machine learning approachable for a broad audience of artists, creative coders, and students. The library provides access to machine learning algorithms and models in the browser, building on top of TensorFlow.js with no other external dependencies.

Downloads: 0 This Week

Last Update: 2024-08-01
See Project
15

Tensorflow Transformers

State-of-the-art faster Transformer with TensorFlow 2.0

Imagine auto-regressive generation to be 90x faster. tf-transformers (Tensorflow Transformers) is designed to harness the full power of TensorFlow 2 and is built specifically for Transformer-based architectures. These models can be applied to text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. Images, for tasks like image classification, object detection, and segmentation. Audio, for tasks like...

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
16

Vividl - Video Downloader

Free video downloader for YouTube and hundreds of other websites

Vividl is a Windows app that lets you download videos from various websites, including YouTube, SoundCloud, Vimeo, Dailymotion and various media channels. It uses the popular downloader youtube-dl/yt-dlp to extract videos from several hundred websites. Just copy a video URL, paste it in Vividl and download the video in different qualities, convert it to other formats or to an audio track. You can also manage the parallel download of multiple videos and retrieve additional meta information using...

5 Reviews

Downloads: 794 This Week

Last Update: 2024-01-06
See Project
17

AugLy

A data augmentations library for audio, image, text, and video

AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...

Downloads: 0 This Week

Last Update: 2022-03-29
See Project
18

Subtitle-Workshop-Classic-v6.3.1

Subtitle Editor derived from 6.0c, but with VLC and Hunspell checker

Audio waveform, VLC video renderer, UTF-8 encoding, audio stream detection and selection, resizable screens, Hunspell spellcheck, easy shortcut editing, user profiles, and more than 70 file types supported.

Downloads: 131 This Week

Last Update: 2024-08-06
See Project
19

OpenClinic GA

Open Source Integrated Hospital Information Management System

OpenClinic GA is an open source integrated hospital information management system covering management of administrative, financial, clinical, lab, x-ray, pharmacy, meals distribution and other data. Extensive statistical and reporting capabilities.

29 Reviews

Downloads: 195 This Week

Last Update: 4 days ago
See Project
20

Shutter Encoder

Free professional video converter Windows|Mac|Linux

... - Editing codecs: DNxHD, DNxHR, Apple ProRes, QT Animation, GoPro CineForm, Uncompressed YUV - Output codecs: H.264, H.265, VP8, VP9, AV1, OGV - Broadcast codecs: XDCAM HD422, AVC-Intra 100, XAVC, HAP - Old codecs: DV PAL, MJPEG, Xvid, WMV, MPEG - Archiving codec: FFV1 - Images creation: JPEG, Image - Burn & Rip: DVD, Blu-ray, DVD RIP - Analysis: Loudness & True Peak, Audio normalization, Cut detection, Black detection, Media, VMAF - Download: Web video

1 Review

Downloads: 27 This Week

Last Update: 2024-06-12
See Project
21

Apprentice Video

It's a video player that also works for music and pictures

This player stands on the giant shoulders of FFmpeg. Audio rendering is accomplished via portaudio v19. Video rendering is via OpenGL, using fragment programs when possible. User interface is implemented with Qt 4/5. ASS/SSA subtitle rendering is implemented with libass. MOD, XM, S3M, IT file support via libmodplug. This player provides several performance options to enable adequate video playback on slow hardware: * skip loop filter * skip non-reference frames * skip color converter...

4 Reviews

Downloads: 9 This Week

Last Update: 6 days ago
See Project
22

SpectrumViewer for Windows

Continuous display of audio spectrum from PC input or wave file

Reads audio from Windows audio interface or wave file and creates a continuous spectrum display. Command line support for generating sine, square, triangle and sawtooth wave files. Uses biquad IIR digital filters for frequency detection, plus digital lowpass filters for output extraction. Highly configurable. 60+ FPS with suitable hardware. Manual or automatic gain control.

Downloads: 6 This Week

Last Update: 2023-09-28
See Project
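As a rough illustration of frequency detection with a biquad IIR filter, the sketch below measures how much signal power falls near a chosen frequency. It is not SpectrumViewer's code: the listing does not say which filter design the program uses, so the band-pass coefficients here follow the widely used Audio EQ Cookbook form as an assumption, and the names `bandpass_coefficients` and `band_power` are hypothetical.

```python
import math


def bandpass_coefficients(center_hz, sample_rate_hz, q):
    """Biquad band-pass (unity gain at the center), Audio EQ Cookbook form.

    Assumed design for illustration; returns coefficients normalized by a0.
    """
    w0 = 2.0 * math.pi * center_hz / sample_rate_hz
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b = (alpha / a0, 0.0, -alpha / a0)
    a = (-2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0)
    return b, a


def band_power(samples, center_hz, sample_rate_hz, q=10.0):
    """Mean power of the samples after band-pass filtering around center_hz."""
    if not samples:
        return 0.0
    (b0, b1, b2), (a1, a2) = bandpass_coefficients(center_hz, sample_rate_hz, q)
    x1 = x2 = y1 = y2 = 0.0  # filter state: previous inputs and outputs
    total = 0.0
    for x0 in samples:
        # Direct form I difference equation.
        y0 = b0 * x0 + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
        x2, x1 = x1, x0
        y2, y1 = y1, y0
        total += y0 * y0
    return total / len(samples)


if __name__ == "__main__":
    rate = 16000
    # One second of a 1 kHz test tone.
    tone = [math.sin(2.0 * math.pi * 1000.0 * n / rate) for n in range(rate)]
    for center in (1000.0, 4000.0):
        print(center, "Hz band power:", band_power(tone, center, rate))
```

Running one such filter per frequency of interest gives a coarse spectrum; a higher `q` narrows each band at the cost of a slower response.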
23

footswitch2

Audio Transcription software for Linux (Vlc) with a foot pedal

Footswitch 2 is a media player for transcribers on Linux. Written in Python and using the Python bindings for VLC, it allows a transcriber to control the audio or video with a USB foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within LibreOffice as well, making it useful for those who do not yet own a foot pedal/foot switch. Control of the media player from LibreOffice can be via hotkeys or an integrated toolbar...

Downloads: 2 This Week

Last Update: 2023-04-02
See Project
24

footswitch3basic

Audio Transcription software for Linux (Gstreamer) with a foot pedal

Footswitch3basic is a media player for transcribers on Linux. Written in Python using the bindings for GStreamer, it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within LibreOffice as well, making it useful for those who do not yet own a foot pedal/foot switch. Control of the media player from LibreOffice can be via hotkeys or an integrated toolbar...

Downloads: 0 This Week

Last Update: 2023-04-02
See Project
25

footswitch2basic

Audio Transcription software for Linux (Vlc) with a foot pedal

Footswitch 2 (Basic) is a media player for transcribers on Linux. This is a stripped-down version of Footswitch2, containing only the absolute essentials for transcription. Written in Python and using the Python bindings for VLC, it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within LibreOffice as well, making it useful for those who do...

Downloads: 0 This Week

Last Update: 2023-04-02
See Project