Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.
Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
SPTK is a suite of speech signal processing tools for UNIX
environments, e.g., LPC analysis, PARCOR analysis, LSP analysis,
PARCOR synthesis filter, LSP synthesis filter, vector
quantization techniques, and other extended versions of them.
MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
A fast GPU accelerated feature extraction software for speech analysis
A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is a specially designed to process very large audio data sets. It uses GPU acceleration if compatible GPU available (CUDA as weel as OpenCL, NVIDIA, AMD, and Intel GPUs are supported). CPU SSE intrinsic instruction set is used in cases where no compatible GPU present.
Advanced Speech Signal Analysis library provides a structure to handle various file formats and a variety of analysis functions commonly used in speech processing.
1.) Investigation with cosine transform, and anti transform algorithm, with some voice recognition code. 2.) Translator: Croatian, English. 3.) 2D to 3D picture algorithm (principle) and new 2Dto3D video conversion code with AviSynth video scripting
This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.
Speech Made Visible is an experiment in showing some of the qualities of speech in printed text. Analyze a recording for attributes like pitch, intensity (loudness), and speed; then style the words in a transcript to suggest those characteristics.
eXtace is a 3D audio visualization tool (or eye candy depending on how you look at it). eXtace requires ESD (Esound) for its sound input source. It performs a FFT (fast fourier transform) on audio and displays it via various graphical modes.
Audacity Policial (aka Audacity Police) is an extension of Audacity sound editor that was created to help police and justice investigations based on phone call and environmental recordings, supporting audio analysis and transcription.
The purpose of this project is to provide a biometric security solution by using voice print, fingerprint and/or facial recognition along with a password and/or smart card support using AES to protect data. Please read forums for if interested.
SAA (SSPLab Audio Analyzer)
It will be able to separate sources, recognize speech and analyze
the auditory scene. It can also synthesize spatialised sounds from
mono recording, edit, analyze via spectrogram, filter and re-sample
signals.
jATLAS is a Java implementation of ATLAS [Architecture and Tools for Linguistic Analysis Systems]. For more information, see http://jatlas.sourceforge.net.
Find the pitch of a power spectrum (signal) as per the afferent/efferent neural crossover. This occurs between the Lateral olivocochlear efferents and the inner hair cell afferents.
ViGiL is supposed to be a platform-independent tool for singing students. In it's final version it should be able to analyze a voice recording (read from audio file or microphone) and compare it to a given score according to melody, rhythm and dynamics.
Idi is a voice recognition program intended to help people with physical disabilities to use a keyboard by dictating. It can also be used as a way to remote control your computer or as a nice way to type in your bath.