A Fast Fourier Transform based upon the principle, "Keep It Simple, Stupid." Kiss FFT is a very small, reasonably efficient, mixed-radix FFT library that can use either fixed-point or floating-point data types.
SPTK is a suite of speech signal processing tools for UNIX environments, e.g., LPC analysis, PARCOR analysis, LSP analysis, PARCOR synthesis filter, LSP synthesis filter, vector quantization techniques, and other extended versions of them.
WaveSurfer is an open source tool for sound visualization and manipulation. Typical applications are speech/sound analysis and sound annotation/transcription. WaveSurfer may be extended by plug-ins as well as embedded in other applications.
MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
Speech Made Visible is an experiment in showing some of the qualities of speech in printed text. Analyze a recording for attributes like pitch, intensity (loudness), and speed; then style the words in a transcript to suggest those characteristics.
Vamp is an audio processing plugin system for plugins that extract descriptive information from audio data.
Recommends music based upon your current taste.
A music recommendation engine. It is meant to be an add-on for popular media players like Winamp, Amarok, Rhythmbox or Banshee. Currently supports only MediaMonkey Player. Download, extract and run "pronac.exe". Play the first song from the Now Playing list, and it will recommend the next songs from the same list. NOTE: MAKE SURE THAT SONG SHUFFLE IS TURNED OFF WHILE USING PRONAC. Based upon the K-Nearest Neighbor machine learning algorithm, K-Fold Cross Validation, and EchoNest for audio features.
Converts a WAV file to a BMP image file. The image shows a sonic analysis.
Snack is an extension to Tcl which adds commands for sound I/O and processing.
The Audio-Analyzer project is a set of tools for measuring the frequency response, distortion, and quality of audio equipment. It includes test signal generators and spectrum analyzers.
Toolbox for speech processing. An implementation of the Voicebox interface.
This is a toolbox for speech processing written in C. It implements the interface of the Voicebox toolbox (http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html).
Add annotations (tags and notes) to collections of audio/video files. Intended to be used e.g. for annotating class recordings, interviews, archival footage, etc.
Mixes Bhatkhande rules with traditional and learning-based software techniques to classify existing, and predict new, North Indian (Hindustani) classical music raagas (ragas).
Sound Orgy is a suite of command-line tools for sound production and modification, as well as other miscellaneous sound-related functions. Sound Orgy consists of components which use command-line piping to connect to each other, much like patch cords.
sujiSound is a flexible, lightweight audio processing engine. High efficiency, total flexibility, clean code and portability are the main goals.
A portable utility for checking the consistency of MPEG streams or files. The check focuses primarily on the seamless flow of frames and tags, since most MPEG defects introduced by aborted network transfers manifest themselves as breaks in that flow.
The Mayhem & Chaos Collection is a collection of semi-related software projects written/maintained by Robert Kaye. This umbrella project allows me to make my various side projects available to the public.
The project aims to identify a simple, low-cost eddy current probe that can be interfaced directly to ordinary Linux systems, and to develop software to record data and identify hidden corrosion on aircraft during their mandated annual inspections.
Speech recognition project for the Lithuanian language.
- Translation of English texts into Spanish. - Helping developers test, and contributing ideas... - In other words, I also offer any generally useful material I may have, on a non-profit basis.
Various tools for analysis of MySpace relationships and media.
QNX Speech Recognition
Replaytool provides an environment for the replay and annotation of multiple media sources, such as video files, text logs, map data etc. See www.cs.nott.ac.uk/~apf for some more screenshots and videos.
Training Studio helps you to capture and analyze your fitness training diary and race results for many sports, as well as monitoring your overall health and planning for future events.
Prcho is a Ruby on Rails-based online music player and organizer. Its design enables it to access extremely large music libraries with grace, finding songs quickly and streaming them all over the world.