MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
Srt Viewer for Mac is a software to show the subtitles with a timeline on SRT FORMAT(SubTitles) files independently.
EMU is a collection of software tools for the creation, manipulation and analysis of speech databases. At the core of EMU is a database search engine which allows queries based on the sequential and hierarchical structure of the annotations.
This project is now located at http://xiph.org/quicktime/ Xiph QuickTime Components (XiphQT) is, in short, the solution for Mac and Windows users who want to use Xiph formats in any QuickTime-based application, e.g. playing Ogg Vorbis in iTunes or produc
a 16-bit midi player for DOS
DOSMid is a real mode (16-bit) midi player for DOS. DOSMid supports a variety of MIDI synthesizers, and has very low hardware requirements.
MPT is a toolbox that supplies cross-platform libraries for real-time perception primitives, including face detection, eye detection, blink detection, and color tracking.
LibSMF is a BSD-licensed C library for handling SMF ("*.mid") files. It transparently handles conversions between time and pulses, tempo map handling etc. The only dependencies are C compiler and glib. Full API documentation and examples are included. Note that the development has moved to https://github.com/nilsgey/libsmf.
A tool written in Java that help you to extract your songs and videos from your iPod to your PC.
EasyBMPtoAVI is a cross-platform (Linux, Windows, OSX, Solaris..), easy-to-use application to convert a series of BMP images of any bit depth to an AVI movie file. EasyBMPtoAVI supports both command-line and interactive use, and a GUI version is planned.
HMM-based singing voice synthesis system
Sinsy is an HMM-based singing voice synthesis system. This software is released under the Modified BSD license.
Provide a user-level API (C library) for communicating with the Creative NOMAD Jukebox MP3 players and Dell DJs under Linux, *BSD and Windows. The protocol in question is colloquially known as "PDE" (Portable Digital Entertainment). It includes simple
OggCarton is a cross-platform CD ripper, database, and web server for Ogg and MP3 files. Needs no external database or web server! <br> Linux and Windows require Java 1.4.1 (or later) installed. Java is included with Mac OS X.
Copies video and audio from a PVR's HDD and produces .mpg, M2v and or mp2 / AC3 files suitable for furthure video/audio processing, Griphical interface and EPG information also. Currently supports Dish/Echostar 501/508/510/522/625 model numbers.
A free, open source Spotify-compatible client
Small tool that creates a PDF file with thumbnails of all images in a folder. The number of thumbnails per page along with some other settings can be adjusted. Jpg2Pdf uses the iText library for pdf-generating.
Klang is a project that allows viewing and editing of binary files in a structured way. Unlike traditional hex editors, Klang provides a hierarchical view of many binary file types that can be 'chunked', such as WAV and AIFF.
Matlab Toolbox for reading and writing videos.
Matlab Toolbox to process video files, which consists on a set of classes for reading, writing, correcting light changes and generating gaussian pyramids in real time. This toolbox is designed for Windows x64, Max OS X x64 and Linux x64. Through to use of Ffmpeg, it can reproduce and create videos very fast and also do no require to load the entire video file in memory. In addition the C++ code is already compiled in order to simplify the toolbox installation. In order to stay tuned for updates, you can follow the Matlab VideoUtils on Twitter (@VideoUtils): https://twitter.com/#!/VideoUtils If anyone is interested in collaborate, please contact to me -> https://sourceforge.net/sendmessage.php?touser=3811831
Variations is a digital music library software system that provides online access to streaming audio and scanned score images with a flexible access control framework to ensure respect for intellectual property.
A Java Utility to edit and create new game content for Might and Magic 8 with support for 6 and 7. This is an extension of the no-longer-developed UnLod java utility for MM8. MM8LevelEditor is no longer actively developed. A new MMLevelEditor is being re
FMOD.net is an implementation in .net c# for the FMOD sound system. Its possible to play a sound using only 3 lines of code! Its possible to create DSP functions by yourself using c# or other .net languages. You can play just many sounds as you want.
Cross platform C++ library for developping audio plugin´s UI
VSTGUI is a cross platform C++ library mainly for developing user interfaces for audio plug-ins (VST, AudioUnit, AAX, RTAS, etc).
Last Dot FM is an alternative open source player for last.fm web service. It supports all features of official client and also some additional features. Written in .NET 2.0.
Tools recording, mixing, mastering and delivering music tracks
Command line tool handling steps to clean, calibrate, process, mix, master and deliver music tracks from recordings. Easy-to-use configuration files drive the complete processes.
Speech Made Visible is an experiment in showing some of the qualities of speech in printed text. Analyze a recording for attributes like pitch, intensity (loudness), and speed; then style the words in a transcript to suggest those characteristics.
The Integrating Vision Toolkit (IVT) is a powerful and fast C++ computer vision library with an easy-to-use object-oriented architecture. It offers its own multi-platform GUI toolkit. OpenCV is integrated optionally. Website: http://ivt.sourceforge.net