SPTK is a suite of speech signal processing tools for UNIX
environments, e.g., LPC analysis, PARCOR analysis, LSP analysis,
PARCOR synthesis filter, LSP synthesis filter, vector
quantization techniques, and other extended versions of them.
MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
mp3 library, advanced ID3V1 and ID3V2 tagger, player. Organize a large mp3 library, over 40,000 songs. Speech synthesis and tag backup utilities. Scripts to maintain and organize song files.
Intended eventually to be a live CD, LinVision allows the blind to use a computer to:
1) Organise books & read aloud.
2) Organise & play music.
3) Teach & test keyboard skills.
4) Write & save or email work.
5) Browse Internet.
VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and Open Source Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.
DTW is intended to be a Voice in -> Pictures + Text out program written in java using Sphinx from CMU. This is intended to be useful to people who have good oral/visual literacy skills but poor written literacy skills.
The Pawn will make it possibly for you to tell the computer exactly what you would like it to do. Fiction. No its reality now. The highly customizable slackware will be the base for Pawn.