A tool for segmenting, labeling and transcribing speech.
Subtitle translator from one natural language to another.
Translates subtitles in SubRip format from one natural language to another. It is based on Google Translate without the API and therefore without payment. The translator has automatic and manual spell checkers.
A Biblia Falada (The Spoken Bible) is software for reading and studying the Holy Bible. Very simple to use and fully accessible to visually impaired users, it includes, in addition to the new reading system, the complete texts of the Revista e Atualizada edition.
MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
Virtual keyboard and speech synthesizer for people who can no longer speak and have difficulty using their hands. Available in French and English.
Virtual Hypnotist is a software application that aims to provide a virtual interactive hypnosis session framework, for many uses. It is a rewrite of the Hypnotizer 2000 software. See the readme.txt file for legal info.
A speech recognition system using Matlab/Simulink/Stateflow.
This project provides a hidden Markov model speech recognition system using Matlab/Simulink/Stateflow.
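As a rough illustration of how hidden Markov model recognition scores an utterance, the sketch below computes the likelihood of an observation sequence under a discrete HMM using the standard forward algorithm. It is not taken from the project, which is built in Matlab/Simulink/Stateflow; the function name and the parameter layout (`start`, `trans`, `emit`) are assumptions made for this example.

```python
def forward_likelihood(obs, start, trans, emit):
    """Return P(obs | model) for a discrete HMM via the forward algorithm.

    obs   -- list of observation symbol indices
    start -- start[s] is the probability of beginning in state s
    trans -- trans[p][s] is the probability of moving from state p to s
    emit  -- emit[s][k] is the probability of state s emitting symbol k
    (This parameter layout is an assumption made for this sketch.)
    """
    n = len(start)
    # Initialisation: probability of starting in each state and emitting obs[0].
    alpha = [start[s] * emit[s][obs[0]] for s in range(n)]
    # Induction: fold in one observation at a time.
    for symbol in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n)) * emit[s][symbol]
            for s in range(n)
        ]
    # Termination: sum over all possible final states.
    return sum(alpha)
```

In an isolated-word recognizer, one such model would typically be trained per word, and the word whose model yields the highest likelihood would be chosen.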
Audiobook Cutter is an easy-to-use tool which splits large speech MP3 files into smaller ones without re-encoding. The split points are determined by silent parts. The main purpose is to make audiobooks or podcasts more manageable in a user-friendly way.
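The idea of choosing split points from silent parts can be sketched as follows. This is a minimal illustration on already-decoded amplitude samples, not Audiobook Cutter's own method: the tool works on MP3 data without re-encoding, whereas the function name, the amplitude threshold and the minimum silence length here are all assumptions.

```python
from typing import List


def find_split_points(samples: List[float], rate: int,
                      threshold: float = 0.02,
                      min_silence_s: float = 0.5) -> List[int]:
    """Return the sample index at the middle of each long enough silent run.

    samples       -- decoded amplitudes in the range -1.0..1.0 (assumed input)
    rate          -- samples per second
    threshold     -- amplitudes below this count as silence (assumed value)
    min_silence_s -- shortest silence, in seconds, that yields a split point
    """
    min_len = int(min_silence_s * rate)
    points = []
    run_start = None  # index where the current silent run began, if any
    for i, value in enumerate(samples):
        if abs(value) < threshold:
            if run_start is None:
                run_start = i
        else:
            # A silent run just ended; keep its midpoint if it was long enough.
            if run_start is not None and i - run_start >= min_len:
                points.append((run_start + i) // 2)
            run_start = None
    # Silence that runs to the end of the input produces no split point.
    return points
```

Splitting in the middle of a pause, rather than at its edges, leaves a little silence on both sides of the cut so that neither part starts or ends abruptly.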
Sermon Recorder is a program for recording sermons or anything else.
Sermon Recorder is a program for recording sermons, speeches or anything else. It has some special features, such as automatic filename creation with parameters, an almost "dummy-resistant" localized user interface (currently English and German), a command-line call after recording stops, and much more. The recorded data is stream-encoded directly and written to the defined files, so the risk of data loss if the PC crashes is minimized. The file name can be entered or changed at any time during recording; the recorded files are renamed after recording stops. Ideas for new features and help with translation into more languages are welcome.
MP3 library, advanced ID3v1 and ID3v2 tagger, and player. Organizes a large MP3 library of over 40,000 songs. Includes speech synthesis and tag backup utilities, plus scripts to maintain and organize song files.
EMU is a collection of software tools for the creation, manipulation and analysis of speech databases. At the core of EMU is a database search engine which allows queries based on the sequential and hierarchical structure of the annotations.
Chess for the Blind for the JAWS or NVDA Screen Readers
Winboard 4.5 32-bit is a free, accessible Windows chess program that works automatically with the JAWS screen reader or the free NVDA screen reader. It is for blind or low-vision players and for those who cannot use a mouse. It provides vocal announcements of position changes and other selectable board conditions. Blind players also use a separate "tactile chess board". Winboard 4.5 has full keyboard access to move pieces and run menu items. Partially sighted players can use high contrast mode and adjust the board, piece and most font sizes and colors. Available languages are English, German, Spanish, Italian, Dutch and Russian. Games may be viewed, modified or saved in standard PGN format. Two chess engines supply play and grandmaster-strength move analysis. Connect to the Free Internet Chess Server (FICS) to play against humans. Modify verbosity in the screen reader configuration and in the Winboard "Sound" dialog under the "General" menu. All documentation is in the "doc" folder under the Winboard directory on your C drive.
Arabisc is a speaker-independent, large-vocabulary continuous speech recognizer for the Arabic language, released under a GNU license. It is also a collection of open source tools that allows researchers and developers to build speech recognition systems for Arab
Implementation of a duration high-order hidden Markov model in Matlab.
Implementation of a duration high-order hidden Markov model (DHO-HMM) in Matlab, with an application in speech recognition.
A GUI for the Festival speech synthesis program.
Speak Freely is a cross-platform Internet telephony (voice chat) application which provides high-quality, voice-grade audio with GSM and CELP compression and encryption with DES, Blowfish, and IDEA ciphers. Regarding recent news: the maintainers will never add any unwanted extra software to the zip download file, but we can only speak for ourselves.
An open source (GNU GPL) library written in Delphi that provides easy access to the MS Speech API COM interface (SAPI4 and SAPI5 through a single interface). The source code includes samples for calling the library from Delphi, Assembler, C#, C, Lazarus and FreeBasic.
Sayz Me is a text-to-speech application for Windows. Text can be typed in or read from clipboard. Words are highlighted when spoken. Select voice, adjust reading speed, voice pitch, font and color. Simple and easy to use.
A complete management solution for LAN Houses, CyberCafes and Internet Services Bureaus, and also a highly customizable and reliable substitute for expensive and poorly designed proprietary tools. Resources: Access/MSDE, TCP/IP, UDP and DirectSpeech.
AGTK is a suite of software components for building tools for annotating linguistic signals, time-series data which documents any kind of linguistic behavior (e.g. audio, video). The internal data structures are based on annotation graphs.
EBBA is a project aiming to develop an advanced chatbot by combining AIML, 3D facial expressions, speech synthesis, speech recognition and IQ-test solving functionality.
GRANULE is a flashcards program based on Leitner cardfile methodology for learning new words. It features long-term memory training capabilities with scheduling, integrated pictures, sound, and full-screen mode.
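The Leitner cardfile method can be sketched in a few lines: a card answered correctly moves up one box and is reviewed less often, while a card answered incorrectly returns to the first box. The code below is a generic illustration, not GRANULE's implementation; the five boxes, the review intervals and all names are assumptions.

```python
from dataclasses import dataclass

# Hypothetical review intervals in days for boxes 1 to 5.
INTERVALS = {1: 1, 2: 2, 3: 4, 4: 8, 5: 16}


@dataclass
class Card:
    front: str
    back: str
    box: int = 1      # new cards start in the first box
    due_day: int = 0  # day number on which the card is next due


def review(card: Card, correct: bool, today: int) -> None:
    """Promote the card on a correct answer, otherwise send it back to box 1."""
    if correct:
        card.box = min(card.box + 1, max(INTERVALS))
    else:
        card.box = 1
    card.due_day = today + INTERVALS[card.box]


def due_cards(cards, today):
    """Return the cards scheduled for review on or before the given day."""
    return [card for card in cards if card.due_day <= today]
```

A session would then call `due_cards` for the current day, quiz each returned card, and pass the result to `review`.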
Audacity Policial (aka Audacity Police) is an extension of the Audacity sound editor that was created to help police and justice investigations based on phone call and environmental recordings, supporting audio analysis and transcription.
OC Volume is a speech recognition engine written in Java for integration with other applications. It is currently a user-dependent isolated word recognizer and can be expanded to include more recognition capabilities.
Regulus is a Prolog-based toolkit for building spoken dialogue systems.