This tool is for Nuance developers who wish to analyze the results of Nuance's batchrec program. It reads in the results of a Nuance batch recognition run, calculates WER using sclite, and stores the results in a database for subsequent analysis.
The Lazybones project is a server and a collection of clients. It allows one person to voice control multiple machines. Designed for software developers and sys admins, Lazybones speeds up redundant and/or complex tasks.
To provide basic text-to-speech capability on as many platforms and for as many spoken
languages as possible by formant synthesis from an International Phonetic Alphabet
representation.
DAI = Distributed Artificial Intelligence The projected is intended to be a test bed for AI related concepts and technologies, not necessarily an end user product, though that could change. Some of the modules can be modified for other uses.
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.
Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Automated Attendance System (AAS) uses 2 modes for authentication -
* Voice Identification System (VIS)
* Fingerprinting Method
The algorithms used for the same has been developed by me. This algo is more efficient and faster.
The Open VXI VoiceXML interpreter is a portable open source library that interprets the VoiceXML dialog markup language. It is designed to serve as a reference for parties interested in understanding how VoiceXML markup might be executed.
A subtitle agent, which developed on Java, helps to create, modify, and operate subtitle files easily. Also providing Java packages for developing subtitle agents conveniently. Moreover, they're free and open-sourced, based on GPL lisence.
Speech profile of Person is created that contains elementary sounds uttered. Profile is 1 time download for listeners. The actual audio sample is encoded based on profile. Decode using the profile stored earlier by User, and the audio can be regenerated.
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
Software to fit whole-sentence language models using the principle of maximum entropy. For developers of speech recognizers, text prediction interfaces, OCR, machine translation software.
A language teaching program and library based on C. It includes sound snippets featuring native speakers. You can create, edit and use various lessons and learn via an optional GTK2 interface.
DTW is intended to be a Voice in -> Pictures + Text out program written in java using Sphinx from CMU. This is intended to be useful to people who have good oral/visual literacy skills but poor written literacy skills.
A plugin for gaim that interfaces with the popular program festival. It allows for instant messages to be spoken by festival so you can hear it thru your speakers.
gwavmerger is an interactive memory training tool designed to facilitate the learning of foreign languages. It helps you memorize long passages of text by heart.
The Pawn will make it possibly for you to tell the computer exactly what you would like it to do. Fiction. No its reality now. The highly customizable slackware will be the base for Pawn.
Time of day service over telephone using Voicent Gateway, a VoiceXML gateway that specially designed for voice modems. A Free version is available for download at http://www.voicent.com/download. Sample code for interactive telephony applications.
Voxapl is a suite of voice-enabled applications supporting voice recognition and text to speech functions for home automation services. It is written in Java and communicates via xAP and/or xPL messaging.
What is "use it". it is a groupware which is really simple to use for everyone. it includes a forum, private messages,painting tool, voice tool (teamspeak), document management system, calendar system, profile and a rights management...
Portable GUI Ogg/Speex/FLAC audio encoder/player that can encode wave file and provides additional functionality such as audiofile tagging, html-album generator, cd-ripping, etc. Targeted to be used on Freebsd, Linux and win32 platforms and the frontends
The purpose of this project is to provide a biometric security solution by using voice print, fingerprint and/or facial recognition along with a password and/or smart card support using AES to protect data. Please read forums for if interested.