InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour.
Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
De-essing software to reduce sibilance in speech using TSP
This de-esser uses a novel approach called Temporal Sibilance Processing. The idea is to distinguish between fricatives and voiced sections of the speech signal by the number of zero crossings in time. Most of the speech file is left untouched (the samples are directly copied from source to destination). Only fricatives that are long enough and loud enough are filtered. The advantage of this approach over traditional approaches is that the clarity of the remaining speech is completely unaffected.
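The zero-crossing idea described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual implementation: the function names, the frame length, and all thresholds are assumptions chosen for the example. It only flags candidate spans, which a real de-esser would then filter while copying everything else through unchanged.

```python
import math

def zero_crossings(frame):
    """Count sign changes between consecutive samples."""
    return sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))

def rms(frame):
    """Root-mean-square level of a frame."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))

def find_sibilant_spans(samples, frame_len=256, zc_min=64, rms_min=0.05, min_frames=2):
    """Return (start, end) sample ranges that look like sibilants.

    All parameter values are hypothetical defaults for illustration:
    a frame is flagged when it has many zero crossings (fricative-like)
    and is loud enough; a run of flagged frames is kept only if it is
    long enough.
    """
    flags = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        flags.append(zero_crossings(frame) >= zc_min and rms(frame) >= rms_min)

    spans, run_start = [], None
    # A trailing False closes any run that reaches the end of the signal.
    for i, flag in enumerate(flags + [False]):
        if flag and run_start is None:
            run_start = i
        elif not flag and run_start is not None:
            if i - run_start >= min_frames:
                spans.append((run_start * frame_len, i * frame_len))
            run_start = None
    return spans
```

Samples outside the returned spans would be copied directly from source to destination, which is what keeps the rest of the speech untouched.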
RNNLIB is a recurrent neural network library for sequence learning problems. Applicable to most types of spatiotemporal data, it has proven particularly effective for speech and handwriting recognition.
Full installation and usage instructions are given at
http://sourceforge.net/p/rnnl/wiki/Home/
Summary:
Are your eyes tired of staring at a tablet or cell-phone screen while reading ebooks? Do you have difficulty reading from an LCD screen, especially in a moving vehicle? This software is for you!
It converts your PDF files to MP3 audio books.
Special Features (Compared to similar projects):
Each page is in a separate MP3 file.
Created MP3 files have ID3v2 tags showing Book name and page number.
Multi-threaded conversion: all CPU cores are used, so conversion is several times faster.
Free Open Source Software for the Speech & Hearing Impaired
RHEMA - Speak to Hear Software Application
RHEMA is software designed to help people with speech disabilities. Thiruvalluvar, the Tamil sage of the 1st century CE, said:
“Wealth of wealth is wealth acquired by ear attent;
Wealth mid all wealth supremely excellent.” Kural No. 411
This is the first version of the software, with a limited set of Tamil words for users to practice. We have tested it with the help of a school, and at least two children were able to pick up some...
VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The aim of the project is to make programming through voice input as easy and productive as with mouse and keyboard.
To install, use Subversion as described on this page: http://sourceforge.net/apps/mediawiki/voicecode/index.php?title=VCode_1_Doc/InstallationManual.
Speect is a multilingual TTS system. It offers a full text-to-speech system with various APIs, as well as an environment for research and development of TTS systems and voices.
It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas; these bindings are derived from SWIG interface files and can easily be extended to other languages supported by SWIG.
Speect is free and open source software. ...
JListenToMe lets you control eSpeak comfortably, thanks to its friendly graphical interface and easy configuration. It is easy to use and, best of all, it is free software!
VEDICS (Voice Enabled Desktop Interaction and Control System) is assistive software that lets the user interact with the OS using voice commands. With this software the user can access any element found on the screen.
A JNI wrapper for pjsip. You can use this wrapper to develop Java applications using the pjsip library. At the moment only the pjsua API is implemented. If you would like to obtain a commercial license, or need customisations, please contact us.
An IDE for visually impaired users. It supports compiling with immediate focus on the error line, automatic code clean-up, and all screen readers, e.g. NVDA. Sorry, it does not work on Linux. It also does NOT require Java Access Bridge.
This is a Linux project that acts as a front end to cdparanoia, sox, and ffmpeg, with the aim of making it very simple to rip many audiobook CDs into a single mono audiobook (m4b) file for use in audio players capable of playing audiobooks.
Simple testing tool that generates RTP data packets and either sends them via a network interface or saves them to a pcap file. Primarily intended for use with the SIPp application to test speech quality with different codecs.
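As a rough illustration of what generating an RTP data packet involves, the sketch below packs the fixed 12-byte RTP header defined in RFC 3550 in front of a payload. It is not taken from the tool itself; the function name `build_rtp_packet` and its parameters are assumptions made for this example, and header extensions, padding, and CSRC lists are left out.

```python
import struct

def build_rtp_packet(payload, seq, timestamp, ssrc, payload_type=0, marker=False):
    """Build one RTP packet: fixed 12-byte header plus payload.

    Hypothetical helper for illustration. Payload type 0 is PCMU
    (G.711 mu-law) in the static assignments of RFC 3551.
    """
    # Byte 0: version 2 in the top two bits; padding, extension and
    # CSRC count are all zero.
    first = 2 << 6
    # Byte 1: marker bit followed by the 7-bit payload type.
    second = (int(marker) << 7) | (payload_type & 0x7F)
    header = struct.pack(
        "!BBHII",                 # network byte order
        first,
        second,
        seq & 0xFFFF,             # 16-bit sequence number
        timestamp & 0xFFFFFFFF,   # 32-bit media timestamp
        ssrc & 0xFFFFFFFF,        # 32-bit synchronization source id
    )
    return header + payload

# Example: one 20 ms frame of 8 kHz PCMU audio is 160 payload bytes.
packet = build_rtp_packet(b"\xff" * 160, seq=1, timestamp=160, ssrc=0x12345678)
```

For a stream, the sequence number would increase by one per packet and the timestamp by the number of samples per packet; the resulting bytes could then be sent over a UDP socket or written into a capture file.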
Scalable Language API (SLAPI): the most comprehensive architecture for conversational natural-language applications, including speech recognition/synthesis, semantics, and machine translation. Used on Android and other mobile app platforms.
'Text to Voice' or 'Text to Speech' is one of the coolest Firefox add-ons. It gives your browser the power of speech. Select text, click the button on the bottom right, and this add-on speaks the selected text for you. Isn't it brilliant? Moreover, the audio file can be downloaded as...
Webvoice is a text-to-speech CGI program. You can embed a link in an HTML page to send things you want to say, via sound. No software is required on the client side. Festival and sox are needed on the server. Webvoice has its own interface (if needed).
Speech based User Interface Components Library for Java is a project to create Java controls and applications that can be used not only by literate people but also by non-literate people. Speech and visual elements with minimal text are used to create components
BladeWareVXML is a portable VoiceXML 2.1 interpreter that is an enhanced version (performance, usability and integration) of OpenVXI. A commercial version, with documentation, sample code, and support options, is available from the Commetrex Website.