1.) Investigation with cosine transform, and anti transform algorithm, with some voice recognition code. 2.) Translator: Croatian, English. 3.) 2D to 3D picture algorithm (principle) and new 2Dto3D video conversion code with AviSynth video scripting
Cairo sets out to provide an enterprise grade, MRCPv2 compliant speech solution utilizing existing open source speech resources such as FreeTTS and Sphinx-4.
The Bonn Open Synthesis System (BOSS) is an extensible concatenative speech synthesizer with Non-Uniform Unit Selection capabilities developed at the University of Bonn, Germany.
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
Simple testing tool to generate RTP data packets and send it via netwok interface or save into pcap file. Primarily intended for use with SIPp application to test speech quality with different codecs.
This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.
FreeTTS is a speech synthesis engine written entirely in the
Java(tm) programming language. FreeTTS was written by the Sun Microsystems Laboratories Speech Team and is based on CMU's
Flite engine. FreeTTS also includes a partial JSAPI 1.0
XVoice provides voice control of X applications using IBM's ViaVoice for Linux (free download at their web site). Both user-defined commands and dictation are supported. It can be used to write letters, write code, control netscape, etc.
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.
Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Scalable Language API (SLAPI) The most comprehensive architecture for conversational natural-language applications including speech recognition/synthesis, semantics, & machine translation. Used on Android & other mobile app platforms.
Asterisk Dialplan application, which allows you to use Lia_Phon and Mbrola as a French speech synthesizer. Application du plan de numérotation d'Asterisk, qui permet d'utiliser Lia_Phon et Mbrola comme synthétiseur vocal français sous Asterisk.
Adds the ability to navigate within and between pages to the standard WIkipedia interface. The project won Best-in-contest for the AVIOS Speech Application contest in 2010.
'Text to Voice' or 'Text to Speech' is 1 of the coolest Firefox add-ons. It gives ur brwsr the pwr of speech. Select txt, clck the bttn on the bttm rite & this add-on spks the selectd txt 4 u. Isn't it brllant? Moreovr odio file cn b dnloaded as
Speech Made Visible is an experiment in showing some of the qualities of speech in printed text. Analyze a recording for attributes like pitch, intensity (loudness), and speed; then style the words in a transcript to suggest those characteristics.
Performs actions on detected volume threshold Examples : - Launch music on clap - Launch speech recording when you start speaking - Launch guard webcam when a significant sound is detected - Increase or decrease headphones volume when ambient noise pass
audacity-extra now provides a sleek dark themed version of the Audacity open source sound editor. The project experiments with Audacity variations. There's a vowel-sound target-practice display for language learners and an analog waveform data logger for embedded systems.
Use our tools to hand edit phonetic word dictionaries for speech recognition engines. The new G2P4J format supporting SAMPA and Kirshenbaum IPA is portable to Sphinx, Julius and others. Demo medical, legal and technical dictionaries are featured.
Linux on Sound is a project to create a transparent console interface over eSpeak (http://espeak.sourceforge.net/), to increase linux console's accessibility.