1.) Investigation with cosine transform, and anti transform algorithm, with some voice recognition code. 2.) Translator: Croatian, English. 3.) 2D to 3D picture algorithm (principle) and new 2Dto3D video conversion code with AviSynth video scripting
Cairo sets out to provide an enterprise grade, MRCPv2 compliant speech solution utilizing existing opensource speech resources such as FreeTTS and Sphinx-4.
Simple testing tool to generate RTP data packets and send it via netwok interface or save into pcap file. Primarily intended for use with SIPp application to test speech quality with different codecs.
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.
FreeTTS is a speech synthesis engine written entirely in the
Java(tm) programming language. FreeTTS was written by the Sun Microsystems Laboratories Speech Team and is based on CMU's
Flite engine. FreeTTS also includes a partial JSAPI 1.0
XVoice provides voice control of X applications using IBM's ViaVoice for Linux (free download at their web site). Both user-defined commands and dictation are supported. It can be used to write letters, write code, control netscape, etc.
Real-time error alerts, performance insights, and anomaly detection across your full stack. Free 30-day trial.
Move from alert to fix before users notice. AppSignal monitors errors, performance bottlenecks, host health, and uptime—all from one dashboard. Instant notifications on deployments, anomaly triggers for memory spikes or error surges, and seamless log management. Works out of the box with Rails, Django, Express, Phoenix, Next.js, and dozens more. Starts at $23/month with no hidden fees.
Scalable Language API (SLAPI) The most comprehensive architecture for conversational natural-language applications including speech recognition/synthesis, semantics, & machine translation. Used on Android & other mobile app platforms.
Adds the ability to navigate within and between pages to the standard WIkipedia interface. The project won Best-in-contest for the AVIOS Speech Application contest in 2010.
Asterisk Dialplan application, which allows you to use Lia_Phon and Mbrola as a French speech synthesizer. Application du plan de numérotation d'Asterisk, qui permet d'utiliser Lia_Phon et Mbrola comme synthétiseur vocal français sous Asterisk.
'Text to Voice' or 'Text to Speech' is 1 of the coolest Firefox add-ons. It gives ur brwsr the pwr of speech. Select txt, clck the bttn on the bttm rite & this add-on spks the selectd txt 4 u. Isn't it brllant? Moreovr odio file cn b dnloaded as
Speech Made Visible is an experiment in showing some of the qualities of speech in printed text. Analyze a recording for attributes like pitch, intensity (loudness), and speed; then style the words in a transcript to suggest those characteristics.
Performs actions on detected volume threshold Examples : - Launch music on clap - Launch speech recording when you start speaking - Launch guard webcam when a significant sound is detected - Increase or decrease headphones volume when ambient noise pass
audacity-extra now provides a sleek dark themed version of the Audacity opensource sound editor. The project experiments with Audacity variations. There's a vowel-sound target-practice display for language learners and an analog waveform data logger for embedded systems.
Use our tools to hand edit phonetic word dictionaries for speech recognition engines. The new G2P4J format supporting SAMPA and Kirshenbaum IPA is portable to Sphinx, Julius and others. Demo medical, legal and technical dictionaries are featured.
Linux on Sound is a project to create a transparent console interface over eSpeak (http://espeak.sourceforge.net/), to increase linux console's accessibility.
eSpeakIt is a Firefox extension that converts text to speech (using the espeak command), and plays the audio or saves it for use in portable media players. eSpeak must be installed for this to work. (see http://espeak.sourceforge.net/)