One platform to build, fine-tune, and deploy ML models. No MLOps team required.
Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
Try Free
Build Securely on Azure with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.
Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
SAMI is an extensible, voice-controlled home automation system which seamlessly controls all of the pieces of your apartment or house, without installation or monthly fees.
For more information about how to use her, see the documentation tab!
De-essing software to reduce sibilance in speech using TSP
This de-esser uses a novel approach called Temporal Sibilance Processing. The idea is to distinguish between fricatives and voiced sections of the speech signal by the number of zero crossings in time. Most of the speech file is left untouched (the samples are directly copied from source to destination). Only fricatives that are long enough and loud enough are filtered. The advantage of this approach over traditional approaches is that the clarity of the remaining speech is completely unaffected.
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.
You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
Speect is a multilingual TTS system. It offers a full text-to-speech system with various API's, as well as an environment for research and development of TTS systems and voices.
It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas, these bindings are derived from SWIG interface files and can easily be extended for other languages supported by SWIG.
Speect is free and open...
Advanced Speech Signal Analysis library provides a structure to handle various file formats and a variety of analysis functions commonly used in speech processing.
Simple testing tool to generate RTP data packets and send it via netwok interface or save into pcap file. Primarily intended for use with SIPp application to test speech quality with different codecs.
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
Asterisk Dialplan application, which allows you to use Lia_Phon and Mbrola as a French speech synthesizer. Application du plan de numérotation d'Asterisk, qui permet d'utiliser Lia_Phon et Mbrola comme synthétiseur vocal français sous Asterisk.
BladeWareVXML is a portable VoiceXML 2.1 interpreter that is an enhanced version (performance, usability and integration) of OpenVXI. A commercial version, with documentation, sample code, and support options, is available from the Commetrex Website.
The SingIt Lyric Displayer is an XMMS plugin which displays formatted lyrics, including id3v2xx lyrics. It consists of the displayer and an integrated editor which allows one to easily insert time stamps, edit the text, and export & strip HTML.
Flite text-to-speech module for Asterisk. This provides the "Flite" dialplan application, which allows you to use the Flite TTS Engine as a speech synthesizer in Asterisk.
VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and Open Source Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.
A language teaching program and library based on C. It includes sound snippets featuring native speakers. You can create, edit and use various lessons and learn via an optional GTK2 interface.
Portable GUI Ogg/Speex/FLAC audio encoder/player that can encode wave file and provides additional functionality such as audiofile tagging, html-album generator, cd-ripping, etc. Targeted to be used on Freebsd, Linux and win32 platforms and the frontends
SAA (SSPLab Audio Analyzer)
It will be able to separate sources, recognize speech and analyze
the auditory scene. It can also synthesize spatialised sounds from
mono recording, edit, analyze via spectrogram, filter and re-sample
signals.
Speak Freely for X is a GNOME interface for the popular Speak Freely applications. Telephony over the internet is now available in a user friendly GNOME environment!
Idi is a voice recognition program intended to help people with physical disabilities to use a keyboard by dictating. It can also be used as a way to remote control your computer or as a nice way to type in your bath.