The SpeakRight Framework is a speech application framework written in Java. SpeakRight applications are fast to create and work on any (VoiceXML) speech platform. Applications are written in Java with full debug and unit testing available.
Free OpenSource VoiceXML editor programmed in Java (Swing). The VoiceXML document is regularly parsed, a tree view is built and syntax errors are reported in a specific table.
Automatically translates English/French/German text to German/French/English text and outputs speech in the appropriate language.
All Automagically with the power of the inter-webs.
An open PHP-based framework for the Nabaztag™ (http://www.nabaztag.com/) electronic pet. Due to major changes on the Violet backend, OpenNab can no longer connect to it, but it can still be used as a standalone server to set your bunnies free!
JeSpeak is a Java library that bridges eSpeak, a compact open-source software speech synthesizer. JeSpeak uses JNI to make native calls to libespeak.
Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.
PDF Annot is a piece of software that enables you to add audio and text annotations to a PDF. It uses the JPedal SimpleViewer and the iText library. Annotations are supported by Adobe's official PDF Reader. Report any bug here: krakosia[at]gmail.com
AGTK is a suite of software components for building tools for annotating linguistic signals, time-series data which documents any kind of linguistic behavior (e.g. audio, video). The internal data structures are based on annotation graphs.
vox-attendant is a voice-enabled telephony virtual receptionist designed to work with VoiceXML; it includes a distribution for the Voxeo Prophecy platform. It provides a web-based interface for managing the directory.
Get your database online quickly using configuration not code. Use this secure, scalable and proven enterprise technology to publish any relational data, any custom process on the internet. MySQL, Java, XML, XSL. xHTML GUI, beta voiceXML and WML/WAP.
Internet Text Radio, designed around freeTTS, connects to a text server and tunes to a channel. The server starts pumping text data for that channel to the client, which converts the text to speech and plays it back as audio, like an internet radio station.
The main goal of the Dr.Ta/MOD project is to create a brand new form of in-vehicle communication. In a multi-network environment, Dr.Ta/MOD uses the SCTP multi-homing feature to handle multi-path associations. Please read more on our website: http://mod.maple.tw
The official software package for Vietnamese voice support in the Festival speech synthesis system (text-to-speech). This voice is developed (and owned) by Pham Thanh Nam.
VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and OpenSource Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.
A simple program that speaks text aloud. You can type the text or select a file.
Fala is just a frontend to Festival. It's designed for GNOME, but if you have GTK, Python and Festival you are able to run it.
The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.
Project c2h - cetacean to human - building Seadragon, a tool for the scientific research of the acoustic communication of cetaceans, supporting the creation, emission, and recognition of underwater whistles. The blog: http://leafyseadragon.blogspot.com/
Audacity Policial (aka Audacity Police) is an extension of Audacity sound editor that was created to help police and justice investigations based on phone call and environmental recordings, supporting audio analysis and transcription.
festival-te synthesizes text in the Telugu language into speech using Festival TTS. The package provides the supporting modules required to use Festival for Telugu. It includes modules for text/lexical analysis and intonation/duration prediction for Telugu.
GibPhone is a highly extensible VoIP/IM client for the .NET framework that uses a powerful plugin engine to allow for UI extensions and any call control stack / media payload / transport protocol combination.
This tool is for Nuance developers who wish to analyze the results of Nuance's batchrec program. It reads in the results of a Nuance batch recognition run, calculates WER using sclite, and stores the results in a database for subsequent analysis.
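For readers unfamiliar with the WER (word error rate) metric mentioned above, the following is a minimal Python sketch of how it is commonly defined: the word-level edit distance between a reference transcript and a recognizer hypothesis, divided by the number of reference words. This is not sclite and not this tool's code; the function name `word_error_rate` and the plain whitespace tokenization are assumptions made for illustration only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / reference word count.

    Illustrative only: tokenizes on whitespace and applies no text
    normalization, unlike a full scoring tool.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (row-by-row dynamic programming).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("sat" -> "sit") and one deletion ("the"):
    # 2 errors over 6 reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.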