Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.
Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
Try It Free
Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.
Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
Speech recognition software for English & Polish languages
...SkryBot Administracyjny - for civil and government administration.
3. SkryBot Medycyna Rodzinna - for physicians
Professional version of SkryBot (commercial) offers you:
1. Audio conversion and cutting sound files into smaller ones.
2. Searching for words or phrases in sound files (recognized by SkryBot).
3. Editing sound files and automatic cutting off long silence parts in audio file.
De-essing software to reduce sibilance in speech using TSP
This de-esser uses a novel approach called Temporal Sibilance Processing. The idea is to distinguish between fricatives and voiced sections of the speech signal by the number of zero crossings in time. Most of the speech file is left untouched (the samples are directly copied from source to destination). Only fricatives that are long enough and loud enough are filtered. The advantage of this approach over traditional approaches is that the clarity of the remaining speech is completely unaffected.
The SingIt Lyric Displayer is an XMMS plugin which displays formatted lyrics, including id3v2xx lyrics. It consists of the displayer and an integrated editor which allows one to easily insert time stamps, edit the text, and export & strip HTML.
SAA (SSPLab Audio Analyzer)
It will be able to separate sources, recognize speech and analyze
the auditory scene. It can also synthesize spatialised sounds from
mono recording, edit, analyze via spectrogram, filter and re-sample
signals.
AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.