text processing free download

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 59 This Week

Last Update: 2025-06-26

See Project

NVIDIA NeMo

Toolkit for conversational AI

NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI architectures are typically large and require a lot of data and compute for training. ...

Downloads: 2 This Week

Last Update: 2026-04-22

See Project

VideoSrt

Windows-GUI

...Open source software tool that can recognize video speech and automatically generate subtitle SRT files. It is suitable for business scenarios that quickly and batch generate Chinese/English subtitles and text files for media (video/audio). Recognize video/audio speech to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. ...

Downloads: 22 This Week

Last Update: 2023-01-13

See Project

Speechalyzer

Process large speech data wrt transcription, labeling and annotation

...It is implemented as a client server based framework in Java and interfaces software for speech recognition, synthesis, speech classification and quality evaluation. The application is mainly the processing of training data for speech recognition and classification models and performing benchmarking tests on speech-to-text, text-to-speech and speech classification software systems.

Downloads: 0 This Week

Last Update: 2016-04-27

See Project

Arabic Phonetic Platform using VoiceXML

This project'll be the core engine of many voice based platforms,which can be implemented into your projects,websites...etc to provide an Arabic speech service, where your servers can interact with the clients through Arabic Speech Recognition.

Downloads: 0 This Week

Last Update: 2013-04-01

See Project

Search Results for "text processing"

Showing 5 open source projects for "text processing"

Whisper

NVIDIA NeMo

VideoSrt

Speechalyzer

Arabic Phonetic Platform using VoiceXML

Search Results for "text processing"

Showing 5 open source projects for "text processing"

Whisper

NVIDIA NeMo

VideoSrt

Speechalyzer

Arabic Phonetic Platform using VoiceXML

Related Searches

Related Categories