application free download

Diffgram

Training data (data labeling, annotation, workflow) for all data types

From ingesting data to exploring it, annotating it, and managing workflows. Diffgram is a single application that will improve your data labeling and bring all aspects of training data under a single roof. Diffgram is world’s first truly open source training data platform that focuses on giving its users an unlimited experience. This is aimed to reduce your data labeling bills and increase your Training Data Quality. Training Data is the art of supervising machines through data. ...

Downloads: 3 This Week

Last Update: 2024-10-14

See Project

FireRedASR

Open-source industrial-grade ASR models

FireRedASR is an industrial-grade family of open-source automatic speech recognition models designed to provide high-precision speech-to-text performance across languages including Mandarin, English, and various Chinese dialects, achieving new state-of-the-art benchmarks on public test sets. The project includes multiple model variants to meet different application needs, such as high-accuracy end-to-end interaction using an encoder-adapter-LLM framework and efficient real-time recognition using attention-based encoder-decoder architectures, giving developers flexibility in balancing performance and resource constraints. FireRedASR not only excels in traditional speech recognition tasks but also demonstrates strong capability in challenging scenarios like singing lyrics recognition, where accurate transcription is often difficult for conventional models.

Downloads: 0 This Week

Last Update: 2026-02-25

See Project

Lip Reading

Cross Audio-Visual Recognition using 3D Architectures

The input pipeline must be prepared by the users. This code is aimed to provide the implementation for Coupled 3D Convolutional Neural Networks for audio-visual matching. Lip-reading can be a specific application for this work. Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenarios. The approach of AVR systems is to leverage the extracted information from one modality to improve the recognition ability of the other modality by complementing the missing information. ...

Downloads: 1 This Week

Last Update: 2022-08-11

See Project

Domotic Speech-recognition interface

Speech-recognition interface for a domotic system.

...Available oral commands are generated from a house description file in XML format. The oral commands have to be trained for a specific users. For this purpose 2 interfaces are provided: a command line interface and a web application. These interfaces allow to visualize oral commands, train and delete trainings.

Downloads: 0 This Week

Last Update: 2015-12-29

See Project

Centauri Voice Interface

Provides a voice interface for applications via a plug in system. Allows the inclusion of voice recognition in an application with a minimum of effort.

Downloads: 0 This Week

Last Update: 2016-03-11

See Project

Search Results for "application"

Showing 5 open source projects for "application"

Diffgram

FireRedASR

Lip Reading

Domotic Speech-recognition interface

Centauri Voice Interface

Search Results for "application"

Showing 5 open source projects for "application"

Diffgram

FireRedASR

Lip Reading

Domotic Speech-recognition interface

Centauri Voice Interface

Related Searches

Related Categories