speaker free download

Open Notebook

An Open Source implementation of Notebook LM with more flexibility

...Open Notebook enables users to organize and analyze multi-modal content such as PDFs, videos, audio files, web pages, and Office documents. It combines full-text and vector search with context-aware AI chat to deliver insights grounded in your own research materials. With advanced features like multi-speaker podcast generation, customizable content transformations, and a comprehensive REST API, Open Notebook provides a powerful and extensible research environment.

Downloads: 29 This Week

Last Update: 2026-06-18

See Project

The SpeechBrain Toolkit

A PyTorch-based Speech Toolkit

...SpeechBrain supports state-of-the-art methods for end-to-end speech recognition, including models based on CTC, CTC+attention, transducers, transformers, and neural language models relying on recurrent neural networks and transformers. Speaker recognition is already deployed in a wide variety of realistic applications. SpeechBrain provides different models for speaker recognition, including X-vector, ECAPA-TDNN, PLDA, and contrastive learning. Spectral masking, spectral mapping, and time-domain enhancement are different methods already available within SpeechBrain. Separation methods such as Conv-TasNet, DualPath RNN, and SepFormer are implemented as well. ...

Downloads: 1 This Week

Last Update: 2026-03-30

See Project

Vosk Speech Recognition Toolkit

Offline speech recognition API for Android, iOS, Raspberry Pi

...More to come. Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API, reconfigurable vocabulary and speaker identification. Speech recognition bindings are implemented for various programming languages like Python, Java, Node.JS, C#, C++, Rust, Go and others. Vosk supplies speech recognition for chatbots, smart home appliances, and virtual assistants. It can also create subtitles for movies, and transcription for lectures and interviews. ...

Downloads: 93 This Week

Last Update: 2024-04-22

See Project

TTS

Deep learning for text to speech

...Notebooks for extensive model benchmarking. Modular (but not too much) code base enabling easy testing for new ideas. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings efficiently. Vocoder models (MelGAN, Multiband-MelGAN, GAN-TTS, ParallelWaveGAN, WaveGrad, WaveRNN). If you are only interested in synthesizing speech with the released TTS models, installing from PyPI is the easiest option.

Downloads: 0 This Week

Last Update: 2021-10-18

See Project

Deepvoice3_pytorch

PyTorch implementation of convolutional neural networks

An open source implementation of Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning.

Downloads: 0 This Week

Last Update: 2024-08-13

See Project

Lip Reading

Cross Audio-Visual Recognition using 3D Architectures

...Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenarios. The approach of AVR systems is to leverage the extracted information from one modality to improve the recognition ability of the other modality by complementing the missing information. The essential problem is to find the correspondence between the audio and visual streams, which is the goal of this work. ...

Downloads: 4 This Week

Last Update: 2022-08-11

See Project

Search Results for "speaker"

Showing 6 open source projects for "speaker"

Open Notebook

The SpeechBrain Toolkit

Vosk Speech Recognition Toolkit

TTS

Deepvoice3_pytorch

Lip Reading

Search Results for "speaker"

Showing 6 open source projects for "speaker"

Open Notebook

The SpeechBrain Toolkit

Vosk Speech Recognition Toolkit

TTS

Deepvoice3_pytorch

Lip Reading

Related Searches

Related Categories