image text input free download

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...

Downloads: 9 This Week

Last Update: 2026-04-24

See Project

PersonaPlex

PersonaPlex code

...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.

Downloads: 1 This Week

Last Update: 2026-03-02

See Project

Moshi

A speech-text foundation model for real time dialogue

...Moshi models two streams of audio: one corresponds to Moshi, and the other one to the user. At inference, the stream from the user is taken from the audio input, and the one for Moshi is sampled from the model's output. Along these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter codebook dependencies for a given time step, while a large, 7B parameter Temporal Transformer models the temporal dependencies.

Downloads: 2 This Week

Last Update: 2024-11-05

See Project

Virtual Speech Mechanism System

Virtual Speech Mechanism System converts text to voice.

Virtual Speech Mechanism System is .NET based application written in C#. It can convert text to speech either in interactive mode or take input from a TEXT file. It's output can either be directed to speakers or saved as WAV file that can be played with any audio player. Output wave can be selected to be of channel 1 or 2. It is 2 by default. The speech rate can be controlled by -10 to 10 points depending upon the requirements along with volume ranging from 0 to 100%. ...

Downloads: 0 This Week

Last Update: 2015-05-05

See Project

JRTalk

JRTalk is a speech syth program. It allows handicap users to use a mouse and the keyboard to select phrases, type words and sentences. The software converts this text input into speech and plays it though the speakers attached to the soundcard. It also a

Downloads: 0 This Week

Last Update: 2015-06-27

See Project

Dialektos

A machine translation program designed to accept verbal or text input and provide text or speech synthesized voice translation as output. Makes use of 3 current open-source projects. The source is currently C/C++ and embedded perl.

Downloads: 0 This Week

Last Update: 2015-08-03

See Project

Talk Box

Talkbox is a program wich makes your computer talk "with" you. It has a AI based on ALICE program C and uses Festvial speech engin along with speechd to produce voice synthisis. You input text by typeing there is no support for voice reconition.

Downloads: 1 This Week

Last Update: 2015-02-24

See Project

Search Results for "image text input"

Showing 7 open source projects for "image text input"

SpeechRecognition

PersonaPlex

Moshi

Virtual Speech Mechanism System

JRTalk

Dialektos

Talk Box

Search Results for "image text input"

Showing 7 open source projects for "image text input"

SpeechRecognition

PersonaPlex

Moshi

Virtual Speech Mechanism System

JRTalk

Dialektos

Talk Box

Related Searches

Related Categories