Showing 49 open source projects for "using"

View related business solutions
  • Outgrown Windows Task Scheduler? Icon
    Outgrown Windows Task Scheduler?

    Free diagnostic identifies where your workflow is breaking down—with instant analysis of your scheduling environment.

    Windows Task Scheduler wasn't built for complex, cross-platform automation. Get a free diagnostic that shows exactly where things are failing and provides remediation recommendations. Interactive HTML report delivered in minutes.
    Download Free Tool
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    ...Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip install SpeechRecognition. The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. ...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    Buzz

    Buzz

    Transcribe and translate audio offline on your personal computer

    Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API. Get linux versions from: - https://flathub.org/apps/io.github.chidiwilliams.Buzz - https://snapcraft.io/buzz Home page of Buzz https://github.com/chidiwilliams/buzz Note for Windows: App is not signed, you will get a warning when you install it. ...
    Leader badge
    Downloads: 6,217 This Week
    Last Update:
    See Project
  • 4
    Simple TTS Reader

    Simple TTS Reader

    A small clipboard reader

    Simple TTS Reader is a small utility that reads text from your clipboard using Microsoft Speech API. Whenever you copy any text, the app instantly converts it into spoken words. Select your preferred speech engine from those installed on your system, such as Microsoft Zira, and adjust speed and volume for personalized playback. The application can also be minimized to the system tray. Plus, it is free and comes with an intuitive interface that makes it accessible to everyone.
    Leader badge
    Downloads: 97 This Week
    Last Update:
    See Project
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 5

    Omilo - a text to speech application

    Omilo is a simple text to speech application

    Omilo is a simple text to speech application for Windows and Linux using Festival, Flite, Marytts and Piper voices.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    radio_vocoder_FFT

    radio_vocoder_FFT

    a vocoder + equalizer + FFT effects version of radio_chung

    radio vocoder chung is a vocoder + linear equalizer(s) + FFT effect(s) version of radio chung free internet web radio stream url and audio file generic path player ( * ,mp3,*name*.ogg,wav,...) with dsp(s) (baxandall , resonance , automod , decay , flat , noisered , speed , feedback ) using bass.dll , gui_chung , FFTdll.dll fft fast fourier transform and freebasic .high quality small pitch shift shifting for radio url . added record, playrec, save as MP3 , feedback , anticlick .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 8
    XZVoice

    XZVoice

    Free and open source text-to-speech software

    ...Technically, multi-level rhythmic pauses are taken into account to achieve the purpose of natural synthesizing rhythm, and comprehensively use acoustic parameters and linguistic parameters to establish multiple automatic prediction models based on deep learning. Using massive audio data to train the pronunciation model, the synthetic sound is real, full, cadenced, and expressive, and the MOS score has reached the professional level in the industry.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    annyang!

    annyang!

    Speech recognition for your site

    ...Grab the latest version of annyang.min.js, drop it in your html, and start adding commands. You can easily add a GUI for the user to interact with Speech Recognition using Speech KITT. Speech KITT is fully customizable and comes with many different themes, and instructions on how to create your own designs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 10
    FM2TXT

    FM2TXT

    RtlSdr listen to radio, recognize audio, and writes text file log

    Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11

    eNTranslator

    To aid translation of satsangs of Paramhamsa Nithyananda

    ...Can be used for general purpose by others as well. This translator desktop app uses google translator to translate English text. The auto generated translations are then enriched with human alternation using an easy graphical user interface. Time stamp information may be synched and a subtitle file or a simple textual output may be generated. Additionally it is planned to use google voice tools to also add voice over from these translated text. Finally the subtitle, translated audio (if any) would be muxed with the original video and uploaded.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    SubsHub
    SubsHub is a subtitle Downloader. It can be used to download subtitles of multiple video files in a single instance.You can download subtitles in multiple languages of movies and sitcoms using this application.This software is written in C#.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13

    Steel TTS

    A cross-platform wrapper for common text-to-speech engines in Python

    Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python. Steel currently supports the following TTS software: - Microsoft Speech API 5 (SAPI5) - eSpeak - NS Speech Synthesis - FreeTTS Documentation: http://sourceforge.net/p/steeltts/wiki/ Bug Tracker: http://sourceforge.net/p/steeltts/tickets/ If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead developer, Jasper Danielson, at jrd4@rice.edu.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    de-ess

    de-ess

    De-essing software to reduce sibilance in speech using TSP

    This de-esser uses a novel approach called Temporal Sibilance Processing. The idea is to distinguish between fricatives and voiced sections of the speech signal by the number of zero crossings in time. Most of the speech file is left untouched (the samples are directly copied from source to destination). Only fricatives that are long enough and loud enough are filtered. The advantage of this approach over traditional approaches is that the clarity of the remaining speech is completely unaffected.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    HMM Speech Recognition in Matlab

    A speech recognition system using Matlab/Simulink/Stateflow.

    This project provide hidden Markov model speech recognition system by using Matlab/Simulink/Stateflow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Voice keyboard/dictation. Aims to be a total substitute for a keyboard. Spell out words letter by letter (using code: alpha, bravo, ..). Arrow keys, modifiers work. Speak whole words (but whole word accuracy is not good). Attach commands to some word
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Osmosis TTS

    Osmosis TTS

    Text to Speech application with searching capabilities.

    Osmosis TTS is a Text-to-Speech application with a built in browser and the ability to easily search for terms in the text using configurable search providers including search engines and dictionaries. It is particularly useful for language learning as one can easily search for foreign words using external dictionary websites. Text is spoken by copying text to the clipboard, and text can be queued up by continuing to copy new text to the clipboard while Osmosis TTS is speaking. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18

    Speaking gmail checker

    Python + espeak gmail checker

    ...Features: - logs into, and checks your gmail accounts - if you have unread email: - textual info about it appears in console/terminal - "you have X unread email" is spoken using espeak - if you have no unread email: - textual info about it appears in console/terminal Requirements: - POSIX OS - python: import imaplib import os import time - espeak - soundcard, speakers/headphones Installation/running: - modify gmail.py inserting your username and password - modify gmail.py inserting desired delay value in miliseconds - run: python gmail.py Author: pkrawczak@gmail.com 2013-02-06 13:54:51 Caution: Even though this program uses SSL use only at your own risk.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ClassleSoft Text to Mp3 Converter 2.0

    ClassleSoft Text to Mp3 Converter 2.0

    Convert your text files in to audio books

    ...You can able to make your very own audio books very easily. It has Simple interface and occupies less memory space. You can batch convert more than one text files. Using this time saving software, even large numbers of files can be handled with one click.You can adjust the volume, speed and pitch of the recording voice. And more over its a FREEWARE from Classlesoft. You can also use spoken audio files as background sounds on your Web site, in presentations or in personalized e-mails.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    My Voice Commander

    My Voice Commander

    Control few of your PC tasks using your voice on Win7 PC

    How can i use this program? 1- Execute media files, start or stop your webcam, Capture images via webcam, browse webcam captured images, execute file or folders and even more, using your voice. 2- GoodDay Caller Feature [ A User welcome approach ]. (GoodDay Caller says Good Morning, Good Afternoon, Good Evening, Good Night to user)
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    JListenToMe lets you take control of eSpeak so comfortable, thanks to its friendly graphical interface and easy configuration. It is too easy and the best, is free software!.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    VEDICS
    VEDICS (Voice Enabled Desktop Interaction and Control System) is an assistive software which lets the user to interact with the OS using voice commands. Using this software the user can access any element found on the user's screen.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    ...It includes an MRCPv2 Server with ASR and TTS engines as well as an voiceXML interpreter so that you can deploy and run voiceXML applications. It integrates with VOIP PBX’s (like Asterisk) using SIP and RTP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A phrase to phoneme code converter for the SpeakJet chip by Magnevation. Speakalator runs on Unix type operating systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A JNI wrapper for pjsip. You can use this wrapper to develop Java applications using the pjsip library. At the moment only the pjsua API is implemented. If you would like to obtain a commercial license, or need customisations, please contact us.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next