whisper-windows free download

Showing 145502 open source projects for "whisper-windows"

View related business solutions

Mac Clear Filters & Widen Search

Add Two Lines of Code. Get Full APM.
AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.

Start Free
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
1

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection.

Downloads: 80 This Week

Last Update: 2025-06-26
See Project
2

Whisper-WebUI

A Web UI for easy subtitle using whisper model

Whisper WebUI is an open-source browser-based interface that simplifies the use of Whisper speech recognition models by providing an intuitive graphical environment for transcription, translation, and subtitle generation. Built with Gradio, it allows users to upload audio or video files, process them locally, and generate accurate text outputs without relying on command-line tools.

Downloads: 8 This Week

Last Update: 2026-03-18
See Project
3

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper.cpp is a lightweight, C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model—designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples. whisper.cpp supports integer quantization of the Whisper ggml models. ...

Downloads: 391 This Week

Last Update: 2026-03-19
See Project
4

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps

Multilingual Automatic Speech Recognition with word-level timestamps and confidence. Whisper is a set of multi-lingual, robust speech recognition models trained by OpenAI that achieve state-of-the-art results in many languages. Whisper models were trained to predict approximate timestamps on speech segments (most of the time with 1-second accuracy), but they cannot originally predict word timestamps. This repository proposes an implementation to predict word timestamps and provide a more accurate estimation of speech segments when transcribing with Whisper models. ...

Downloads: 1 This Week

Last Update: 2025-09-09
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
5

WhisperLive

A nearly-live implementation of OpenAI's Whisper

WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently.

Downloads: 15 This Week

Last Update: 2026-03-17
See Project
6

Voice-Pro

Comprehensive Gradio WebUI for audio processing

Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.

1 Review

Downloads: 58 This Week

Last Update: 2025-12-05
See Project
7

Windows (Dockur)

Windows inside a Docker container

This project makes it surprisingly easy to run a Windows environment inside a Docker container by using a QEMU-based virtual machine under the hood. It provides a turnkey image and a simple set of environment variables so you can select Windows editions, control disk persistence, and access the VM via a web-based VNC console or similar remote viewers. Because the VM is wrapped in Docker, you can treat Windows as a disposable, repeatable service: create, snapshot with volumes, tear down, and rebuild with consistent results. ...

Downloads: 53 This Week

Last Update: 2025-11-22
See Project
8

Go OpenAI

OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go

This library provides Go clients for OpenAI API. OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go.

Downloads: 0 This Week

Last Update: 2025-08-29
See Project
9

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper

WhisperSpeech is an open-source text-to-speech system created by “inverting” OpenAI’s Whisper, reusing its strengths as a semantic audio model to generate speech instead of only transcribing it. The project aims to be for speech what Stable Diffusion is for images: powerful, hackable, and safe for commercial use, with code under Apache-2.0/MIT and models trained only on properly licensed data. Its architecture follows a token-based, multi-stage pipeline inspired by AudioLM and SPEAR-TTS: Whisper is used to produce semantic tokens, EnCodec compresses the waveform into acoustic tokens, and Vocos reconstructs high-fidelity audio from those tokens. ...

Downloads: 3 This Week

Last Update: 2025-11-28
See Project
Auth0 B2B Essentials: SSO, MFA, and RBAC Built In
Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.

Sign Up Free
10

Meetily

Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper

...It’s built for organizations that want meeting intelligence without sending recordings or transcripts to third-party cloud services, which helps address compliance and data sovereignty requirements. The app supports live transcription with local model options (including Whisper- and Parakeet-based workflows) and presents the transcript as the meeting happens, making it useful both for note-taking and accessibility. After or during the session, it can produce structured, AI-generated summaries, and it’s designed to be flexible about where that summarization comes from, supporting local providers as well as external endpoints when allowed by policy.

Downloads: 27 This Week

Last Update: 2026-03-03
See Project
11

mpv.net for Windows

mpv.net is a media player for Windows with a modern GUI

mpv.net is a media player for Windows with a modern GUI. The player is based on the popular mpv media player. mpv.net is designed to be mpv compatible, and almost all mpv features are available, this means the official mpv manual applies to mpv.net, differences are documented in the mpv.net manual.

Downloads: 23 This Week

Last Update: 2026-01-09
See Project
12

WhisperJAV

Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD

...WhisperJAV introduces a specialized pipeline that separates text generation from timestamp alignment, allowing the system to generate transcripts and then align them with audio using forced alignment techniques. The framework supports several speech recognition models, including Qwen-based ASR systems and fine-tuned Whisper models trained on domain-specific dialogue.

Downloads: 21 This Week

Last Update: 5 days ago
See Project
13

WhisperKit

On-device Speech Recognition for Apple Silicon

WhisperKit is a Swift package that integrates OpenAI's popular Whisper speech recognition model with Apple's CoreML framework for efficient, local inference on Apple devices. Whisper has pulled the future forward when fast, free and virtually error-free translation and transcription will be ubiquitous. It inspired numerous developers to improve and deploy it with minimal friction and maximum performance.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
14

Windows 95 in Electron

Windows 95 in Electron. Runs on macOS, Linux, and Windows

windows95 in Electron is a quirky, nostalgia-driven project that ports a complete emulation of Microsoft Windows 95 into a desktop application that runs on modern operating systems like Windows, macOS, and Linux. It essentially wraps an emulator — preconfigured to boot Windows 95 — inside a user-friendly app built with web technologies so developers and hobbyists can explore the look and feel of that classic OS without needing legacy hardware or complicated setup. ...

Downloads: 71 This Week

Last Update: 2026-04-13
See Project
15

Handy STT

A free, open source, and extensible speech-to-text application

Handy is a free, open-source, offline speech-to-text application built for privacy, accessibility, and extensibility. Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active text field. Its backend leverages OpenAI’s Whisper models for GPU-accelerated speech recognition and Parakeet V3 for efficient CPU-only transcription with automatic language detection. ...

Downloads: 72 This Week

Last Update: 2026-04-27
See Project
16

Buzz

Transcribe and translate audio offline on your personal computer

Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API. Get linux versions from: - https://flathub.org/apps/io.github.chidiwilliams.Buzz - https://snapcraft.io/buzz Home page of Buzz https://github.com/chidiwilliams/buzz Note for Windows: App is not signed, you will get a warning when you install it. ...

1 Review

Downloads: 8,806 This Week

Last Update: 2026-03-14
See Project
17

Windows 11 Guide

Windows Security tools, Encryption, Nextcloud, Graphics, Gaming

Windows 11 Guide is a comprehensive, single-repository handbook for setting up and tuning a Windows 11 (and often Windows 10) desktop for real day-to-day use, covering everything from basic configuration to advanced development and gaming scenarios. It walks users through installing essential applications, configuring Windows security tools, setting up encryption and backups, and integrating productivity services like Nextcloud.

Downloads: 33 This Week

Last Update: 2025-12-11
See Project
18

React Native for Windows

A framework for building native Windows apps with React

Bring your React Native apps to some of the most powerful devices out there. Take your apps across PC, Xbox, Surface Tablets, and dual-screens with our robust Windows extension to React Native. Extend your desktop experience to more than just Windows! Try out our fully supported macOS extension to React Native. React Native for Windows + macOS brings React Native support for the Windows 10 SDK as well as the macOS 10.13 SDK. With this, you can use JavaScript to build native Windows apps for all devices supported by Windows 10 including PCs, tablets, 2-in-1s, Xbox, Mixed reality devices, etc., as well as the macOS desktop and laptop ecosystems. ...

Downloads: 33 This Week

Last Update: 2026-03-17
See Project
19

Whisper-Studio

Another whisper wrapper, built fully in C++, with some neat features.

a native lightweight C++ application for OpenAI's Whisper, with a few new things like transcribing audio in real-time, identifying speakers, auto-paste transcriptions, and a few other things. Its not the prettiest app, I suck at design, but it gets the job done.

Downloads: 1 This Week

Last Update: 2026-02-07
See Project
20

swords for whisper

Downloads: 1 This Week

Last Update: 2025-12-15
See Project
21

HeartMuLa

A Family of Open Sourced Music Foundation Models

...The project also includes HeartCodec, a music codec optimized for high reconstruction fidelity, enabling efficient tokenization and reconstruction workflows that are critical for training and generation pipelines. For text extraction from audio, it provides HeartTranscriptor, a Whisper-based model tuned specifically for lyrics transcription, which helps bridge generated or recorded audio back into structured text. It also introduces HeartCLAP, which aligns audio and text into a shared embedding space.

Downloads: 12 This Week

Last Update: 2026-04-10
See Project
22

AutoCut

Cut videos with a text editor

...This approach transforms video editing into a textual editing task, greatly lowering the barrier to editing for users who find traditional video editors complex or unintuitive. AutoCut supports multiple transcription backends, including Whisper and faster-whisper modes, allowing users to choose based on speed or accuracy needs. After editing the transcript text, the corresponding video clips are merged into the final output, and the tool also produces matching subtitle files. Its command-line interface can be integrated into scripts, making it suitable for automated workflows or batch processing.

Downloads: 3 This Week

Last Update: 2026-02-06
See Project
23

Scriberr

Self-hosted AI audio transcription

...Unlike cloud-based transcription services, Scriberr runs entirely on the user’s machine, ensuring that sensitive recordings are never sent to third-party servers and remain fully under user control. It leverages modern speech recognition models such as Whisper and other advanced architectures to deliver precise transcripts with word-level timing and speaker identification. The application includes a polished user interface that simplifies the management of recordings, transcripts, and annotations, making it suitable for both casual users and professionals handling large volumes of audio. ...

Downloads: 7 This Week

Last Update: 2026-03-19
See Project
24

Hyprnote

Local-first AI Notepad for Private Meetings

Hyprnote is an open-source, privacy-first AI notepad app designed for taking notes during meetings—transcribing audio (microphone and system) and generating context-rich summaries using on-device AI models like Whisper and HyprLLM, all without any data leaving your machine.(turn0search7, turn0search1). Listens to your meetings while you write. Crafts smart summaries based on your quick notes. Runs completely offline using open-source models like Whisper or HyprLLM. Use approved third-party APIs like Gemini, Claude, or Azure-hosted GPT.

Downloads: 7 This Week

Last Update: 2026-04-17
See Project
25

Windows Presentation Foundation (WPF)

Framework for building Windows desktop applications

...The visual designer provided in Visual Studio makes it easy to build WPF application, with drag-in-drop and/or direct editing of XAML markup. WinForms is another UI framework for building Windows desktop applications that is supported on .NET (6.0.x/5.0.x/3.1.x). WPF and WinForms applications only run on Windows.

Downloads: 5 This Week

Last Update: 2026-02-12
See Project