recognition free download

Showing 15 open source projects for "recognition"

View related business solutions

Video Mac Clear Filters & Widen Search

Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
1

Textream

Textream is a free macOS teleprompter app for streamers, interviewers

Textream is an open-source, free macOS teleprompter application designed for streamers, podcasters, presenters, and interviewers who want a smooth, distraction-free way to stay on script. It runs natively on macOS and leverages on-device speech recognition to highlight each word in real time as you speak, keeping your focus where it belongs — on delivery rather than memorization. The interface supports multiple modes of use, such as classic constant-scroll auto-scrolling, voice-activated scrolling that pauses when you’re silent, and direct word tracking that syncs the displayed script to your spoken pace. ...

Downloads: 37 This Week

Last Update: 2026-05-08
See Project
2

Windrecorder

Windrecorder is a memory search app by records everything

Windrecorder is an open-source personal memory search engine that continuously records on-screen activity in a highly optimized and storage-efficient format. It captures screen content locally and builds a searchable database using OCR and image understanding, allowing users to rewind and rediscover anything they have previously seen. The system indexes only meaningful visual changes, extracting text, browser data, and contextual information to improve search accuracy and reduce storage...

Downloads: 4 This Week

Last Update: 2026-04-24
See Project
3

Google2SRT

Download, save and convert multiple subtitles from YouTube videos

Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type video's URL, Google2SRT will do the rest.

33 Reviews

Downloads: 26 This Week

Last Update: 2025-01-11
See Project
4

The Hear

The Hear program is made for journalists.

To transcribe audio, the app uses the built-in speech recognition features of macOS. Turn your audio and video files into text You can change the font size by pressing the command and +/- keys. The font size is saved during further use. A folder "Hear" with text files is created on the desktop. The program is universal - arm64/x86_64 You can ask questions here https://sourceforge.net/p/the-hear/discussion

Downloads: 0 This Week

Last Update: 2026-02-24
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

Jvedio

Jvedio is a local video management software

...The software supports tagging, filtering, and advanced search, enabling users to manage large collections efficiently. It integrates AI-based features such as actor recognition and translation of metadata, improving the usability and accessibility of stored content. Jvedio also includes media processing tools powered by FFmpeg, allowing users to generate screenshots and GIF previews directly from videos. Its plugin system enables customization through themes and synchronization tools, while its modern interface provides a smooth user experience. ...

Downloads: 5 This Week

Last Update: 2026-04-24
See Project
6

auto-subtitle

Automatically generate and overlay subtitles for any video

auto-subtitle is a Python-based command-line tool that automatically generates and overlays subtitles on video files using AI-driven speech recognition. It combines FFmpeg with OpenAI’s Whisper model to transcribe spoken audio into text and synchronize it with video playback. The tool processes video input, extracts audio, and produces subtitle files that can be either exported separately or burned directly into the final video output. It supports multiple transcription models with varying accuracy and performance, allowing users to balance speed and quality depending on their needs. ...

Downloads: 2 This Week

Last Update: 2026-04-24
See Project
7

AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video

...AutoSub leverages FFmpeg for media handling and integrates with speech recognition engines for transcription. It is particularly useful for content creators who want to quickly produce subtitles without manual effort. Overall, it simplifies the process of making media content accessible and searchable.

Downloads: 10 This Week

Last Update: 2026-04-28
See Project
8

TimeSformer

The official pytorch implementation of our paper

TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
9

OpenFace

A state-of-the-art facial behavior analysis toolkit

...The OpenFace toolkit is capable of performing several complex facial analysis tasks, including facial landmark detection, eye-gaze estimation, head pose estimation and facial action unit recognition. OpenFace is able to deliver state-of-the-art results in all of these mentioned tasks. OpenFace is available for Windows, Ubuntu and macOS installations. It is capable of real-time performance and does not need to run on any specialist hardware, a simple webcam will suffice.

Downloads: 18 This Week

Last Update: 2023-11-30
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

Video Nonlocal Net

Non-local Neural Networks for Video Classification

...Non-local blocks compute attention-like responses across all positions in space-time, allowing a feature at one frame and location to aggregate information from distant frames and regions. This formulation improves action recognition and spatiotemporal reasoning, especially for classes requiring context beyond short temporal windows. The repo provides training recipes and models for standard datasets, as well as ablations that show how many non-local blocks to insert and at which stages. Efficient implementations keep memory and compute manageable so the blocks can be added without rewriting the entire backbone. ...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
11

Ivolution

Timelapse creation using Face Recognition

Ivolution is a face timelapse generator. Feed it with a bunch of images and it will generate a movie with your face centered on the screen. Ivolution uses face detection and modifies the images so that your face always keeps the same size and location over the movie. Images are processed in chronological order, so that you can see your face evoluate over time !

Downloads: 0 This Week

Last Update: 2012-09-18
See Project
12

asmlibrary/aamlibrary

Active Shape/Appearance Model Library (ASMLibrary/AAMLibrary) source code, which includes ASMBuilding/AAMBuilding as well as ASMFitting/AAMFitting algorithm. It is developped under OpenCV 1.0 for locating features in a face and face recognition.

Downloads: 0 This Week

Last Update: 2014-06-29
See Project
13

VisAmp

VisAmp is a visually controlled mp3 player. It was initially developed during the "Softwarepraktikum" at the Chair for Image Processing and Pattern Recognition of the University of Freiburg, Germany in 2001.

Downloads: 0 This Week

Last Update: 2016-11-13
See Project
14

MRCP4J

The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.

Downloads: 0 This Week

Last Update: 2013-04-25
See Project
15

Cocoa Barcode Recognition

Cocoa Barcode Scanner Software. The software is an Xcode project written in Cocoa and Carbon that detects barcodes from images streaming from a firewire camera. Mac OS X only.

1 Review

Downloads: 0 This Week

Last Update: 2013-03-22
See Project