recognition free download

Showing 31 open source projects for "recognition"

1

Video-subtitle-extractor

A GUI tool for extracting hard-coded subtitles (hardsubs) from videos

Extracts hard-coded subtitles (hardsubs) from videos and generates SRT files. A deep learning-based framework handles both subtitle region detection and subtitle content extraction. Text recognition runs locally with a built-in OCR engine, so there is no need to apply for a third-party API or to call online OCR services such as Baidu or Ali. ...

1 Review

Downloads: 52 This Week

Last Update: 2026-04-05
See Project
2

Textream

Textream is a free macOS teleprompter app for streamers, interviewers

Textream is an open-source, free macOS teleprompter application designed for streamers, podcasters, presenters, and interviewers who want a smooth, distraction-free way to stay on script. It runs natively on macOS and leverages on-device speech recognition to highlight each word in real time as you speak, keeping your focus where it belongs — on delivery rather than memorization. The interface supports multiple modes of use, such as classic constant-scroll auto-scrolling, voice-activated scrolling that pauses when you’re silent, and direct word tracking that syncs the displayed script to your spoken pace. ...

Downloads: 37 This Week

Last Update: 2026-05-08
See Project
3

Windrecorder

Windrecorder is a memory search app that records everything

Windrecorder is an open-source personal memory search engine that continuously records on-screen activity in a highly optimized and storage-efficient format. It captures screen content locally and builds a searchable database using OCR and image understanding, allowing users to rewind and rediscover anything they have previously seen. The system indexes only meaningful visual changes, extracting text, browser data, and contextual information to improve search accuracy and reduce storage...

Downloads: 4 This Week

Last Update: 2026-04-24
See Project
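The Windrecorder description mentions indexing only meaningful visual changes. A minimal sketch of that general idea follows, assuming frames are flat lists of grayscale pixel values and using a hypothetical mean-absolute-difference threshold; it is not Windrecorder's actual method, which the listing does not describe.

```python
def mean_abs_diff(frame_a, frame_b):
    """Average absolute per-pixel difference between two equally sized grayscale frames."""
    return sum(abs(a - b) for a, b in zip(frame_a, frame_b)) / len(frame_a)


def select_changed_frames(frames, threshold=10.0):
    """Return indices of frames that differ enough from the last kept frame.

    The threshold value is a hypothetical default chosen for illustration.
    """
    kept = []
    last_kept = None
    for index, frame in enumerate(frames):
        if last_kept is None or mean_abs_diff(frame, last_kept) >= threshold:
            kept.append(index)
            last_kept = frame
    return kept


frames = [
    [0, 0, 0, 0],
    [0, 1, 0, 0],      # nearly identical to frame 0, skipped
    [200, 200, 0, 0],  # large change, kept
    [200, 200, 0, 5],  # nearly identical to frame 2, skipped
]
print(select_changed_frames(frames))  # [0, 2]
```

Comparing against the last kept frame, rather than the immediately preceding one, keeps slow gradual drift from being ignored indefinitely.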
4

Google2SRT

Download, save and convert multiple subtitles from YouTube videos

Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply enter the video's URL, and Google2SRT will do the rest.

33 Reviews

Downloads: 26 This Week

Last Update: 2025-01-11
See Project
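The XML-to-SubRip conversion that Google2SRT describes can be sketched roughly as below. The XML layout used here (a `transcript` root with `text` elements carrying `start` and `dur` attributes in seconds) is an assumption for illustration, and the function names are hypothetical; this is not Google2SRT's own code.

```python
import xml.etree.ElementTree as ET


def srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def xml_to_srt(xml_text):
    """Convert the assumed timed-text XML layout into SRT text."""
    root = ET.fromstring(xml_text)
    blocks = []
    for number, node in enumerate(root.iter("text"), start=1):
        start = float(node.get("start", "0"))
        end = start + float(node.get("dur", "0"))
        text = (node.text or "").strip()
        blocks.append(f"{number}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n")
    # SRT cues are separated by a blank line.
    return "\n".join(blocks)


sample = (
    '<transcript>'
    '<text start="0.5" dur="2.25">Hello</text>'
    '<text start="3661.0" dur="1.5">Bye</text>'
    '</transcript>'
)
print(xml_to_srt(sample))
```

Each SRT cue is a sequence number, a `start --> end` line with comma-separated milliseconds, the text, and a blank separator line.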
5

SmartVision

Free video surveillance software compatible with Windows

...In emergencies, it automatically initiates recording, preserving crucial video footage as evidence. The system offers features such as motion detection, object detection, face recognition, automatic license plate recognition (ALPR), fire and dust detection, and is integrated with cloud services.

2 Reviews

Downloads: 0 This Week

Last Update: 2024-10-09
See Project
6

Jvedio

Jvedio is a local video management software

...The software supports tagging, filtering, and advanced search, enabling users to manage large collections efficiently. It integrates AI-based features such as actor recognition and translation of metadata, improving the usability and accessibility of stored content. Jvedio also includes media processing tools powered by FFmpeg, allowing users to generate screenshots and GIF previews directly from videos. Its plugin system enables customization through themes and synchronization tools, while its modern interface provides a smooth user experience. ...

Downloads: 5 This Week

Last Update: 2026-04-24
See Project
7

auto-subtitle

Automatically generate and overlay subtitles for any video

auto-subtitle is a Python-based command-line tool that automatically generates and overlays subtitles on video files using AI-driven speech recognition. It combines FFmpeg with OpenAI’s Whisper model to transcribe spoken audio into text and synchronize it with video playback. The tool processes video input, extracts audio, and produces subtitle files that can be either exported separately or burned directly into the final video output. It supports multiple transcription models with varying accuracy and performance, allowing users to balance speed and quality depending on their needs. ...

Downloads: 2 This Week

Last Update: 2026-04-24
See Project
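The two FFmpeg steps in the auto-subtitle description (extracting audio, then burning subtitles into the video) could look roughly like the command lines built below. The function names and the 16 kHz mono WAV settings are assumptions for illustration, the transcription step is omitted, and the tool's real invocation may differ. The `subtitles` filter requires an FFmpeg build with libass.

```python
def extract_audio_command(video_path, audio_path):
    """Build an ffmpeg command that writes the audio track as 16 kHz mono PCM WAV."""
    return [
        "ffmpeg", "-y",          # overwrite the output file if it exists
        "-i", video_path,
        "-vn",                   # drop the video stream
        "-ac", "1",              # downmix to one channel
        "-ar", "16000",          # resample to 16 kHz
        "-c:a", "pcm_s16le",     # 16-bit PCM
        audio_path,
    ]


def burn_subtitles_command(video_path, srt_path, output_path):
    """Build an ffmpeg command that renders an SRT file onto the video frames."""
    return [
        "ffmpeg", "-y",
        "-i", video_path,
        "-vf", f"subtitles={srt_path}",  # hard-codes the subtitles into the picture
        "-c:a", "copy",                  # keep the original audio untouched
        output_path,
    ]


print(" ".join(extract_audio_command("talk.mp4", "talk.wav")))
print(" ".join(burn_subtitles_command("talk.mp4", "talk.srt", "talk_subbed.mp4")))
```

The lists can be passed to `subprocess.run(cmd, check=True)`. Paths containing characters that are special to FFmpeg's filter syntax (such as `:` or `'`) would need escaping in the `subtitles=` argument.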
8

VATSG

Video automatic transcribe and translated subtitle generator

...VATSG is a subtitle generator that uses [moviepy](https://github.com/Zulko/moviepy) to extract MP3 audio, [faster-whisper](https://github.com/guillaumekln/faster-whisper) to transcribe it, and the DeepL API to produce a subtitle file (SRT format) in your target language. It is aimed at general users who want to follow any video or MP3 file in their own language, and its simple, intuitive GUI makes it easy to use. Downloads are available either as a Windows installer or as a version that needs no installation. ...

Downloads: 1 This Week

Last Update: 2023-09-19
See Project
9

Automatic YouTube subtitle generation

Using OpenAI's Whisper to automatically generate YouTube subtitles

...It allows users to download videos or audio from YouTube and automatically generate subtitles or transcripts. The tool processes media locally, extracting audio and applying speech recognition to produce accurate text outputs. It supports multiple languages and can handle different Whisper model sizes, balancing performance and accuracy. yt-whisper is designed for automation, enabling batch processing of multiple videos for transcription workflows. It also provides options for exporting subtitles in common formats such as SRT. ...

Downloads: 0 This Week

Last Update: 2026-04-24
See Project
10

AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video

...AutoSub leverages FFmpeg for media handling and integrates with speech recognition engines for transcription. It is particularly useful for content creators who want to quickly produce subtitles without manual effort. Overall, it simplifies the process of making media content accessible and searchable.

Downloads: 10 This Week

Last Update: 2026-04-28
See Project
11

TimeSformer

The official PyTorch implementation of the TimeSformer paper

TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
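The divided attention variant mentioned in the TimeSformer description alternates a temporal step and a spatial step. The pure-Python sketch below illustrates only that ordering: it assumes identity query/key/value projections and omits the residual connections, learned weights, and classification token of the real model, so it is a toy illustration rather than the official implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, context):
    """Scaled dot-product attention of one query vector over a list of context vectors."""
    scale = math.sqrt(len(query))
    weights = softmax([sum(q * c for q, c in zip(query, vec)) / scale for vec in context])
    return [sum(w * vec[d] for w, vec in zip(weights, context)) for d in range(len(query))]


def divided_space_time_attention(video):
    """video[t][s] is the feature vector of patch s in frame t."""
    num_frames, num_patches = len(video), len(video[0])
    # Temporal step: each patch attends to the same patch position in every frame.
    temporal = [
        [attend(video[t][s], [video[u][s] for u in range(num_frames)])
         for s in range(num_patches)]
        for t in range(num_frames)
    ]
    # Spatial step: each patch attends to all patches within its own frame.
    return [
        [attend(temporal[t][s], temporal[t]) for s in range(num_patches)]
        for t in range(num_frames)
    ]


video = [[[1.0, 0.0], [0.0, 1.0]], [[0.5, 0.5], [1.0, 1.0]]]  # 2 frames, 2 patches, 2 features
output = divided_space_time_attention(video)
print(len(output), len(output[0]), len(output[0][0]))  # 2 2 2
```

With T frames and S patches per frame, each token compares against T + S others across the two steps instead of T × S in joint space-time attention, which is the usual motivation for splitting the attention this way.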
12

Polarity Browser

A fast, secure, stable web browser powered by Chromium and Trident.

Polarity is a dual-engine browser powered by both Chromium and Trident that focuses on system efficiency, with low RAM and CPU usage. The browser is optimized for Windows, and an Android version is also available. Browse the web with an uncomplicated UI that is highly customizable with themes, apps, and extensions from the Polarity Store, GreasyFork, OpenUserJS, and UserStyles. Polarity also comes with a built-in password manager which safely encrypts all data to...

9 Reviews

Downloads: 71 This Week

Last Update: 2021-03-14
See Project
13

RAVL, Recognition And Vision Library.

General C++ Library, with modules for Computer Vision, Pattern Recognition and much more.

Downloads: 3 This Week

Last Update: 2020-04-22
See Project
14

OpenFace

A state-of-the-art facial behavior analysis toolkit

...The OpenFace toolkit is capable of performing several complex facial analysis tasks, including facial landmark detection, eye-gaze estimation, head pose estimation and facial action unit recognition. OpenFace is able to deliver state-of-the-art results in all of these tasks. OpenFace is available for Windows, Ubuntu and macOS installations. It is capable of real-time performance and does not need to run on any specialist hardware; a simple webcam will suffice.

Downloads: 18 This Week

Last Update: 2023-11-30
See Project
15

Video Nonlocal Net

Non-local Neural Networks for Video Classification

...Non-local blocks compute attention-like responses across all positions in space-time, allowing a feature at one frame and location to aggregate information from distant frames and regions. This formulation improves action recognition and spatiotemporal reasoning, especially for classes requiring context beyond short temporal windows. The repo provides training recipes and models for standard datasets, as well as ablations that show how many non-local blocks to insert and at which stages. Efficient implementations keep memory and compute manageable so the blocks can be added without rewriting the entire backbone. ...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
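For reference, the non-local operation that the Video Nonlocal Net description summarizes is commonly written as follows (notation as in the Non-local Neural Networks paper, where i and j index space-time positions, f is a pairwise affinity, g a feature embedding, and C(x) a normalization factor). The second line is the residual form that lets a block be inserted into an existing backbone, and the third is the embedded-Gaussian choice of f, one of several variants.

```latex
\begin{aligned}
y_i &= \frac{1}{\mathcal{C}(x)} \sum_{\forall j} f(x_i, x_j)\, g(x_j), \\
z_i &= W_z\, y_i + x_i, \\
f(x_i, x_j) &= e^{\theta(x_i)^{\top} \phi(x_j)}, \qquad \mathcal{C}(x) = \sum_{\forall j} f(x_i, x_j).
\end{aligned}
```

With the embedded-Gaussian f, the normalized weights reduce to a softmax over all positions j, which is why the responses are described as attention-like.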
16

SegmentDisplayOCR

Seven-segment display recognition filter for AviSynth

SegmentDisplayOCR is a seven-segment display recognition filter for AviSynth. It has built-in logging (it logs frame recognition results) and can also be used in AviSynth conditional filters. The main purpose of this filter is to process readings of various digital instruments (e.g. digital multimeters) captured on video. So if your favourite instrument lacks an interface for connecting it to a PC, you can capture its readings on camera and convert them to a computer-readable format with the SegmentDisplayOCR filter.

Downloads: 0 This Week

Last Update: 2014-08-26
See Project
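The final step of seven-segment recognition, turning a set of lit segments into a digit, can be sketched as a lookup table. This assumes the conventional segment labels (a = top, b = top right, c = bottom right, d = bottom, e = bottom left, f = top left, g = middle) and that an earlier stage has already decided which segments are lit; it is an illustration only, not the filter's actual code.

```python
# Lit segments for each digit, using the conventional a-g labels.
SEGMENTS_TO_DIGIT = {
    frozenset("abcdef"): "0",
    frozenset("bc"): "1",
    frozenset("abdeg"): "2",
    frozenset("abcdg"): "3",
    frozenset("bcfg"): "4",
    frozenset("acdfg"): "5",
    frozenset("acdefg"): "6",
    frozenset("abc"): "7",
    frozenset("abcdefg"): "8",
    frozenset("abcdfg"): "9",
}


def decode_digit(lit_segments):
    """Map a collection of lit segment labels to a digit, or '?' if unrecognized."""
    return SEGMENTS_TO_DIGIT.get(frozenset(lit_segments), "?")


def decode_reading(digits):
    """Decode a left-to-right sequence of per-digit segment sets into a string."""
    return "".join(decode_digit(segments) for segments in digits)


print(decode_reading(["bc", "abdeg", "acdfg"]))  # 125
```

Some displays draw 6, 7, and 9 with one segment more or fewer than shown here, so a real decoder would likely add those variants to the table.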
17

BioSuite Professional

Software that assists with member management and camera management

Manage personnel, employee, and member details. Advanced identification using barcode scanners, smart cards, and fingerprints. Connect to both webcams and internet (IP) cameras in high quality. Face detection and motion detection can trigger an alarm. Use your smartphone as an IP camera. Manage sessions, such as meals and conference entry, using the same identification mechanisms. A blacklist is also available.

Downloads: 0 This Week

Last Update: 2015-04-25
See Project
18

avimmir

(audio, video, image) Multimedia Multimodal Information Retrieval

audio classification; speaker segmentation; speaker clustering; speaker recognition; spoken document retrieval; image retrieval; video retrieval; etc.

Downloads: 0 This Week

Last Update: 2013-11-23
See Project
19

TV Series Browser

Browse, Manage, and Play Television Series media and information

...TV Series Browser is a simple-to-use tool for searching TV series and episode information, actor information, fanart, and banners, and saving that information to your computer. TV Series Browser also uses filename recognition, so dropping in hundreds of video files and having them sorted to their proper episodes automatically is quick and easy. Play a video at any time by double-clicking the desired episode. Rename video files using a standard format which can include Series Name, Episode Title, Season Number, and Episode Number.

Downloads: 0 This Week

Last Update: 2015-06-25
See Project
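Filename recognition of the kind TV Series Browser describes is often done with a regular expression. The sketch below assumes the common `Series.Name.S01E02.Title.ext` naming pattern; the pattern, function names, and rename format are hypothetical and are not taken from the application itself.

```python
import re

# Series name, then SxxEyy, then an optional episode title, then a file extension.
EPISODE_PATTERN = re.compile(
    r"^(?P<series>.+?)[ ._-]+S(?P<season>\d{1,2})E(?P<episode>\d{1,3})"
    r"(?:[ ._-]+(?P<title>.+?))?\.\w+$",
    re.IGNORECASE,
)


def clean(text):
    """Turn dot- or underscore-separated words into a space-separated string."""
    return re.sub(r"[._]+", " ", text).strip()


def parse_episode(filename):
    """Return series, season, episode and title parsed from a filename, or None."""
    match = EPISODE_PATTERN.match(filename)
    if match is None:
        return None
    title = match.group("title")
    return {
        "series": clean(match.group("series")),
        "season": int(match.group("season")),
        "episode": int(match.group("episode")),
        "title": clean(title) if title else None,
    }


def standard_name(info, extension):
    """Build a name from the four fields the listing mentions for renaming."""
    base = f"{info['series']} - S{info['season']:02d}E{info['episode']:02d}"
    if info["title"]:
        base += f" - {info['title']}"
    return f"{base}.{extension}"


info = parse_episode("The.Office.S02E05.Halloween.mkv")
print(info)
print(standard_name(info, "mkv"))  # The Office - S02E05 - Halloween.mkv
```

Files that do not follow the assumed pattern return `None`, so a caller can fall back to manual sorting.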
20

Ivolution

Timelapse creation using Face Recognition

Ivolution is a face timelapse generator. Feed it a set of images and it will generate a movie with your face centered on the screen. Ivolution uses face detection and modifies the images so that your face always keeps the same size and location throughout the movie. Images are processed in chronological order, so you can see your face evolve over time!

Downloads: 0 This Week

Last Update: 2012-09-18
See Project
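Keeping a detected face at a fixed size and location, as Ivolution describes, comes down to a scale and a translation per image. The sketch below assumes the face detector returns a bounding box as `(x, y, width, height)` in pixels; the function names and target values are hypothetical, and Ivolution's own computation may differ.

```python
def alignment_transform(face_box, target_center, target_face_width):
    """Return (scale, dx, dy) mapping the face box onto the target size and position."""
    x, y, width, height = face_box
    scale = target_face_width / width
    face_cx, face_cy = x + width / 2, y + height / 2
    # After scaling, shift so the face center lands on the target center.
    dx = target_center[0] - scale * face_cx
    dy = target_center[1] - scale * face_cy
    return scale, dx, dy


def apply_transform(transform, point):
    """Apply the scale-then-translate transform to a single (x, y) point."""
    scale, dx, dy = transform
    return (scale * point[0] + dx, scale * point[1] + dy)


# A 50-pixel-wide face at (100, 50), to be shown 100 pixels wide at (320, 240).
transform = alignment_transform((100, 50, 50, 50), (320, 240), 100)
print(transform)                              # (2.0, 70.0, 90.0)
print(apply_transform(transform, (125, 75)))  # (320.0, 240.0)
```

Applying the same target center and width to every image is what makes the face appear stationary across frames.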
21

ViAmI-Server

Pattern recognition for ADL events

This software uses computer vision algorithms to mine sequence data from telemonitoring data with CBRs. The approach detects changes in behavior, which occur at radically different time scales, from fused sensor and video data using a CBR at two levels: low and high. The system continuously updates the database with the daily data.

Downloads: 0 This Week

Last Update: 2013-09-15
See Project
22

VideoRecognitionToolkit

Video Recognition Toolkit is software for modeling visual video-processing algorithms. It supports algorithm execution and result display in real time. It includes several new algorithms as well as some algorithms from the OpenCV and LTIlib libraries.

Downloads: 0 This Week

Last Update: 2013-05-02
See Project
23

Oculux - Video Analysis

Oculux is a Video Analysis Software with automatic point recognition intended primarily for Sport Analysis in 2D.

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
24

asmlibrary/aamlibrary

Active Shape/Appearance Model Library (ASMLibrary/AAMLibrary) source code, which includes the ASMBuilding/AAMBuilding and ASMFitting/AAMFitting algorithms. It is developed under OpenCV 1.0 for locating facial features and for face recognition.

Downloads: 0 This Week

Last Update: 2014-06-29
See Project
25

VisAmp

VisAmp is a visually controlled mp3 player. It was initially developed during the "Softwarepraktikum" at the Chair for Image Processing and Pattern Recognition of the University of Freiburg, Germany in 2001.

Downloads: 0 This Week

Last Update: 2016-11-13
See Project