recognition free download

Showing 22 open source projects for "recognition"

View related business solutions

Video Linux Clear Filters & Widen Search

Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
Auth0 B2B Essentials: SSO, MFA, and RBAC Built In
Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.

Sign Up Free
1

Textream

Textream is a free macOS teleprompter app for streamers, interviewers

Textream is an open-source, free macOS teleprompter application designed for streamers, podcasters, presenters, and interviewers who want a smooth, distraction-free way to stay on script. It runs natively on macOS and leverages on-device speech recognition to highlight each word in real time as you speak, keeping your focus where it belongs — on delivery rather than memorization. The interface supports multiple modes of use, such as classic constant-scroll auto-scrolling, voice-activated scrolling that pauses when you’re silent, and direct word tracking that syncs the displayed script to your spoken pace. ...

Downloads: 37 This Week

Last Update: 2026-05-08
See Project
2

Windrecorder

Windrecorder is a memory search app by records everything

Windrecorder is an open-source personal memory search engine that continuously records on-screen activity in a highly optimized and storage-efficient format. It captures screen content locally and builds a searchable database using OCR and image understanding, allowing users to rewind and rediscover anything they have previously seen. The system indexes only meaningful visual changes, extracting text, browser data, and contextual information to improve search accuracy and reduce storage...

Downloads: 4 This Week

Last Update: 2026-04-24
See Project
3

VMZ (Video Model Zoo)

VMZ: Model Zoo for Video Modeling

The codebase was designed to help researchers and practitioners quickly reproduce FAIR’s results and leverage robust pre-trained backbones for downstream tasks. It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
4

Google2SRT

Download, save and convert multiple subtitles from YouTube videos

Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type video's URL, Google2SRT will do the rest.

33 Reviews

Downloads: 26 This Week

Last Update: 2025-01-11
See Project
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
5

Jvedio

Jvedio is a local video management software

...The software supports tagging, filtering, and advanced search, enabling users to manage large collections efficiently. It integrates AI-based features such as actor recognition and translation of metadata, improving the usability and accessibility of stored content. Jvedio also includes media processing tools powered by FFmpeg, allowing users to generate screenshots and GIF previews directly from videos. Its plugin system enables customization through themes and synchronization tools, while its modern interface provides a smooth user experience. ...

Downloads: 5 This Week

Last Update: 2026-04-24
See Project
6

AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video

...AutoSub leverages FFmpeg for media handling and integrates with speech recognition engines for transcription. It is particularly useful for content creators who want to quickly produce subtitles without manual effort. Overall, it simplifies the process of making media content accessible and searchable.

Downloads: 10 This Week

Last Update: 2026-04-28
See Project
7

escom-henoc

Physics Simulation Software based on user sketchs running a pattern recognition agent, this app is able to animate a physics sketch, from a blackboard

Downloads: 0 This Week

Last Update: 2022-04-08
See Project
8

TimeSformer

The official pytorch implementation of our paper

TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
9

RAVL, Recognition And Vision Library.

General C++ Library, with modules for Computer Vision, Pattern Recognition and much more.

Downloads: 3 This Week

Last Update: 2020-04-22
See Project
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
10

OpenFace

A state-of-the-art facial behavior analysis toolkit

...The OpenFace toolkit is capable of performing several complex facial analysis tasks, including facial landmark detection, eye-gaze estimation, head pose estimation and facial action unit recognition. OpenFace is able to deliver state-of-the-art results in all of these mentioned tasks. OpenFace is available for Windows, Ubuntu and macOS installations. It is capable of real-time performance and does not need to run on any specialist hardware, a simple webcam will suffice.

Downloads: 18 This Week

Last Update: 2023-11-30
See Project
11

Video Nonlocal Net

Non-local Neural Networks for Video Classification

...Non-local blocks compute attention-like responses across all positions in space-time, allowing a feature at one frame and location to aggregate information from distant frames and regions. This formulation improves action recognition and spatiotemporal reasoning, especially for classes requiring context beyond short temporal windows. The repo provides training recipes and models for standard datasets, as well as ablations that show how many non-local blocks to insert and at which stages. Efficient implementations keep memory and compute manageable so the blocks can be added without rewriting the entire backbone. ...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
12

Ivolution

Timelapse creation using Face Recognition

Ivolution is a face timelapse generator. Feed it with a bunch of images and it will generate a movie with your face centered on the screen. Ivolution uses face detection and modifies the images so that your face always keeps the same size and location over the movie. Images are processed in chronological order, so that you can see your face evoluate over time !

Downloads: 0 This Week

Last Update: 2012-09-18
See Project
13

ViAmI-Server

Pattern recognition for ADL events

This software uses computer vision algorithms for mining sequence data from telemonitoring data with CBRs. We propose an approach which treats the detection of changes in behavior detected with a sensor/video fusion, which occur at radically different time-scales, through a CBR in two levels: low and high level. The system is always updating the database with the daily data.

Downloads: 0 This Week

Last Update: 2013-09-15
See Project
14

Real Time Face Tracking and Recognition

Real time face tracking and recognition refers to the task of locating human faces in a video stream and identifying the faces by matching them against the database of known faces.

Downloads: 0 This Week

Last Update: 2013-05-02
See Project
15

asmlibrary/aamlibrary

Active Shape/Appearance Model Library (ASMLibrary/AAMLibrary) source code, which includes ASMBuilding/AAMBuilding as well as ASMFitting/AAMFitting algorithm. It is developped under OpenCV 1.0 for locating features in a face and face recognition.

Downloads: 0 This Week

Last Update: 2014-06-29
See Project
16

VisAmp

VisAmp is a visually controlled mp3 player. It was initially developed during the "Softwarepraktikum" at the Chair for Image Processing and Pattern Recognition of the University of Freiburg, Germany in 2001.

Downloads: 0 This Week

Last Update: 2016-11-13
See Project
17

Mimas

The Mimas Toolkit is a C++ real-time computer vision library. Algorithms include edge/corner-detection, object recognition/tracking, LSI-filters, segmentation, array-operators, convolution etc. OO wrappers for LAPACK, libxine, V4L, FFTW are provided.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
18

MRCP4J

The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.

Downloads: 0 This Week

Last Update: 2013-04-25
See Project
19

Kainoa Biometric User Authentication

The purpose of this project is to provide a biometric security solution by using voice print, fingerprint and/or facial recognition along with a password and/or smart card support using AES to protect data. Please read forums for if interested.

Downloads: 0 This Week

Last Update: 2015-08-03
See Project
20

Open Biometry

The development of a biometric system and applications e.g. Access Control based on the system to recognize and verify people. The primary goal is face recognition, but other human attributes might get used too in the future.

Downloads: 0 This Week

Last Update: 2013-04-10
See Project
21

Scalable Multimodal Object Recognition

This is an object recognition library written on top of OpenCV. Scalable Multimodal Object Recognition (SMORs) is designed for real time highly accurate object detection.

Downloads: 0 This Week

Last Update: 2015-04-24
See Project
22

Sentry Pr0

A software package designed for controlling an AEG-based sentry turret with optical target recognition through a USB webcam

Downloads: 0 This Week

Last Update: 2013-03-22
See Project