recognition free download

Showing 56 open source projects for "recognition"

View related business solutions

Multimedia C++ Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
1

MediaPipe

Cross-platform, customizable ML solutions for live and streaming media

...Utilizing lightweight model architectures together with GPU acceleration throughout the pipeline, the solution delivers real-time performance-critical for live experiences. Human pose estimation from video plays a critical role in various applications such as quantifying physical exercises, sign language recognition, and full-body gesture control. For example, it can form the basis for yoga, dance, and fitness applications. It can also enable the overlay of digital content and information on top of the physical world in augmented reality.

Downloads: 71 This Week

Last Update: 2026-04-23
See Project
2

VMZ (Video Model Zoo)

VMZ: Model Zoo for Video Modeling

The codebase was designed to help researchers and practitioners quickly reproduce FAIR’s results and leverage robust pre-trained backbones for downstream tasks. It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
3

SmartVision

Free video surveillance software compatible with Windows

...In emergencies, it automatically initiates recording, preserving crucial video footage as evidence. The system offers features such as motion detection, object detection, face recognition, automatic license plate recognition (ALRP), fire and dust detection, and is integrated with cloud services.

2 Reviews

Downloads: 0 This Week

Last Update: 2024-10-09
See Project
4

scantailor-experimental

Scan Tailor Experimental is an interactive post-processing tool

Scan Tailor Experimental is an interactive post-processing tool for scanned pages. You give it raw scans, and you get pages ready to be printed or assembled into a PDF or DJVU file. Scanning, optical character recognition, and assembling multi-page documents are out of scope of this project.

Downloads: 21 This Week

Last Update: 2024-11-27
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

cuneimusicplus

Optical music recognition library

Optical music recognition library in C++/C

Downloads: 0 This Week

Last Update: 2023-02-07
See Project
6

VoodooHDA

VoodooHDA is an open source audio driver for devices compliant with the Intel High Definition Audio specification. It is intended as a replacement for AppleHDA on Mac OS X with support for a wide range of audio controllers and codecs.

20 Reviews

Downloads: 199 This Week

Last Update: 2022-09-07
See Project
7

escom-henoc

Physics Simulation Software based on user sketchs running a pattern recognition agent, this app is able to animate a physics sketch, from a blackboard

Downloads: 0 This Week

Last Update: 2022-04-08
See Project
8

gImageReader

A graphical frontend to tesseract-ocr

...Features include: - Import PDF documents and images from disk, scanning devices, clipboard and screenshots - Process multiple images and documents in one go - Manual or automatic recognition area definition - Recognize to plain text or to hOCR documents - Recognized text displayed directly next to the image - Post-process the recognized text, including spellchecking - Generate PDF documents from hOCR documents **Note**: This page is only a mirror for the downloads. Development is happening on github at https://github.com/manisandro/gImageReader, release binaries are also posted there.

27 Reviews

Downloads: 122 This Week

Last Update: 2022-01-28
See Project
9

cuneiformplus

Fork of OCR software cuneiform

Fork of OCR software cuneiform Original software see: https://launchpad.net/cuneiform-linux by Cognitive Technologies and Jussi Pakkanen Other Open Source OCR stuff see * Tesseract by Ray Smith (using the Leptonica image library) * GOCR * OCRAD

Downloads: 2 This Week

Last Update: 2020-12-08
See Project
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
10

LTI-Lib (C++ Computer Vision Library)

LTI-Lib is an object oriented computer vision library written in C++ for Windows/MS-VC++ and Linux/gcc. It provides lots of functionality to solve mathematical problems, many image processing algorithms, some classification tools and much more...

Downloads: 1 This Week

Last Update: 2020-11-05
See Project
11

RAVL, Recognition And Vision Library.

General C++ Library, with modules for Computer Vision, Pattern Recognition and much more.

Downloads: 3 This Week

Last Update: 2020-04-22
See Project
12

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More help: https://sourceforge.net/p/skrybotdomowy/wiki/ Domain advanced versions (Polish Language) 1. ...

2 Reviews

Downloads: 0 This Week

Last Update: 2020-03-15
See Project
13

OpenPR

OpenPR stands for Open Pattern Recognition project and is intended to be an open source library for algorithms of image processing, computer vision, natural language processing, pattern recognition, machine learning and the related fields.

Downloads: 3 This Week

Last Update: 2018-05-15
See Project
14

JuliusModels

Open source speech models for Julius in English and other languages.

Open source speech models for Julius speech decoder. Its aim is to give access a wider community of speech recognition enthusiasts to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc...) All of the models are based on HTK modelling software and data sets available freely on the Internet.

Downloads: 0 This Week

Last Update: 2018-05-11
See Project
15

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and development of automatic distant speech recognition.

Downloads: 2 This Week

Last Update: 2019-08-21
See Project
16

Gamera

Gamera is a framework for the creation of structured document analysis applications by domain experts. It combines a programming library with GUI tools for the training and interactive development of recognition systems.

Downloads: 0 This Week

Last Update: 2016-05-11
See Project
17

libcrn

libcrn is document image processing library written in C++11 for Linux, Windows, Mac OsX and Google Android. It is a toolbox that allows to create easily software such as OCRs and layout analysis tools.

Downloads: 1 This Week

Last Update: 2016-10-23
See Project
18

Specimen Photography for Canon Powershot

SpecimenPhoto controls a Canon Powershot camera for specimen archival photography. Each photograph is assigned a case number, labeled and stored. Identification is manual or "hands free" using separately available barcode and speech recognition modules.

Downloads: 0 This Week

Last Update: 2015-04-08
See Project
19

My Music Recognition

This application can help you quickly identify the name of any song.

My Music Recognition uses a powerful audio recognition engine in order to help you get the name of the song you are listening to. It can capture sound from radio streams, the installed music player or any other source and display the name of the song in seconds.

3 Reviews

Downloads: 3 This Week

Last Update: 2016-11-29
See Project
20

SegmentDisplayOCR

Seven-segment display recognition filter for AviSynth

SegmentDisplayOCR is a seven-segment display recognition filter for AviSynth. It has built in logging functionality (it will log frame recognition results) and also can be used in AviSynth conditional filters. The main purpose of this filter is to process readings of various digital instruments (e.g. digital multimeters) captured on video. So if your favourite instrument lacks interface for connecting it to PC you can capture it's readings on cam and convert them to computer readable format with SegmentDisplayOCR filter.

Downloads: 0 This Week

Last Update: 2014-08-26
See Project
21

jaivox

Speech recognition application builder and library

Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.

Downloads: 0 This Week

Last Update: 2015-03-26
See Project
22

Sage-sb

Semiautomatic generation of semantic building models from image series

We present an approach to generate a 3D model of a building including semantic annotations from image series. In the recent years semantic based modeling, reconstruction of buildings and building recognition became more and more important. Semantic building models have more information than just the geometry, thus making them more suitable for recognition or simulation tasks. The time consuming generation of such models and annotations makes an automatism desirable. Therefore, we present a semiautomatic approach towards semantic model generation. ...

Downloads: 0 This Week

Last Update: 2014-08-29
See Project
23

Arabic Licence Plate Recognition

Arabic Licence Plate Recognition

This project is open souce for the Arabic Licence plate recognition for vehicles in Saudi Arabia.

1 Review

Downloads: 0 This Week

Last Update: 2014-11-02
See Project
24

LeapInto

Simplified interface to Leap Motion designed for art and music apps

LeapInto provides a simplified interface to the Leap Motion hand sensor input device. Multiple hand recognition is simplified to several stable categories and coordinates are normalised. The interface comes two flavours at present, an open broadcast system using the OSC protocol and a plugin for the Csound audio/music programming language.

Downloads: 0 This Week

Last Update: 2016-05-05
See Project
25

DownloadDaemon

DownloadDaemon is a comfortable download-manager with many features like one-click-hoster support, etc. It can be remote-controled in several ways (web/gui/console clients), which makes it perfect for file- and root-servers, as well as for local use.

5 Reviews

Downloads: 0 This Week

Last Update: 2014-03-12
See Project