Page 2 | recognition free download

Showing 148 open source projects for "recognition"

1

gImageReader

A graphical frontend to tesseract-ocr

...Features include:
- Import PDF documents and images from disk, scanning devices, clipboard and screenshots
- Process multiple images and documents in one go
- Manual or automatic recognition area definition
- Recognize to plain text or to hOCR documents
- Recognized text displayed directly next to the image
- Post-process the recognized text, including spellchecking
- Generate PDF documents from hOCR documents

Note: This page is only a mirror for the downloads. Development happens on GitHub at https://github.com/manisandro/gImageReader, where release binaries are also posted.

27 Reviews

Downloads: 142 This Week

Last Update: 2022-01-28
See Project
2

Photonix Photo Manager

A modern, web-based photo management server

A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms. This project is currently in development and not feature complete for a version 1.0 yet. If you don't mind putting up with broken parts or want to help out, run the Docker image and give it a go. I'd love for other contributors to get involved. You can move some photos into the folder data/photos and they should get detected and imported immediately. ...

Downloads: 1 This Week

Last Update: 2022-09-02
See Project
3

img2css

Convert any image to pure CSS. Recreates images using only box-shadows

This is a tool that can convert any image into a pure CSS image. I also made a per-pixel animation experiment using the box-shadow idea, see morphin. Pure CSS, this output was created by resizing and setting each pixel as a box shadow of a single-pixel div, so no IMG tag or background image is needed. This can result in huge outputs, and the use of this output is not recommended for production unless there is no other option. Base64, the entire image file is embedded inside the <img> tag...

Downloads: 0 This Week

Last Update: 2022-06-21
See Project
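
The box-shadow idea described in the img2css entry above can be illustrated with a short Python sketch. This is not the project's own code: the function name `pixels_to_box_shadow_css`, the `.art` selector, and the in-memory pixel grid are assumptions made for illustration. Each pixel becomes one zero-blur shadow of a single small element, offset by the pixel's position.

```python
def pixels_to_box_shadow_css(pixels, selector=".art", pixel_size=1):
    """Build a CSS rule that draws `pixels` using only box-shadows.

    `pixels` is a list of rows; each row is a list of (r, g, b) tuples.
    `selector` and `pixel_size` are illustrative parameters.
    """
    shadows = []
    for y, row in enumerate(pixels):
        for x, (r, g, b) in enumerate(row):
            # Offsets start at one pixel so that no shadow falls under the
            # element itself, where an outer box-shadow is not painted.
            dx = (x + 1) * pixel_size
            dy = (y + 1) * pixel_size
            shadows.append(f"{dx}px {dy}px 0 rgb({r}, {g}, {b})")
    joined = ",\n    ".join(shadows)
    return (
        f"{selector} {{\n"
        f"  width: {pixel_size}px;\n"
        f"  height: {pixel_size}px;\n"
        f"  box-shadow:\n    {joined};\n"
        f"}}"
    )


# A 2x2 image: red, green on the first row; blue, white on the second.
tiny_image = [
    [(255, 0, 0), (0, 255, 0)],
    [(0, 0, 255), (255, 255, 255)],
]
css_rule = pixels_to_box_shadow_css(tiny_image)
```

Because the rule contains one shadow per pixel, its size grows with the pixel count of the image, which is why the entry warns about huge outputs.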
4

TimeSformer

The official PyTorch implementation of our paper

TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
5

cuneiformplus

Fork of OCR software cuneiform

Fork of OCR software cuneiform. Original software: https://launchpad.net/cuneiform-linux by Cognitive Technologies and Jussi Pakkanen. Other open source OCR software: Tesseract by Ray Smith (using the Leptonica image library), GOCR, and OCRAD.

Downloads: 0 This Week

Last Update: 2020-12-08
See Project
6

LTI-Lib (C++ Computer Vision Library)

LTI-Lib is an object oriented computer vision library written in C++ for Windows/MS-VC++ and Linux/gcc. It provides lots of functionality to solve mathematical problems, many image processing algorithms, some classification tools and much more...

Downloads: 0 This Week

Last Update: 2020-11-05
See Project
7

chatbot_chung

chatbot chung is a simple entertainment chatbot based on a keyword-probability algorithm, with 3D talking OpenGL avatars, written in FreeBASIC. It can import AIML data (simple question/answer, question/random/answers, or single star/multi srai) saved from the "AIML_chung" open source application. An online HTML5/JavaScript version with automatic detection of 44 languages is available on the website (source included in the zip file). A SORT gentext text generation algorithm option has been added (desktop version).

Downloads: 2 This Week

Last Update: 2020-06-27
See Project
8

RAVL, Recognition And Vision Library.

General C++ Library, with modules for Computer Vision, Pattern Recognition and much more.

Downloads: 0 This Week

Last Update: 2020-04-22
See Project
9

Dissapearing-People

Removing people from complex backgrounds in real time

Person removal from complex backgrounds over time. Removing people from complex backgrounds in real-time using TensorFlow.js in the web browser using JavaScript. This code attempts to learn over time the makeup of the background of a video such that I can attempt to remove any humans from the scene. This is all happening in real-time, in the browser, using TensorFlow.js. This is an experiment. It may not be perfect in all situations. Go ahead and try it right now in your own web browser....

Downloads: 0 This Week

Last Update: 2021-11-22
See Project
10

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More help: https://sourceforge.net/p/skrybotdomowy/wiki/ Domain advanced versions (Polish Language) 1. ...

2 Reviews

Downloads: 0 This Week

Last Update: 2020-03-15
See Project
11

JNIZ music notation audio to midi

Music composition and notation software, audio-to-MIDI converter

The Jniz project is stopped. The new Web version is now JnizWeb, hosted on GitLab (under construction): https://gitlab.com/jniz70/jnizweb/ Demo: https://jniz70.gitlab.io/jnizweb/ Jniz is a piece of software designed for musicians as a support tool for musical composition. It allows you to build and harmonize several voices according to the rules of classical harmony. Sound/audio-to-MIDI converter: real-time conversion of any monophonic sound (voice, instrument etc.) into...

2 Reviews

Downloads: 1 This Week

Last Update: 2026-01-14
See Project
12

PRMLT

Matlab code of machine learning algorithms in book PRML

This Matlab package implements machine learning algorithms described in the great textbook: Pattern Recognition and Machine Learning by C. Bishop (PRML). It is written purely in the Matlab language. It is self-contained, with no external dependencies. This package requires Matlab R2016b or later, since it uses a new Matlab syntax called implicit expansion (a.k.a. broadcasting). It also requires the Statistics Toolbox (for some simple random number generators) and the Image Processing Toolbox (for reading image data). ...

Downloads: 0 This Week

Last Update: 2021-10-27
See Project
13

OpenFace

A state-of-the-art facial behavior analysis toolkit

...The OpenFace toolkit is capable of performing several complex facial analysis tasks, including facial landmark detection, eye-gaze estimation, head pose estimation and facial action unit recognition. OpenFace is able to deliver state-of-the-art results in all of these mentioned tasks. OpenFace is available for Windows, Ubuntu and macOS installations. It is capable of real-time performance and does not need to run on any specialist hardware, a simple webcam will suffice.

Downloads: 15 This Week

Last Update: 2023-11-30
See Project
14

LaueTools

Open source Python packages for X-ray MicroLaue Diffraction analysis

LaueTools is an open-source project for white beam Laue x-ray microdiffraction data analysis including tools in image processing, peaks searching & indexing, crystal structure solving (orientation & strain) and data & grain mapping visualisation. Python 3 Code and new features are now at: https://gitlab.esrf.fr/micha/lauetools

2 Reviews

Downloads: 3 This Week

Last Update: 2019-09-12
See Project
15

Video Nonlocal Net

Non-local Neural Networks for Video Classification

...Non-local blocks compute attention-like responses across all positions in space-time, allowing a feature at one frame and location to aggregate information from distant frames and regions. This formulation improves action recognition and spatiotemporal reasoning, especially for classes requiring context beyond short temporal windows. The repo provides training recipes and models for standard datasets, as well as ablations that show how many non-local blocks to insert and at which stages. Efficient implementations keep memory and compute manageable so the blocks can be added without rewriting the entire backbone. ...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
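
The attention-like aggregation described in the Video Nonlocal Net entry above can be sketched in plain Python. This is a simplified illustration, not the repository's code: it omits the learned projections and the residual connection that a real non-local block would normally include, and the function name `non_local_response` is an assumption. Each position's output is a softmax-weighted average of the features at all positions, so distant frames and regions can contribute.

```python
import math


def non_local_response(features):
    """Aggregate every position's feature from all positions.

    `features` is a list of N feature vectors, one per space-time position
    (a flattened T x H x W grid). Pairwise similarity is a plain dot product,
    which is a simplifying assumption.
    """
    outputs = []
    for xi in features:
        # Similarity of this position to every position, including itself.
        scores = [sum(a * b for a, b in zip(xi, xj)) for xj in features]
        # Numerically stable softmax over all positions.
        top = max(scores)
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        # Weighted average of all feature vectors.
        aggregated = [
            sum(w * xj[d] for w, xj in zip(weights, features)) / total
            for d in range(len(xi))
        ]
        outputs.append(aggregated)
    return outputs


# Three positions with 2-dimensional features.
responses = non_local_response([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
```

The cost is quadratic in the number of positions, which is one reason the entry stresses efficient implementations.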
16

NASH OS

Nash Operating System for Modern Ecommerce

The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use, state-of-the-art, and really awesome NASH OS! Over 25,000 flexible features and controls, all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.

Downloads: 2 This Week

Last Update: 2019-03-24
See Project
17

AudioCuesheetEditor

Write audio cuesheets quickly and efficiently

Tired of writing your audio cuesheets in a text editor? Here comes the solution! AudioCuesheetEditor helps you edit your audio cuesheet file. With AudioCuesheetEditor you can import text files, edit the imported data and afterwards export to a working audio cuesheet. There are also other export possibilities like CSV or XML. Feel free to try it ;).

2 Reviews

Downloads: 5 This Week

Last Update: 2021-01-21
See Project
18

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system, aka voice assistant, very similar to Siri, Google Now and Cortana. ILA is fully customizable and you can teach her/him/it new things yourself, like executing system commands, opening web pages, programs and apps, or just some basic conversation :-) ILA runs on Java and is thus compatible with Windows, Mac and Linux. It is designed to integrate with your home environment, for example to build your own free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognition CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). ...

4 Reviews

Downloads: 0 This Week

Last Update: 2018-07-23
See Project
19

OpenPR

OpenPR stands for Open Pattern Recognition project and is intended to be an open source library for algorithms of image processing, computer vision, natural language processing, pattern recognition, machine learning and the related fields.

Downloads: 3 This Week

Last Update: 2018-05-15
See Project
20

JuliusModels

Open source speech models for Julius in English and other languages.

Open source speech models for the Julius speech decoder. The aim is to give a wider community of speech recognition enthusiasts access to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc.). All of the models are based on HTK modelling software and data sets freely available on the Internet.

Downloads: 1 This Week

Last Update: 2018-05-11
See Project
21

FM2TXT

Listens to radio via RTL-SDR, recognizes audio, and writes a text file log

Log speech from your favorite FM station to a text file using an rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer the GitHub source to SF: https://github.com/randaller/fm2txt For those who want to recognize from a soundcard instead of rtl-sdr (this allows transcribing NFM etc.): https://github.com/randaller/souncard2txt

Downloads: 1 This Week

Last Update: 2017-12-17
See Project
22

retina.js

JavaScript helpers for rendering high-resolution image variants

retina.js makes it easy to serve high-resolution images to devices with displays that support them. You can prepare images for as many levels of pixel density as you want and let retina.js dynamically serve the right image to the user. retina.js assumes you are using Apple's prescribed high-resolution modifiers (@2x, @3x, etc) to denote high-res image variants on your server. It also assumes that if you have prepared a variant for a given high-res environment, that you have also prepared...

Downloads: 0 This Week

Last Update: 2021-12-09
See Project
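
The `@2x`, `@3x` naming convention mentioned in the retina.js entry above can be sketched as follows. This is an illustration in Python rather than retina.js itself, and both helper names (`hires_variant`, `pick_density`) as well as the rounding-and-capping rule are assumptions, not the library's documented behavior.

```python
import math
import posixpath


def hires_variant(path, density):
    """Return `path` with an @Nx modifier inserted before the extension."""
    if density <= 1:
        return path  # the base image carries no modifier
    root, ext = posixpath.splitext(path)
    return f"{root}@{density}x{ext}"


def pick_density(device_pixel_ratio, max_prepared):
    """Hypothetical selection rule: round the display density up, then cap it
    at the highest density for which a variant was prepared."""
    wanted = max(1, math.ceil(device_pixel_ratio))
    return min(wanted, max_prepared)


# A display with pixel ratio 2.0, with variants prepared up to 3x.
chosen = hires_variant("/images/logo.png", pick_density(2.0, 3))
```

Under these assumptions, `chosen` is `/images/logo@2x.png`, while a standard-density display keeps the unmodified `/images/logo.png`.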
23

FormRead

Free OMR - OCR web software based on JavaScript and PHP

https://formread.org FormRead is a completely free OMR (optical mark recognition) web application for scanning and grading user-filled, multiple choice forms. Create your forms with any of your office or drawing tools, scan them and parameterize their coordinates in an easy way. Once you have parameterized your form, you can print many copies, give them to your students/respondents, scan and recognize them with FormRead, and finally export the data in your preferred format (Excel, PDF, CSV).

Downloads: 6 This Week

Last Update: 2022-03-04
See Project
24

OCR Web based

Web-based OCR for the Firefox browser & PC

...id=com.ulm.ocr ========= Add-on for Opera: http://bit.ly/1F0E0wP ========= Release 1.0.1 For safety reasons, I disabled the option to import an image from a URL. Finally, note that you can write or draw directly on the canvas and then run character recognition and text extraction on the result.

2 Reviews

Downloads: 0 This Week

Last Update: 2018-09-05
See Project
25

cbrTekStraktor

An application that automatically extracts text from comic books.

...It is based on the following 3 major algorithms: binarization of color images (Niblack and other methods), connected components, and K-means clustering. Tesseract (Apache-licensed) is used to perform Optical Character Recognition on the extracted text. A subsequent version of the application will integrate with translation software in order to provide automated translation of comic book texts and re-insertion of translated texts.

Downloads: 5 This Week

Last Update: 2017-06-14
See Project
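
The Niblack binarization named in the cbrTekStraktor entry above can be sketched in plain Python. This is the textbook form of the method (local threshold = local mean + k × local standard deviation), not code from the application; the function name, the window size, and the default `k = -0.2` are assumptions chosen for illustration, and the input is assumed to be a grayscale image with dark text on a light background.

```python
import math


def niblack_binarize(gray, window=3, k=-0.2):
    """Mark dark pixels as foreground (1) using a per-pixel local threshold.

    `gray` is a list of rows of intensity values. For each pixel the
    threshold is mean + k * std of the surrounding window, clipped at the
    image borders.
    """
    height, width = len(gray), len(gray[0])
    radius = window // 2
    result = []
    for y in range(height):
        row = []
        for x in range(width):
            # Collect the neighborhood around (x, y), clipped to the image.
            values = [
                gray[j][i]
                for j in range(max(0, y - radius), min(height, y + radius + 1))
                for i in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            mean = sum(values) / len(values)
            variance = sum((v - mean) ** 2 for v in values) / len(values)
            threshold = mean + k * math.sqrt(variance)
            row.append(1 if gray[y][x] < threshold else 0)
        result.append(row)
    return result


# A light 3x3 patch with one dark pixel in the middle.
patch = [
    [200, 200, 200],
    [200, 10, 200],
    [200, 200, 200],
]
mask = niblack_binarize(patch)
```

In a pipeline like the one the entry describes, a mask of this kind would be the input to connected-component labeling, which groups adjacent foreground pixels into candidate characters.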