Search Results for "recognition" - Page 7

Sort By:

Showing 260 open source projects for "recognition"

View related business solutions

Python Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
1

AiHound

AI powered image classification for nudity and documents / id-cards

AI Hound is designed to run from an USB pendrive or any other kind of removeable and writeable media. The programm checks all Office-documents, Images and videos for various categories for images. Actually It can recognice nudity/porn and scanned or photographed documents / ID- and credit-cards. I am working on a model that also recognice various types of drugs in images.

Downloads: 0 This Week

Last Update: 2023-04-20
See Project
2

ConvNeXt V2

Code release for ConvNeXt V2 model

...A key innovation is a new Global Response Normalization (GRN) layer added to the ConvNeXt backbone, which enhances feature competition across channels. The result is a convnet that competes strongly with transformer architectures on recognition benchmarks while being efficient and hardware-friendly. The repository provides official PyTorch implementations for multiple model sizes (Atto, Femto, Pico, up through Huge), conversion from JAX weights, code for pretraining/fine-tuning, and pretrained checkpoints. It supports both self-supervised pretraining and supervised fine-tuning.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
3

LSTMs for Human Activity Recognition

Human Activity Recognition example using TensorFlow on smartphone

LSTM-Human-Activity-Recognition is a machine learning project that demonstrates how recurrent neural networks can be used to recognize human activities from sensor data. The repository implements a deep learning model based on Long Short-Term Memory (LSTM) networks to classify physical activities using time-series data collected from wearable sensors. The project uses the well-known Human Activity Recognition dataset derived from smartphone accelerometer and gyroscope signals. ...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
4

ImageAI

A python library built to empower developers

ImageAI is an easy-to-use Computer Vision Python library that empowers developers to easily integrate state-of-the-art Artificial Intelligence features into their new and existing applications and systems. It is used by thousands of developers, students, researchers, tutors and experts in corporate organizations around the world. You will find features supported, links to official documentation as well as articles on ImageAI. ImageAI is widely used around the world by professionals,...

Downloads: 6 This Week

Last Update: 2022-12-21
See Project
Stop Storing Third-Party Tokens in Your Database
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.

Try Auth0 for Free
5

tgcf

The ultimate tool to automate custom telegram message forwarding

The ultimate tool to automate custom telegram message forwarding. Live-syncer, Auto-poster, backup-bot, cloner, chat-forwarder, duplicator, ... Call it whatever you like! tgcf is an advanced telegram chat forwarding automation tool that can fulfill all your custom needs.

Downloads: 0 This Week

Last Update: 2024-09-19
See Project
6

Automatic YouTube subtitle generation

Using OpenAI's Whisper to automatically generate YouTube subtitles

...It allows users to download videos or audio from YouTube and automatically generate subtitles or transcripts. The tool processes media locally, extracting audio and applying speech recognition to produce accurate text outputs. It supports multiple languages and can handle different Whisper model sizes, balancing performance and accuracy. yt-whisperc is designed for automation, enabling batch processing of multiple videos for transcription workflows. It also provides options for exporting subtitles in common formats such as SRT. ...

Downloads: 0 This Week

Last Update: 2026-04-24
See Project
7

Facexlib

FaceXlib aims at providing ready-to-use face-related functions

facexlib is a PyTorch-based library providing ready-to-use face-related functions, including detection, alignment, recognition, and more. It integrates state-of-the-art open-source methods for various face processing tasks.

Downloads: 12 This Week

Last Update: 2025-04-24
See Project
8

ASRT Speech Recognition

A Deep-Learning-Based Chinese Speech Recognition System

ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.

Downloads: 2 This Week

Last Update: 2025-07-03
See Project
9

ConvNeXt

Code release for ConvNeXt model

...It revisits classic ResNet-style backbones through the lens of transformer design trends—large kernel sizes, inverted bottlenecks, layer normalization, and GELU activations—to bridge the performance gap between convolutions and attention-based models. ConvNeXt’s clean, hierarchical structure makes it efficient for both pretraining and fine-tuning across a wide range of visual recognition tasks. It achieves competitive or superior results on ImageNet and downstream datasets while being easier to deploy and train than transformers. The repository provides pretrained models, training recipes, and ablation studies demonstrating how incremental design choices collectively yield state-of-the-art performance.

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

Tensorflow Transformers

State of the art faster Transformer with Tensorflow 2.0

...These models can be applied on text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. Images, for tasks like image classification, object detection, and segmentation. Audio, for tasks like speech recognition and audio classification. Faster AutoReggressive Decoding, TFlite support, creating TFRecords is simple. Auto-Batching tf.data.dataset or tf.ragged tensors. Everything is dictionary (inputs and outputs) Multiple mask modes like causal, user-defined, prefix. tensorflow-text tokenizer support. Supports GPU, TPU, multi-GPU trainer with wandb, multiple callbacks, auto tensorboard.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
11

tom_core

tom_core - a tool for automating events on a computer

...The application repeats all your clicks or drags, keystrokes, hotkeys, etc. All in exactly the timing and number of repetitions you need. The toolbox such as the optical recognition and voice control enables to branch out the recordings into complex forms, with which application brings the possibility of programming even to those who don’t have programming skills or experiences.

Downloads: 0 This Week

Last Update: 2022-05-17
See Project
12

DeepStack

The World's Leading Cross Platform AI Engine for Edge Devices

DeepStack is an AI API engine that serves pre-built models and custom models on multiple edge devices locally or on your private cloud. DeepStack runs completely offline and independent of the cloud. You can also install and run DeepStack on any cloud VM with docker installed to serve as your private, state-of-the-art and real-time AI server.

Downloads: 4 This Week

Last Update: 2024-09-04
See Project
13

Alfi Face

Face Recognition based Attendance System for school, college...

ALFI FACE uses facial recognition technology to record the attendance through a digital camera that detects and recognizes faces and compare the faces with students’ faces images stored in faces database. Once the recognized face matches a stored image, attendance is marked in attendance database for that person. Note: While adding a new student you have to click on" Train the Recognizer" button .

1 Review

Downloads: 0 This Week

Last Update: 2022-04-16
See Project
14

AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video

...AutoSub leverages FFmpeg for media handling and integrates with speech recognition engines for transcription. It is particularly useful for content creators who want to quickly produce subtitles without manual effort. Overall, it simplifies the process of making media content accessible and searchable.

Downloads: 14 This Week

Last Update: 2026-04-28
See Project
15

KoNLPy

Python package for Korean natural language processing

KoNLPy is a natural language processing (NLP) library for the Korean language, offering tokenization, morphological analysis, and named entity recognition.

Downloads: 1 This Week

Last Update: 2025-01-24
See Project
16

Photonix Photo Manager

A modern, web-based photo management server

A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms. This project is currently in development and not feature complete for a version 1.0 yet. If you don't mind putting up with broken parts or want to help out, run the Docker image and give it a go. I'd love for other contributors to get involved. You can move some photos into the folder data/photos and they should get detected and imported immediately. ...

Downloads: 1 This Week

Last Update: 2022-09-02
See Project
17

Detectron2

Next-generation platform for object detection and segmentation

Detectron2 is Facebook AI Research's next generation software system that implements state-of-the-art object detection algorithms. It is a ground-up rewrite of the previous version, Detectron, and it originates from maskrcnn-benchmark. It is powered by the PyTorch deep learning framework. Includes more features such as panoptic segmentation, Densepose, Cascade R-CNN, rotated bounding boxes, PointRend, DeepLab, etc. Can be used as a library to support different projects on top of it. We'll...

Downloads: 0 This Week

Last Update: 2021-10-26
See Project
18

PyTorchVideo

A deep learning library for video understanding research

PyTorchVideo is a deep learning library for video understanding, providing modular components and pretrained models for tasks like action recognition, video classification, detection, and self-supervised learning. It is tightly integrated with PyTorch and PyTorch Lightning, offering flexible APIs for building and training spatiotemporal networks. The library includes efficient implementations of state-of-the-art architectures such as SlowFast, X3D, and MViT, optimized for both research prototyping and production inference. ...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
19

DeepImageTranslator

DeepImageTranslator: a deep-learning utility for image translation

Created by: Run Zhou Ye, En Zhou Ye, and En Hui Ye DeepImageTranslator: a free, user-friendly tool for image translation using deep-learning and its applications in CT image analysis Citation: Please cite this software as: Ye RZ, Noll C, Richard G, Lepage M, Turcotte ÉE, Carpentier AC. DeepImageTranslator: a free, user-friendly graphical interface for image translation using deep-learning and its applications in 3D CT image analysis. SLAS technology. 2022 Feb 1;27(1):76-84....

Downloads: 0 This Week

Last Update: 2022-11-16
See Project
20

Kashgari

Kashgari is a production-level NLP Transfer learning framework

Kashgari is a simple and powerful NLP Transfer learning framework, build a state-of-art model in 5 minutes for named entity recognition (NER), part-of-speech tagging (PoS), and text classification tasks.

Downloads: 0 This Week

Last Update: 2024-08-09
See Project
21

Differentiable Neural Computer

A TensorFlow implementation of the Differentiable Neural Computer

The Differentiable Neural Computer (DNC), developed by Google DeepMind, is a neural network architecture augmented with dynamic external memory, enabling it to learn algorithms and solve complex reasoning tasks. Published in Nature in 2016 under the paper “Hybrid computing using a neural network with dynamic external memory,” the DNC combines the pattern recognition power of neural networks with a memory module that can be written to and read from in a differentiable way. This allows the model to learn how to store and retrieve information across long time horizons, much like a traditional computer. The architecture consists of modular components including an access module for managing memory operations, a controller (often an LSTM or feedforward network) for issuing read/write commands, and submodules for temporal linkage and memory allocation tracking.

Downloads: 0 This Week

Last Update: 3 days ago
See Project
22

Store Management System

system which works as electronic notebook for keeping records...

Visit our site given below for the LATEST UPDATE and guide for use of store management system on https://tmotagam.github.io/ Shop management system which works as electronic notebook it keeps data about sales and also of products it has Point of sale backups and inventory system WHAT`S NEW: Added New Icon We have launched new project FTPT for sharing your files with your teams using ftp go check it out LINK https://sourceforge.net/projects/ftpt/ We have launched new project Tahafacex for attendance systems using face recognition technology. A great and easy to use interface, can be used in any environment from business to schools to colleges go check it out LINK https://sourceforge.net/projects/tahafacex/

1 Review

Downloads: 3 This Week

Last Update: 2022-10-27
See Project
23

Texthero

Text preprocessing, representation and visualization from zero to hero

Texthero is a python package to work with text data efficiently. It empowers NLP developers with a tool to quickly understand any text-based dataset and it provides a solid pipeline to clean and represent text data, from zero to hero.

Downloads: 0 This Week

Last Update: 2024-08-07
See Project
24

Chatette

A powerful dataset generator for Rasa NLU, inspired by Chatito

...It employs a domain-specific language to define templates, enabling the creation of diverse and extensive training examples for intent classification and entity recognition.

Downloads: 0 This Week

Last Update: 2025-04-28
See Project
25

TimeSformer

The official pytorch implementation of our paper

TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project