Showing 204 open source projects for "recognition"

View related business solutions
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    TaskMatrix

    TaskMatrix

    Enable sending and receiving images during chatting

    ...Originally introduced alongside the Visual ChatGPT concept, TaskMatrix acts as an orchestration framework where a central language model delegates subtasks to domain-specific AI systems such as image generators, segmentation tools, or recognition models. The architecture focuses on modularity, allowing new APIs and foundation models to be integrated as interchangeable task-solving components. The project also explores low-code human-AI interaction workflows that improve controllability and transparency during complex task execution.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    VATSG

    VATSG

    Video automatic transcribe and translated subtitle generator

    ...This is the subtitle generator(VATSG) which use [moviepy](https://github.com/Zulko/moviepy) to generate mp3 and then use [faster-whisper](https://github.com/guillaumekln/faster-whisper) to get text recognition and then use deepl-api to generate your target language subtitle file(srt format) If you are a general user who want to view any video file and mp3 file to your language, It will provide way. It's very easy to use because it has simple gui and very intuitive. So you can easily use it for any purpose. Now, you can choose to download either window installer setup type or uninstalled type. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3

    Image to Text

    Convert an image to text to spot intelligible words.

    The program will convert to text an image, such as a photo , with the purpose of analyzing it to spot intelligible words. Use the program with photos of clouds, sea, soil, vegetation or any other photo of natural or man-made semi-homogeneous configuration, to reveal the hidden universal-philosophical messages of the image. You can also use it on photos of people or art pieces to have a psychological insight of the person portrayed or of the image author. The resulting text will be a long...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Hyperformer

    Hyperformer

    Hypergraph Transformer for Skeleton-based Action Recognition

    This is the official implementation of our paper "Hypergraph Transformer for Skeleton-based Action Recognition." Skeleton-based action recognition aims to recognize human actions given human joint coordinates with skeletal interconnections. By defining a graph with joints as vertices and their natural connections as edges, previous works successfully adopted Graph Convolutional networks (GCNs) to model joint co-occurrences and achieved superior performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 5
    Img2Txt

    Img2Txt

    Img2Txt - Extract Text From Images using AI

    ...Please Include the official Download Link What is Img2Txt? Img2Txt is a Python-based application packaged using PyInstaller that utilizes the power of pytesseract, an AI-powered optical character recognition (OCR) library, to extract text from images and convert it into plain text. The application features a simple and modern user-friendly interface created using customtkinter, allowing users to easily process images and obtain the text within them. Support me at : https://www.buymeacoffee.com/zsynctic it will motivate me and it will make me create more projects Support For any questions or issues, please open an issue on the Img2Txt GitHub repository. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    AiHound

    AiHound

    AI powered image classification for nudity and documents / id-cards

    AI Hound is designed to run from an USB pendrive or any other kind of removeable and writeable media. The programm checks all Office-documents, Images and videos for various categories for images. Actually It can recognice nudity/porn and scanned or photographed documents / ID- and credit-cards. I am working on a model that also recognice various types of drugs in images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ConvNeXt V2

    ConvNeXt V2

    Code release for ConvNeXt V2 model

    ...A key innovation is a new Global Response Normalization (GRN) layer added to the ConvNeXt backbone, which enhances feature competition across channels. The result is a convnet that competes strongly with transformer architectures on recognition benchmarks while being efficient and hardware-friendly. The repository provides official PyTorch implementations for multiple model sizes (Atto, Femto, Pico, up through Huge), conversion from JAX weights, code for pretraining/fine-tuning, and pretrained checkpoints. It supports both self-supervised pretraining and supervised fine-tuning.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    LSTMs for Human Activity Recognition

    LSTMs for Human Activity Recognition

    Human Activity Recognition example using TensorFlow on smartphone

    LSTM-Human-Activity-Recognition is a machine learning project that demonstrates how recurrent neural networks can be used to recognize human activities from sensor data. The repository implements a deep learning model based on Long Short-Term Memory (LSTM) networks to classify physical activities using time-series data collected from wearable sensors. The project uses the well-known Human Activity Recognition dataset derived from smartphone accelerometer and gyroscope signals. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ImageAI

    ImageAI

    A python library built to empower developers

    ImageAI is an easy-to-use Computer Vision Python library that empowers developers to easily integrate state-of-the-art Artificial Intelligence features into their new and existing applications and systems. It is used by thousands of developers, students, researchers, tutors and experts in corporate organizations around the world. You will find features supported, links to official documentation as well as articles on ImageAI. ImageAI is widely used around the world by professionals,...
    Downloads: 8 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    tgcf

    tgcf

    The ultimate tool to automate custom telegram message forwarding

    The ultimate tool to automate custom telegram message forwarding. Live-syncer, Auto-poster, backup-bot, cloner, chat-forwarder, duplicator, ... Call it whatever you like! tgcf is an advanced telegram chat forwarding automation tool that can fulfill all your custom needs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    ASRT Speech Recognition

    ASRT Speech Recognition

    A Deep-Learning-Based Chinese Speech Recognition System

    ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    ConvNeXt

    ConvNeXt

    Code release for ConvNeXt model

    ...It revisits classic ResNet-style backbones through the lens of transformer design trends—large kernel sizes, inverted bottlenecks, layer normalization, and GELU activations—to bridge the performance gap between convolutions and attention-based models. ConvNeXt’s clean, hierarchical structure makes it efficient for both pretraining and fine-tuning across a wide range of visual recognition tasks. It achieves competitive or superior results on ImageNet and downstream datasets while being easier to deploy and train than transformers. The repository provides pretrained models, training recipes, and ablation studies demonstrating how incremental design choices collectively yield state-of-the-art performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    DeepStack

    DeepStack

    The World's Leading Cross Platform AI Engine for Edge Devices

    DeepStack is an AI API engine that serves pre-built models and custom models on multiple edge devices locally or on your private cloud. DeepStack runs completely offline and independent of the cloud. You can also install and run DeepStack on any cloud VM with docker installed to serve as your private, state-of-the-art and real-time AI server.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    Alfi Face

    Alfi Face

    Face Recognition based Attendance System for school, college...

    ALFI FACE uses facial recognition technology to record the attendance through a digital camera that detects and recognizes faces and compare the faces with students’ faces images stored in faces database. Once the recognized face matches a stored image, attendance is marked in attendance database for that person. Note: While adding a new student you have to click on" Train the Recognizer" button .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    KoNLPy

    KoNLPy

    Python package for Korean natural language processing

    KoNLPy is a natural language processing (NLP) library for the Korean language, offering tokenization, morphological analysis, and named entity recognition.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    Photonix Photo Manager

    Photonix Photo Manager

    A modern, web-based photo management server

    A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms. This project is currently in development and not feature complete for a version 1.0 yet. If you don't mind putting up with broken parts or want to help out, run the Docker image and give it a go. I'd love for other contributors to get involved. You can move some photos into the folder data/photos and they should get detected and imported immediately. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Detectron2

    Detectron2

    Next-generation platform for object detection and segmentation

    Detectron2 is Facebook AI Research's next generation software system that implements state-of-the-art object detection algorithms. It is a ground-up rewrite of the previous version, Detectron, and it originates from maskrcnn-benchmark. It is powered by the PyTorch deep learning framework. Includes more features such as panoptic segmentation, Densepose, Cascade R-CNN, rotated bounding boxes, PointRend, DeepLab, etc. Can be used as a library to support different projects on top of it. We'll...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    DeepImageTranslator

    DeepImageTranslator

    DeepImageTranslator: a deep-learning utility for image translation

    Created by: Run Zhou Ye, En Zhou Ye, and En Hui Ye DeepImageTranslator: a free, user-friendly tool for image translation using deep-learning and its applications in CT image analysis Citation: Please cite this software as: Ye RZ, Noll C, Richard G, Lepage M, Turcotte ÉE, Carpentier AC. DeepImageTranslator: a free, user-friendly graphical interface for image translation using deep-learning and its applications in 3D CT image analysis. SLAS technology. 2022 Feb 1;27(1):76-84....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Kashgari

    Kashgari

    Kashgari is a production-level NLP Transfer learning framework

    Kashgari is a simple and powerful NLP Transfer learning framework, build a state-of-art model in 5 minutes for named entity recognition (NER), part-of-speech tagging (PoS), and text classification tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Store Management System

    Store Management System

    system which works as electronic notebook for keeping records...

    Visit our site given below for the LATEST UPDATE and guide for use of store management system on https://tmotagam.github.io/ Shop management system which works as electronic notebook it keeps data about sales and also of products it has Point of sale backups and inventory system WHAT`S NEW: Added New Icon We have launched new project FTPT for sharing your files with your teams using ftp go check it out LINK https://sourceforge.net/projects/ftpt/ We have launched new project Tahafacex for attendance systems using face recognition technology. A great and easy to use interface, can be used in any environment from business to schools to colleges go check it out LINK https://sourceforge.net/projects/tahafacex/
    Downloads: 6 This Week
    Last Update:
    See Project
  • 21
    Texthero

    Texthero

    Text preprocessing, representation and visualization from zero to hero

    Texthero is a python package to work with text data efficiently. It empowers NLP developers with a tool to quickly understand any text-based dataset and it provides a solid pipeline to clean and represent text data, from zero to hero.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    TimeSformer

    TimeSformer

    The official pytorch implementation of our paper

    TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Tahafacex

    Tahafacex

    It works as electronic book for keeping attendance records...

    It works as electronic book for keeping attendance records using face identification\recognition technology it can mark attendance upto 100 person it can be used anywhere. MINOR CHANGE: Functionality improved MAJOR CHANGE: 1. Backup and Restore (For 1.1.0 and future versions only) 2. Export your attendance sheets in excel format of each class
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Gluon CV Toolkit

    Gluon CV Toolkit

    Gluon CV Toolkit

    ...It features training scripts that reproduce SOTA results reported in latest papers, a large set of pre-trained models, carefully designed APIs and easy-to-understand implementations and community support. From fundamental image classification, object detection, semantic segmentation and pose estimation, to instance segmentation and video action recognition. The model zoo is the one-stop shopping center for many models you are expecting. GluonCV embraces a flexible development pattern while is super easy to optimize and deploy without retaining a heavyweight deep learning framework.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Lambda Networks

    Lambda Networks

    Implementation of LambdaNetworks, a new approach to image recognition

    Implementation of λ Networks, a new approach to image recognition that reaches SOTA on ImageNet. The new method utilizes λ layer, which captures interactions by transforming contexts into linear functions, termed lambdas, and applying these linear functions to each input separately. Shinel94 has added a Keras implementation! It won't be officially supported in this repository, so either copy / paste the code under .
    Downloads: 3 This Week
    Last Update:
    See Project
Auth0 Logo