Showing 528 open source projects for "recognition"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    Hyperformer

    Hyperformer

    Hypergraph Transformer for Skeleton-based Action Recognition

    This is the official implementation of our paper "Hypergraph Transformer for Skeleton-based Action Recognition." Skeleton-based action recognition aims to recognize human actions given human joint coordinates with skeletal interconnections. By defining a graph with joints as vertices and their natural connections as edges, previous works successfully adopted Graph Convolutional networks (GCNs) to model joint co-occurrences and achieved superior performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Nhom10_NhanDienKhuonMat_

    Nhom10_NhanDienKhuonMat_

    Face Recognition web app built on Python using Flask.

    Face Recognition web app built on Python using Flask. Students' final project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Dual Clip Translator
    Translation of Selected text or Clipboard contents powered by Google. HotKeys Paste/Change Text auto translated. View in Balloon/Window the result of translation, besides being sent to the clipboard. Screen Capture of Desktop/Game > OCR > Translated.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    LifeAI is an artificial intelligence system that can be applied to robotics, games, or business. It simulates key processes of our minds, such as organizing data into concepts and categories, planning actions based on their predicted outcome, and communication. LifeAI was designed to be simple, but powerful and flexible enough to have many applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    VnCoreNLP

    VnCoreNLP

    A Vietnamese natural language processing toolkit

    VnCoreNLP is a Java-based natural language processing toolkit tailored for Vietnamese. It offers a fast and accurate pipeline for essential NLP tasks, facilitating research and application development in Vietnamese language processing. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Knet

    Knet

    Koç University deep learning framework

    Knet.jl is a deep learning package implemented in Julia, so you should be able to run it on any machine that can run Julia. It has been extensively tested on Linux machines with NVIDIA GPUs and CUDA libraries, and it has been reported to work on OSX and Windows. If you would like to try it on your own computer, please follow the instructions on Installation. If you would like to try working with a GPU and do not have access to one, take a look at Using Amazon AWS or Using Microsoft Azure. If...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ConvNeXt V2

    ConvNeXt V2

    Code release for ConvNeXt V2 model

    ...A key innovation is a new Global Response Normalization (GRN) layer added to the ConvNeXt backbone, which enhances feature competition across channels. The result is a convnet that competes strongly with transformer architectures on recognition benchmarks while being efficient and hardware-friendly. The repository provides official PyTorch implementations for multiple model sizes (Atto, Femto, Pico, up through Huge), conversion from JAX weights, code for pretraining/fine-tuning, and pretrained checkpoints. It supports both self-supervised pretraining and supervised fine-tuning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    LSTMs for Human Activity Recognition

    LSTMs for Human Activity Recognition

    Human Activity Recognition example using TensorFlow on smartphone

    LSTM-Human-Activity-Recognition is a machine learning project that demonstrates how recurrent neural networks can be used to recognize human activities from sensor data. The repository implements a deep learning model based on Long Short-Term Memory (LSTM) networks to classify physical activities using time-series data collected from wearable sensors. The project uses the well-known Human Activity Recognition dataset derived from smartphone accelerometer and gyroscope signals. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    ImageAI

    ImageAI

    A python library built to empower developers

    ImageAI is an easy-to-use Computer Vision Python library that empowers developers to easily integrate state-of-the-art Artificial Intelligence features into their new and existing applications and systems. It is used by thousands of developers, students, researchers, tutors and experts in corporate organizations around the world. You will find features supported, links to official documentation as well as articles on ImageAI. ImageAI is widely used around the world by professionals,...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 11
    tgcf

    tgcf

    The ultimate tool to automate custom telegram message forwarding

    The ultimate tool to automate custom telegram message forwarding. Live-syncer, Auto-poster, backup-bot, cloner, chat-forwarder, duplicator, ... Call it whatever you like! tgcf is an advanced telegram chat forwarding automation tool that can fulfill all your custom needs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Pattern Recognition and Machine Learning

    Pattern Recognition and Machine Learning

    Repository of notes, code and notebooks in Python

    Pattern Recognition and Machine Learning is an open-source repository that provides Python implementations and interactive notebooks for algorithms presented in the book Pattern Recognition and Machine Learning by Christopher Bishop. The project recreates many of the mathematical concepts and diagrams from the book using executable Jupyter notebooks, allowing readers to experiment directly with the algorithms described in the text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    FaceRecognitionDotNet

    FaceRecognitionDotNet

    The world's simplest facial recognition api for .NET on Windows, MacOS

    The world's simplest facial recognition API for .NET on Windows, MacOS, and Linux.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    ASRT Speech Recognition

    ASRT Speech Recognition

    A Deep-Learning-Based Chinese Speech Recognition System

    ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Open Speech Corpora

    Open Speech Corpora

    A list of accessible speech corpora for ASR, TTS

    Open Speech Corpora is a curated catalog of speech datasets intended to support research and development in automatic speech recognition, text-to-speech, and other speech technologies. The repository is organized as a set of tables that list corpora along with their languages, total hours, number of speakers, download links, and licenses, giving practitioners a quick way to find data that matches their needs. It emphasizes free and truly “open” datasets, favoring those released under Creative Commons or community-friendly data licenses, though it also lists corpora that are accessible for research and many commercial uses. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Hello AI World

    Hello AI World

    Guide to deploying deep-learning inference networks

    ...In just a couple of hours, you can have a set of deep learning inference demos up and running for realtime image classification and object detection on your Jetson Developer Kit with JetPack SDK and NVIDIA TensorRT. The tutorial focuses on networks related to computer vision, and includes the use of live cameras. You’ll also get to code your own easy-to-follow recognition program in Python or C++, and train your own DNN models onboard Jetson with PyTorch. Ready to dive into deep learning? It only takes two days. We’ll provide you with all the tools you need, including easy to follow guides, software samples such as TensorRT code, and even pre-trained network models including ImageNet and DetectNet examples. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ConvNeXt

    ConvNeXt

    Code release for ConvNeXt model

    ...It revisits classic ResNet-style backbones through the lens of transformer design trends—large kernel sizes, inverted bottlenecks, layer normalization, and GELU activations—to bridge the performance gap between convolutions and attention-based models. ConvNeXt’s clean, hierarchical structure makes it efficient for both pretraining and fine-tuning across a wide range of visual recognition tasks. It achieves competitive or superior results on ImageNet and downstream datasets while being easier to deploy and train than transformers. The repository provides pretrained models, training recipes, and ablation studies demonstrating how incremental design choices collectively yield state-of-the-art performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    VideoSrt

    VideoSrt

    Windows-GUI

    ...Recognize video/audio speech to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. Video recognition does not need to upload the original video, which is convenient, fast and time-saving.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 19
    Flashlight library

    Flashlight library

    A C++ standalone library for machine learning

    Flashlight is a fast, flexible machine learning library written entirely in C++ by Facebook AI Research and the creators of Torch, TensorFlow, Eigen, and Deep Speech. Native support in C++ and simple extensibility make Flashlight a powerful research framework that's hackable to its core and enables fast iteration on new experimental setups and algorithms with little unopinionated and without sacrificing performance. In a single repository, Flashlight provides apps for research across...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    ml5.js

    ml5.js

    Friendly machine learning for the web

    A neighborly approach to creating and exploring artificial intelligence in the browser. ml5.js aims to make machine learning approachable for a broad audience of artists, creative coders, and students. The library provides access to machine learning algorithms and models in the browser, building on top of TensorFlow.js with no other external dependencies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    PandaOCR

    PandaOCR

    Multifunctional OCR Image and Text Recognition

    ...If you have higher needs, you can try the professional version. You can also use the Baidu API interface without activation. Support shortcut keys and screen corner trigger screenshot recognition function, convenient and fast.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Meihu-FaceBeauty-Live

    Meihu-FaceBeauty-Live

    Beauty can be applied to live broadcasts, short videos, and selfies

    Meihu beauty sdk is a mobile sdk with face recognition technology as the core, providing professional-grade real-time beauty, big eyes and face reduction, beauty filters, dynamic stickers and other filters, to create a multi-functional video beauty software The goal is to fully meet the beautification needs of customers in many audio and video software business scenarios such as live beauty and short video beauty.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Yoha

    Yoha

    A practical hand tracking engine

    Yoha is a browser-based hand tracking engine designed to enable real-time gesture recognition and interaction using standard webcams, making it accessible for web applications without specialized hardware. Built using JavaScript and TensorFlow.js, it runs directly in the browser and performs inference on-device, eliminating the need for server-side processing. The engine is capable of detecting 21 two-dimensional hand landmarks, allowing developers to build applications that respond to gestures such as pinching or forming a fist. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    DeepStack

    DeepStack

    The World's Leading Cross Platform AI Engine for Edge Devices

    DeepStack is an AI API engine that serves pre-built models and custom models on multiple edge devices locally or on your private cloud. DeepStack runs completely offline and independent of the cloud. You can also install and run DeepStack on any cloud VM with docker installed to serve as your private, state-of-the-art and real-time AI server.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    html2canvas

    html2canvas

    A JavaScript HTML screenshot renderer

    html2canvas is a JavaScript HTML renderer. The script provides you with the tools to take screenshots of webpages directly on the browser. The screenshot is based on the DOM and therefore, it may not be 100% accurate to the real representation, given that it is not an actual screenshot, but a type of screenshot built based on the available data and information of the page. The script renders such page as a canvas image, by reading the DOM and the different styles of the featured elements. It...
    Downloads: 9 This Week
    Last Update:
    See Project
Auth0 Logo