Showing 528 open source projects for "recognition"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    NodeTool

    NodeTool

    Visual AI Workflow Builder

    NodeTool is an open‑source, visual AI workflow builder that lets you connect nodes for text, images, audio, video, data, and automation—then run them locally or on the cloud. Build multi‑step agents, RAG systems, and creative media pipelines without coding, inspect execution in real time, and deploy anywhere: home server, private VPC, RunPod, or Cloud Run. With a local‑first design, NodeTool keeps models and data under your control while still supporting providers like OpenAI, Anthropic,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Hiera

    Hiera

    A fast, powerful, and simple hierarchical vision transformer

    Hiera is a hierarchical vision transformer designed to be fast, simple, and strong across image and video recognition tasks. The core idea is to use straightforward hierarchical attention with a minimal set of architectural “bells and whistles,” achieving competitive or superior accuracy while being markedly faster at inference and often faster to train. The repository provides installation options (from source or Torch Hub), a model zoo with pre-trained checkpoints, and code for evaluation and fine-tuning on standard benchmarks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    OpenKYC - FaceOnLive Community Project

    OpenKYC - FaceOnLive Community Project

    FaceOnLive Open KYC: Streamlining Identity Verification with AI

    ...With a commitment to leveraging the latest advancements in biometric technology, our platform presents a comprehensive solution encompassing cutting-edge features such as face recognition, face liveness detection, and ID document recognition. By seamlessly integrating these powerful tools, we empower businesses across industries to streamline their KYC processes with unparalleled accuracy and efficiency. At the heart of our initiative lies an open-source UI flow, meticulously designed to provide users with an intuitive and seamless experience throughout the identity verification journey. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • 5
    MMDetection

    MMDetection

    An open source object detection toolbox based on PyTorch

    MMDetection is an open source object detection toolbox that's part of the OpenMMLab project developed by Multimedia Laboratory, CUHK. It stems from the codebase developed by the MMDet team, who won the COCO Detection Challenge in 2018. Since that win this toolbox has continuously been developed and improved. MMDetection detects various objects within a given image with high efficiency. Its training speed is comparable or even faster than those of other codebases like Detectron2 and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Amica

    Amica

    Amica is an open source interface for interactive communication

    ...Under the hood, Amica leverages modern web and desktop technologies: three.js and three-vrm for 3D rendering, Transformers.js for running models in the browser, Whisper and Silero VAD for speech recognition and voice-activity detection, and a variety of LLM backends such as llama.cpp servers, ChatGPT-compatible APIs, Ollama, KoboldCpp, and others. It also integrates multiple text-to-speech providers, including ElevenLabs, OpenAI, Coqui, RVC, and AllTalkTTS.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 7

    Image To Text tools

    ITTT is a Free tool designed to Scan and extract Text from Images.

    ...Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 8
    Obsei

    Obsei

    Obsei is a low code AI powered automation tool

    Obsei is an automated no-code/low-code AI-powered text observation and analysis framework, designed for extracting insights from unstructured text data such as social media, reviews, and logs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Liveliness and Face Identification

    Leading free and open-source liveliness check &face recognition system

    ...If user's pic is in DB, it will show the matching name or else you can upload your pic with name to do detection. Application has many uses like door lock, attendance system or any similar identification usages. Face Recognition is highly accurate and simplest application
    Downloads: 1 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 10
    Maia
    MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    pattern_classification

    pattern_classification

    A collection of tutorials and examples for solving machine learning

    The pattern_classification repository is an educational project that provides tutorials, examples, and reference materials related to machine learning and statistical pattern recognition. The project aims to help learners understand the process of building predictive models by presenting structured explanations and practical examples. It includes notebooks and guides that demonstrate data preprocessing, feature extraction, model training, and evaluation techniques used in machine learning workflows. The repository also covers algorithms such as Bayesian classification, logistic regression, neural networks, clustering methods, and ensemble models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    MMAction2

    MMAction2

    OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

    ...One can easily construct a customized video understanding framework by combining different modules. Support four major video understanding tasks: MMAction2 implements various algorithms for multiple video understanding tasks, including action recognition, action localization, Spatio-temporal action detection, and skeleton-based action detection. We support 27 different algorithms and 20 different datasets for the four major tasks. We provide detailed documentation and API reference, as well as unit tests. We support Multigrid on Kinetics400, achieve 76.07% Top-1 accuracy and accelerate training speed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Autolabel

    Autolabel

    Label, clean and enrich text datasets with LLMs

    Autolabel is a Python library to label, clean and enrich datasets with Large Language Models (LLMs). Autolabel data for NLP tasks such as classification, question-answering and named entity recognition, entity matching and more. Seamlessly use commercial and open-source LLMs from providers such as OpenAI, Anthropic, HuggingFace, Google and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    funNLP

    funNLP

    Resources, corpora, and tools for Chinese natural language processing

    ...It aggregates datasets, lexicons, wordlists, sentiment dictionaries, knowledge graphs, and pretrained model references, serving as a one-stop resource hub for Chinese NLP practitioners. The repository is organized into categories such as sentiment analysis, text classification, named entity recognition, knowledge graphs, and various lexicons (e.g. sensitive words, emotion dictionaries, stopwords). It also includes links to academic papers, open-source model implementations, and practical utilities like word segmentation or text cleaning scripts. The project is highly community-oriented, frequently updated with contributions and new resources, and it’s widely used in both academic and applied NLP research. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Acharya

    Acharya

    A Data Centric annotation tool for your Named Entity Recognition

    A data-centric annotation tool to increase the accuracy of your Named Entity Recognition projects which helps rapidly identify and fix labeling errors in your dataset. Import/export datasets in multiple formats, train a model and use it to aid in the annotation process. Setup an MLOps pipeline to experiment with different algorithms on the same data and increase their accuracy and performance in a data-centric way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    MMOCR

    MMOCR

    OpenMMLab Text Detection, Recognition and Understanding Toolbox

    MMOCR is an open-source toolbox based on PyTorch and mmdetection for text detection, text recognition, and the corresponding downstream tasks including key information extraction. It is part of the OpenMMLab project. The toolbox supports not only text detection and text recognition, but also their downstream tasks such as key information extraction. The toolbox supports a wide variety of state-of-the-art models for text detection, text recognition and key information extraction. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Exadel CompreFace

    Exadel CompreFace

    Leading free and open-source face recognition system

    Exadel CompreFace is a free and open-source face recognition GitHub project. Essentially, it is a docker-based application that can be used as a standalone server or deployed in the cloud. You don’t need prior machine learning skills to set up and use CompreFace. The system provides REST API for face recognition, face verification, face detection, face mask detection, landmark detection, age, and gender recognition.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 18
    Chat with GPT

    Chat with GPT

    An open-source ChatGPT app with a voice

    ...Users can review past chat sessions, modify system prompts, and adjust model parameters such as temperature to control response creativity. The platform also integrates speech capabilities by connecting to text-to-speech systems and speech recognition engines, enabling voice-based conversations with the AI assistant. Additional features include message editing, response regeneration, and the ability to share conversations through public links.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Promptify

    Promptify

    se GPT or other prompt based models to get structured output

    ...Instead of manually crafting prompts for each task, Promptify introduces a unified architecture that combines prompt templates, language model interfaces, and processing pipelines into a single framework. This approach allows developers to perform tasks such as text classification, named entity recognition, question answering, and information extraction using consistent prompt templates. The library supports integration with multiple large language model providers, enabling users to experiment with various models without changing their overall workflow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Paper-with-Code-of-Wireless-comm

    Paper-with-Code-of-Wireless-comm

    Paper-with-Code-of-Wireless-communication-Based-on-DL

    Paper-with-Code-of-Wireless-communication-Based-on-DL is a curated repository that collects research papers and corresponding code implementations related to the application of deep learning in wireless communication systems. The project aims to help researchers and graduate students quickly find reproducible implementations of algorithms used in modern communication research. Wireless communication research has increasingly adopted deep learning techniques to address complex tasks such as...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    wukong-robot

    wukong-robot

    Chinese voice dialogue robot/smart speaker project

    wukong-robot is a Chinese voice assistant / smart speaker project built to let makers and hackers design highly customizable voice-controlled devices. It combines wake-word detection, automatic speech recognition, natural language understanding, and text-to-speech into a single framework aimed at the Chinese-speaking ecosystem. The project is positioned as a simple, flexible, and elegant platform that can run on devices like Raspberry Pi and other Linux-based boards, making it suitable for DIY smart speakers and home-automation hubs. It supports multi-turn conversational capabilities powered by ChatGPT or other large language models, letting users have continuous dialogues rather than one-shot commands. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    daily-paper-computer-vision

    daily-paper-computer-vision

    Document papers compiled daily in computer vision/deep learning

    This repo is a running feed of computer-vision research, tracking new papers and notable results so practitioners can keep up without scouring multiple sites. It’s organized chronologically and often thematically, making it easy to scan what’s new in detection, segmentation, recognition, generative vision, 3D, and video understanding. The cadence is intentionally frequent, reflecting how quickly CV advances and how hard it is to maintain awareness while working full time. By aggregating paper titles and references in one place, it reduces the overhead of deciding what to read next and helps you spot trends early. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    TaskMatrix

    TaskMatrix

    Enable sending and receiving images during chatting

    ...Originally introduced alongside the Visual ChatGPT concept, TaskMatrix acts as an orchestration framework where a central language model delegates subtasks to domain-specific AI systems such as image generators, segmentation tools, or recognition models. The architecture focuses on modularity, allowing new APIs and foundation models to be integrated as interchangeable task-solving components. The project also explores low-code human-AI interaction workflows that improve controllability and transparency during complex task execution.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    TF2DeepFloorplan

    TF2DeepFloorplan

    TF2 Deep FloorPlan Recognition using a Multi-task Network

    TF2 Deep FloorPlan Recognition using a Multi-task Network with Room-boundary-Guided Attention. Enable tensorboard, quantization, flask, tflite, docker, github actions and google colab. This repo contains a basic procedure to train and deploy the DNN model suggested by the paper 'Deep Floor Plan Recognition using a Multi-task Network with Room-boundary-Guided Attention'.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Vosk Desktop

    Vosk Desktop

    Desktop software for controlling the Vosk Speech Recognition Toolkit

    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo