Showing 204 open source projects for "recognition"

  • 1
    anaGo

    Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition

    anaGo is a Python library for sequence labeling (NER, PoS tagging, ...), implemented in Keras. It can solve sequence labeling tasks such as named entity recognition (NER), part-of-speech (POS) tagging, and semantic role labeling (SRL). Unlike traditional sequence labeling solvers, anaGo does not require language-dependent features, so it can easily be used for any language. The simplest type of model in anaGo is the Sequence model, which includes essential methods such as fit, score, analyze and save/load. ...
    Downloads: 0 This Week
  • 2
    mAP

    Evaluates the performance of your neural net for object recognition

    In practice, a higher mAP value indicates better performance of your neural net, given your ground truth and set of classes. Performance is judged using the mAP criteria defined in the PASCAL VOC 2012 competition. We simply adapted the official Matlab code into Python (in our tests both give the same results). First, your neural net's detection results are sorted by decreasing confidence and assigned to ground-truth objects. We have "a match" when they...
    Downloads: 0 This Week
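The mAP calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the project's code: the function names `average_precision` and `mean_average_precision` are hypothetical, and the sketch assumes the matching step has already been done, so each detection arrives as a true/false flag in order of decreasing confidence. It uses the area under the monotonically decreasing precision-recall envelope, which is the commonly cited PASCAL VOC 2012 style of computing AP.

```python
def average_precision(is_true_positive, num_ground_truth):
    """AP for one class.

    is_true_positive: list of booleans, one per detection, already sorted
    by decreasing confidence (assumed input; matching is done elsewhere).
    num_ground_truth: number of ground-truth objects of this class.
    """
    tp = 0
    fp = 0
    # Sentinel points at both ends of the precision-recall curve.
    recalls = [0.0]
    precisions = [0.0]
    for hit in is_true_positive:
        if hit:
            tp += 1
        else:
            fp += 1
        recalls.append(tp / num_ground_truth)
        precisions.append(tp / (tp + fp))
    recalls.append(1.0)
    precisions.append(0.0)

    # Make precision monotonically decreasing (right-to-left running max).
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Sum the area of the rectangles where recall changes.
    ap = 0.0
    for i in range(1, len(recalls)):
        ap += (recalls[i] - recalls[i - 1]) * precisions[i]
    return ap


def mean_average_precision(per_class):
    """per_class maps class name -> (is_true_positive list, num_ground_truth)."""
    aps = [average_precision(flags, n) for flags, n in per_class.values()]
    return sum(aps) / len(aps)
```

For example, `average_precision([True, False, True], 2)` covers a class with two ground-truth objects and three detections, the second of which is a false positive.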
  • 3
    Siamese and triplet learning

    Siamese and triplet networks with online triplet mining in PyTorch

    ...These types of networks learn to map images into a compact feature space where the distance between vectors reflects the similarity between inputs. Such embeddings are commonly used in applications like face recognition, image similarity search, and few-shot learning. The repository demonstrates how to train these models using contrastive loss and triplet loss functions, which encourage embeddings of similar samples to be close while pushing dissimilar samples farther apart. It includes data loaders, training scripts, neural network architectures, and evaluation metrics that allow researchers to experiment with different embedding learning strategies. ...
    Downloads: 0 This Week
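The two loss functions mentioned above can be written out on plain Python lists to show what "close" and "farther apart" mean numerically. This is a sketch of one common textbook formulation, not necessarily the one the repository uses (implementations differ, for instance in whether they use squared distances); the function names and the default `margin` are assumptions for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def contrastive_loss(a, b, same, margin=1.0):
    """Pairwise loss: pull similar pairs together, push dissimilar
    pairs apart until they are at least `margin` away."""
    d = euclidean(a, b)
    if same:
        return 0.5 * d ** 2
    return 0.5 * max(0.0, margin - d) ** 2


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Triplet loss: the anchor should be closer to the positive than
    to the negative by at least `margin`; otherwise the loss is positive."""
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)
```

A triplet whose negative is already far from the anchor contributes zero loss, which is why online triplet mining looks for harder triplets within each batch.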
  • 4
    Neural Network signal recognition rtlsdr

    Deep learning signal classification (recognition) using rtl-sdr dongle

    WARNING: This version is outdated. Everything has been moved to GitHub: https://github.com/randaller/cnn-rtlsdr
    Downloads: 3 This Week
  • 5
    FM2TXT

    Listens to radio via rtl-sdr, recognizes speech, and writes a text log file

    Log speech from your favorite FM station to a text file using an rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation: https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer the source on GitHub rather than SourceForge: https://github.com/randaller/fm2txt For those who want to recognize speech from a sound card instead of rtl-sdr (this allows transcribing NFM, etc.): https://github.com/randaller/souncard2txt
    Downloads: 1 This Week
  • 6
    Lip Reading

    Cross Audio-Visual Recognition using 3D Architectures

    ...This code provides an implementation of Coupled 3D Convolutional Neural Networks for audio-visual matching. Lip-reading is one specific application of this work. Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method for speaker verification in multi-speaker scenarios. AVR systems use the information extracted from one modality to improve recognition in the other modality by filling in the missing information. ...
    Downloads: 5 This Week
  • 7
    JAVT - Just Another Voice Transformer

    Just Another Speech Recognition and Text to Speech software.

    JAVT, or Just Another Voice Transformer (formerly called Just Another Video Transcriber), is speech recognition software that also supports text to speech and simple media conversion. JAVT lets you convert video files to WAV audio files using ffmpeg, and then transcribe the audio to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and have JAVT read it aloud through text-to-speech conversion.
    Downloads: 1 This Week
  • 8

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation. The Millennium ASR provides C++ and Python libraries for automatic speech recognition. It implements a weighted finite state transducer (WFST) decoder as well as training and adaptation methods. These toolkits are meant to facilitate research and development of automatic distant speech recognition.
    Downloads: 2 This Week
  • 9
    Gamera is a framework for the creation of structured document analysis applications by domain experts. It combines a programming library with GUI tools for the training and interactive development of recognition systems.
    Downloads: 0 This Week
  • 10
    TalkMaths is a speech user interface that extends the speech recognition program Dragon NaturallySpeaking by parsing spoken mathematical expressions into MathML and/or LaTeX. TalkMaths allows the user to create documents in MathML or LaTeX hands-free.
    Downloads: 0 This Week
  • 11
    Neural Libs

    Neural network library for developers

    This project includes implementations of MLP, RBF, SOM and Hopfield neural networks in several popular programming languages. It also includes examples of using neural networks for function approximation and time series prediction, as well as a program that makes it easy to test a neural network on training data and to optimize the network.
    Downloads: 0 This Week
  • 12

    piffle

    Speech recognition for Ubuntu

    Speech recognition system for Ubuntu that uses Palaver as its codebase and integrates Pocketsphinx instead of the Google speech API. The codebase is the minimal version of Palaver.
    Downloads: 0 This Week
  • 13

    ProximityForest

    Efficient Approximate Nearest Neighbors for General Metric Spaces

    ..., WACV 2013 (best student paper award). One application of a ProximityForest is given in the following CVPR publication: Stephen O'Hara and Bruce A. Draper, "Scalable Action Recognition with a Subspace Forest," IEEE Conference on Computer Vision and Pattern Recognition, 2012. This source code is provided without warranty and is available under the GPL license. More commercially-friendly licenses may be available. Please contact Stephen O'Hara for license options. Please view the wiki on this site for installation instructions and examples on reproducing the results of the papers.
    Downloads: 0 This Week
  • 14
    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
  • 15
    RNNLIB is a recurrent neural network library for sequence learning problems. Applicable to most types of spatiotemporal data, it has proven particularly effective for speech and handwriting recognition. Full installation and usage instructions are given at http://sourceforge.net/p/rnnl/wiki/Home/
    Downloads: 0 This Week
  • 16

    Domotic Speech-recognition interface

    Speech-recognition interface for a domotic system.

    This product recognizes oral commands and translates them into domotic orders for a domotic system. It does not implement a domotic system itself; it is an interface to be plugged into one. Speech recognition is done by an Arduino UNO board and an EasyVR shield. The available oral commands are generated from a house description file in XML format. The oral commands have to be trained for a specific user. Two interfaces are provided for this purpose: a command line interface and a web application. These interfaces let you view oral commands, train them, and delete trainings.
    Downloads: 0 This Week
  • 17
    Ivolution

    Timelapse creation using Face Recognition

    Ivolution is a face timelapse generator. Feed it a set of images and it will generate a movie with your face centered on the screen. Ivolution uses face detection and modifies the images so that your face keeps the same size and location throughout the movie. Images are processed in chronological order, so you can see your face evolve over time!
    Downloads: 0 This Week
  • 18
    ViAmI-Server

    Pattern recognition for ADL events

    This software uses computer vision algorithms to mine sequence data from telemonitoring data with CBRs. The proposed approach handles changes in behavior, which are detected with sensor/video fusion and occur at radically different time scales, through a CBR at two levels: low and high. The system continuously updates the database with the daily data.
    Downloads: 0 This Week
  • 19
    A toolkit for the optical recognition of Psaltiki 19th century music notation. It is based on and requires the Gamera document image analysis framework (http://gamera.sf.net).
    Downloads: 0 This Week
  • 20
    PyPR - Python Pattern Recognition. A small collection of useful pattern recognition methods. The code is still in its early stages.
    Downloads: 0 This Week
  • 21
    Pythia is a natural language question answering system, which uses Speech Recognition and Text To Speech technologies to communicate with the user.
    Downloads: 0 This Week
  • 22
    Artificial Intelligence program - keyword based chat, personal voice in/out, animated 3D character, natural language recognition and translation, neural network based strong AI, personality, tone recognition
    Downloads: 0 This Week
  • 23
    Recognition of poker playing cards using a webcam and neural networks on Linux
    Downloads: 0 This Week
  • 24
    Orcus smartHome is a voice/sensor/web-interactive home automation, streaming media, security monitoring system with voice recognition, speech synthesis, scheduling via Google calendar, and web interface.
    Downloads: 0 This Week
  • 25
    Voice Conference Manager (VCM) uses VoiceXML and CCXML to control speech recognition, text to speech, and voice biometrics for a telephone conference service. Say the names or numbers of people and VCM places them into the call. It can be hosted on public servers.
    Downloads: 0 This Week