Offline speech recognition API for Android, iOS, Raspberry Pi
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-to-text-to-speech; sends text as OSC messages
Speech-to-text, text-to-speech, and speaker recognition
Port of OpenAI's Whisper model in C/C++
Speech recognition module for Python
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Captcha solver extension for humans
OpenVINO™ Toolkit repository
Lightweight Face Recognition and Facial Attribute Analysis
C++ library for high-performance inference on NVIDIA GPUs
A PyTorch-based Speech Toolkit
Han Language Processing
Toolkit for conversational AI
Statistical machine intelligence and learning engine
Training data (data labeling, annotation, workflow) for all data types
On-device Speech Recognition for Apple Silicon
Data manipulation and transformation for audio signal processing
Assistant SDK to build a multimodal conversational UX for Android
In-App assistant SDK to build a multimodal conversational UX for iOS
Multilingual Automatic Speech Recognition with word-level timestamps
NLP Cloud serves high-performance pre-trained or custom models for NER
Convert AI papers to GUI
Industrial-strength Natural Language Processing (NLP)
Replace OpenAI GPT with another LLM in your app