A Lightweight Face Recognition and Facial Attribute Analysis
Speech recognition module for Python
Enable AI to control your desktop, mobile and HMI devices
Offline Text To Speech synthesis for python
Towards Human-Sounding Speech
Agent toolkit providing semantic retrieval and editing capabilities
Open source AI Agents hosted on the oTTomator Live Agent Studio
Pre-trained Deep Learning models and demos
The most powerful Android RPA agent framework
An undetectable, powerful, flexible, high-performance Python library
Advanced techniques for RAG systems
A Model Context Protocol server for searching and analyzing arXiv
Guiding Instruction-based Image Editing via Multimodal Large Language
Refer and Ground Anything Anywhere at Any Granularity
Synthetic data generators for tabular and time-series data
The open-source tool for building high-quality datasets
A feature rich discord Modmail bot
SOTA discrete acoustic codec models with 40/75 tokens per second
Implementation of Make-A-Video, new SOTA text to video generator
Creation of a Taylorplot for several machine learning models
A desktop weather app powered by AI
Leading free and open-source liveliness check &face recognition system
Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting
Inference code for Llama models
Visual localization made easy with hloc