State-of-the-art 2D and 3D Face Analysis Project
NLP Cloud serves high performance pre-trained or custom models for NER
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
WAFW00F allows one to identify and fingerprint Web App Firewall
Formula recognition based on LaTeX-OCR and ONNXRuntime
Library for OCR-related tasks powered by Deep Learning
Obsei is a low code AI powered automation tool
Repo of Qwen2-Audio chat & pretrained large audio language model
Image processing in Python
Omnilingual ASR Open-Source Multilingual SpeechRecognition
OCR expert VLM powered by Hunyuan's native multimodal architecture
Replace OpenAI GPT with another LLM in your app
A library to generate LaTeX expression from Python code
Ready-to-use OCR with 80+ supported languages
Qwen3-Coder is the code version of Qwen3
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
Framework for building neural networks
Jittor is a high-performance deep learning framework
PyTorch code and models for VJEPA2 self-supervised learning from video
The leading agent orchestration platform for Claude
Label, clean and enrich text datasets with LLMs
This repository contains the complete code and data for studying primo
2D and 3D Face alignment library build using pytorch
Img2Txt - Extract Text From Images using AI