State-of-the-art 2D and 3D Face Analysis Project
NLP Cloud serves high performance pre-trained or custom models for NER
Library for OCR-related tasks powered by Deep Learning
Obsei is a low code AI powered automation tool
Omnilingual ASR Open-Source Multilingual SpeechRecognition
Repo of Qwen2-Audio chat & pretrained large audio language model
Image processing in Python
OCR expert VLM powered by Hunyuan's native multimodal architecture
Replace OpenAI GPT with another LLM in your app
Ready-to-use OCR with 80+ supported languages
Qwen3-Coder is the code version of Qwen3
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
Jittor is a high-performance deep learning framework
Framework for building neural networks
PyTorch code and models for VJEPA2 self-supervised learning from video
The leading agent orchestration platform for Claude
Label, clean and enrich text datasets with LLMs
This repository contains the complete code and data for studying primo
2D and 3D Face alignment library build using pytorch
Img2Txt - Extract Text From Images using AI
Code release for ConvNeXt V2 model
Code release for ConvNeXt model
Kashgari is a production-level NLP Transfer learning framework
Implementation of LambdaNetworks, a new approach to image recognition