Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Open-source large language model family from Tencent Hunyuan
PyTorch code and models for VJEPA2 self-supervised learning from video
Detect faces in an image
fast C++ library for linear algebra & scientific computing
A Conversational Speech Generation Model
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Efficient 3D human pose estimation in video using 2D keypoint
VGGFace2 Dataset for Face Recognition
FaceAccess is an Access Control System based on Facial Recognition
Efficient Approximate Nearest Neighbors for General Metric Spaces
Portuguese ASR model fine-tuned on XLSR-53 for 16kHz audio input
Russian ASR model fine-tuned on Common Voice and CSS10 datasets
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks
CTC-based forced aligner for audio-text in 158 languages