PyTorch code and models for the DINOv2 self-supervised learning
Open-Source Financial Large Language Models
Reference PyTorch implementation and models for DINOv3
OCR expert VLM powered by Hunyuan's native multimodal architecture
Pokee Deep Research Model Open Source Repo
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks
Small 3B-base multimodal model ideal for custom AI on edge hardware