Any model. Any hardware. Zero compromise
Open-source RAG framework for building scalable, modular AI apps
On-premises, OCR-free unstructured data extraction
Open-source, modern-design AI training tracking and visualization
Open-source industrial-grade ASR models
Foundation model for image generation
Fast-stable-diffusion + DreamBooth
Ultimate meta-skill for generating best-in-class Claude Code skills
Motion-controllable Video Generation via Latent Trajectory Guidance
Hunyuan Translation Model Version 1.5
Persistent context and multi-instance coordination
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
"Big Model": trains a 26M-parameter multimodal VLM
Fast and accurate AI-powered file content type detection
Implementation of "MobileCLIP" (CVPR 2024)
VMZ: Model Zoo for Video Modeling
Official implementation of Watermark Anything with Localized Messages
Video understanding codebase from FAIR for reproducing video models
CLIP: predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
Conditional GAN for generating synthetic tabular data
ETL framework to index data for AI use cases such as RAG
Operating LLMs in production
Automatically translates the text of a video based on a subtitle file