A Unified Framework for Image Customization
Flexible Photo Recrafting While Preserving Your Identity
A TTS model capable of generating ultra-realistic dialogue
code for Mesh R-CNN, ICCV 2019
PyTorch code and models for VJEPA2 self-supervised learning from video
The ChatGPT Retrieval Plugin lets you easily find personal documents
A simple forecasting package
Best practices on recommendation systems
This repo contains the code for 1D tokenizer and generator
A Universal Customization Method for Single and Multi Conditioning
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Lightweight Python library for adding real-time multi-object tracking
A python library for self-supervised learning on images
Deep learning library
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
An advanced paper search agent powered by large language models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
CV, NLP, LLM project applications, and advanced engineering deployment
Open-source MCP server that gives your coding agent
MCP integration platforms for AI agents to use tools at any scale
Swirl queries any number of data sources with APIs
Anthropic's Interactive Prompt Engineering Tutorial
AIConfig is a config-based framework to build generative AI apps
Release for Improved Denoising Diffusion Probabilistic Models
A fast, powerful, and simple hierarchical vision transformer