Guiding Instruction-based Image Editing via Multimodal Large Language
PyTorch code and models for V-JEPA self-supervised learning from video
PyTorch code and models for DINOv2 self-supervised learning
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Generate 3D objects conditioned on text or images
A Powerful Native Multimodal Model for Image Generation
Implementation of Vision Transformer, a simple way to achieve SOTA
Audiocraft is a library for audio processing and generation
LLM-powered fuzzing via OSS-Fuzz
Set of tools to assess and improve LLM security
The official PyTorch implementation of Google's Gemma models
Volcano Engine Reinforcement Learning for LLMs
The best ChatGPT that $100 can buy
PPTAgent: Generating and Evaluating Presentations
Learn AI and LLMs from scratch using free resources
PyTorch code and models for VJEPA2 self-supervised learning from video
Official code for Style Aligned Image Generation via Shared Attention
Utilities intended for use with Llama models
Provides code for running inference with the SegmentAnything Model
Anthropic's Interactive Prompt Engineering Tutorial
Official implementation of DreamCraft3D
Examples and guides for using the OpenAI API
Implementation of the Surya Foundation Model for Heliophysics
The repository provides code for running inference with SAM 2
Code for the "Language models can explain neurons in language models" paper