Stable Virtual Camera: Generative View Synthesis with Diffusion Models
ChatGPT interface with better UI
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
An AI-powered security review GitHub Action using Claude
Dataset of GPT-2 outputs for research in detection, biases, and more
Fast and Universal 3D reconstruction model for versatile tasks
Official code for Style Aligned Image Generation via Shared Attention
4M: Massively Multimodal Masked Modeling
Foundation Models for Time Series
FAIR Sequence Modeling Toolkit 2
A Production-ready Reinforcement Learning AI Agent Library
A PyTorch library for implementing flow matching algorithms
Official DeiT repository
Hackable and optimized Transformers building blocks
Memory-efficient and performant finetuning of Mistral's models
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Diffusion Transformer with Fine-Grained Chinese Understanding
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Unified Multimodal Understanding and Generation Models
DeepMind model for tracking arbitrary points across videos & robotics
Global weather forecasting model using graph neural networks and JAX
Tooling for the Common Objects In 3D dataset
code for Mesh R-CNN, ICCV 2019
Uncommon Objects in 3D dataset