Mixture-of-Experts Vision-Language Models for Advanced Multimodal
ChatGPT interface with better UI
FAIR Sequence Modeling Toolkit 2
Towards Real-World Vision-Language Understanding
AlphaFold 3 inference pipeline
Models for object and human mesh reconstruction
Easy Docker setup for Stable Diffusion with user-friendly UI
Provides convenient access to the Anthropic REST API from any Python 3
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Fast-stable-diffusion + DreamBooth
An AI-powered security review GitHub Action using Claude
The official PyTorch implementation of Google's Gemma models
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
AI-powered tool to quickly remove watermarks from images flawlessly
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)