Large Multimodal Models for Video Understanding and Editing
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Official implementation of Watermark Anything with Localized Messages
Genome modeling and design across all domains of life
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Pokee Deep Research Model Open Source Repo
FAIR Sequence Modeling Toolkit 2
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Open-source framework for intelligent speech interaction
RGBD video generation model conditioned on camera input
Pushing the Limits of Mathematical Reasoning in Open Language Models
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Example Discord bot written in Python that uses the completions API
Code for the paper "Hybrid Spectrogram and Waveform Source Separation"
Let us control diffusion models
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A latent text-to-image diffusion model
A collection of high-quality models for the MuJoCo physics engine
Generate embeddings from large-scale graph-structured data
Large language model providing reasoning and conversational capabilities