Python inference and LoRA trainer package for the LTX-2 audio–video
Video understanding codebase from FAIR for reproducing video models
High-resolution models for human tasks
Language modeling in a sentence representation space
Contexts Optical Compression
Industrial-level controllable zero-shot text-to-speech system
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Pokee Deep Research Model Open Source Repo
My personal Claude Code configuration
Unified Multimodal Understanding and Generation Models
Block Diffusion for Ultra-Fast Speculative Decoding
ICLR2024 Spotlight: curation/training code, metadata, distribution
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Open-source, high-performance Mixture-of-Experts large language model
Official repo for consistency models
Official PyTorch Implementation of "Scalable Diffusion Models"
Facebook AI Research Sequence-to-Sequence Toolkit
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201