An experimental version of the DeepSeek model
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
RGBD video generation model conditioned on camera input
Long-form streaming TTS system for multi-speaker dialogue generation
A state-of-the-art open visual language model
Ling-V2 is an MoE LLM open-sourced by InclusionAI
Inference script for Oasis 500M
FAIR Sequence Modeling Toolkit 2
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
A minimal PyTorch re-implementation of the OpenAI GPT