A Customizable Image-to-Video Model based on HunyuanVideo
Global weather forecasting model using graph neural networks and JAX
CodeGeeX2: A More Powerful Multilingual Code Generation Model
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Large Multimodal Models for Video Understanding and Editing
Large language model and vision-language model based on linear attention
Genome modeling and design across all domains of life
General-purpose image editing model that delivers high-fidelity
Inference script for Oasis 500M
FAIR Sequence Modeling Toolkit 2
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Pokee Deep Research Model Open Source Repo
GPT-4V-level open-source multimodal model based on Llama3-8B
State-of-the-art (SoTA) text-to-video pre-trained model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Chat & pretrained large audio language model proposed by Alibaba Cloud
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Open Multilingual Multimodal Chat LMs
Example Discord bot written in Python that uses the completions API
Official repo for consistency models
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Repo for external large-scale work
A minimal PyTorch re-implementation of the OpenAI GPT
Code release for "Masked-attention Mask Transformer