Global weather forecasting model using graph neural networks and JAX
Tool for exploring and debugging transformer model behaviors
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
RGBD video generation model conditioned on camera input
Project Lyra: Open Generative 3D World Models
Audio foundation model excelling in audio understanding
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Easy Docker setup for Stable Diffusion with user-friendly UI
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
FAIR Sequence Modeling Toolkit 2
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
An AI-powered security review GitHub Action using Claude
Open-Source Financial Large Language Models
PyTorch code and models for the DINOv2 self-supervised learning
Open-source large language model family from Tencent Hunyuan
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Visual Causal Flow
The official PyTorch implementation of Google's Gemma models
An experimental version of DeepSeek model
A series of math-specific large language models of our Qwen2 series
Generating Immersive, Explorable, and Interactive 3D Worlds
Open-source multi-speaker long-form text-to-speech model
General-purpose image editing model that delivers high-fidelity