CodeGeeX2: A More Powerful Multilingual Code Generation Model
Release for Improved Denoising Diffusion Probabilistic Models
Qwen2.5-VL is the multimodal large language model series
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Hackable and optimized Transformers building blocks
GLM-4 series: Open Multilingual Multimodal Chat LMs
DeepSeek Coder: Let the Code Write Itself
LTX-Video Support for ComfyUI
Tool for exploring and debugging transformer model behaviors
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A Unified Framework for Text-to-3D and Image-to-3D Generation
Uncommon Objects in 3D dataset
Programmatic access to the AlphaGenome model
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
ChatGPT interface with better UI
Global weather forecasting model using graph neural networks and JAX
GPT-4V-level open-source multi-modal model based on Llama3-8B
Pushing the Limits of Mathematical Reasoning in Open Language Models
Collection of Gemma 3 variants that are trained for performance
Official implementation of Watermark Anything with Localized Messages
Video understanding codebase from FAIR for reproducing video models
State-of-the-art (SoTA) text-to-video pre-trained model
Tooling for the Common Objects In 3D dataset
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
An AI-powered security review GitHub Action using Claude