Easy Docker setup for Stable Diffusion with user-friendly UI
Foundation Models for Time Series
Hackable and optimized Transformers building blocks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Repo for SeedVR2 & SeedVR
DeepSeek Coder: Let the Code Write Itself
Inference framework for 1-bit LLMs
LTX-Video Support for ComfyUI
Official implementation of Watermark Anything with Localized Messages
Tool for exploring and debugging transformer model behaviors
State-of-the-art (SoTA) text-to-video pre-trained model
Block Diffusion for Ultra-Fast Speculative Decoding
Revolutionizing Database Interactions with Private LLM Technology
The Clay Foundation Model - An open source AI model and interface
Pretrained time-series foundation model developed by Google Research
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Recovering the Visual Space from Any Views
Hunyuan Translation Model Version 1.5
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
GPT4V-level open-source multi-modal model based on Llama3-8B
Generate Any 3D Scene in Seconds
The official PyTorch implementation of Google's Gemma models
Foundation model for image generation