Proxy that exposes Antigravity provided claude / gemini models
GLM-4 series: Open Multilingual Multimodal Chat LMs
Industrial-level controllable zero-shot text-to-speech system
Code for running inference with the SAM 3D Body Model 3DB
Renderer for the harmony response format to be used with gpt-oss
My personal Claude Code configuration
AlphaFold 3 inference pipeline
gpt-oss-120b and gpt-oss-20b are two open-weight language models
PyTorch code and models for the DINOv2 self-supervised learning
New family of code large language models (LLMs)
Tooling for the Common Objects In 3D dataset
Tongyi Deep Research, the Leading Open-source Deep Research Agent
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Programmatic access to the AlphaGenome model
Block Diffusion for Ultra-Fast Speculative Decoding
Flux 2 image generation model pure C inference
Python SDK for Claude Agent
Tool for exploring and debugging transformer model behaviors
A Unified Framework for Text-to-3D and Image-to-3D Generation
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Sharp Monocular Metric Depth in Less Than a Second
GPT4V-level open-source multi-modal model based on Llama3-8B
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Chat & pretrained large vision language model