NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A Multi-Modal World Model for Reconstruction, Generation, and Simulation
Unified Multimodal Understanding and Generation Models
Code for Mesh R-CNN, ICCV 2019
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
An AI-powered security review GitHub Action using Claude
A Powerful Native Multimodal Model for Image Generation
Math-specific large language models in our Qwen2 series
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Repo of Qwen2-Audio chat & pretrained large audio language model
High-Fidelity and Controllable Generation of Textured 3D Assets
State-of-the-art (SoTA) text-to-video pre-trained model
OCR expert VLM powered by Hunyuan's native multimodal architecture
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Inference code for scalable emulation of protein equilibrium ensembles
The Clay Foundation Model - An open source AI model and interface
Audio foundation model excelling in audio understanding
PyTorch implementation of JiT
My personal Claude Code configuration
Tiny vision language model
The official PyTorch implementation of Google's Gemma models
Production-tested AI infrastructure tools
Programmatic access to the AlphaGenome model
26M function-calling model that runs on incredibly small devices
Open Source Speech Language Model