Programmatic access to the AlphaGenome model
Qwen2.5-VL is the multimodal large language model series
Python bindings for llama.cpp
FAIR Sequence Modeling Toolkit 2
DeepSeek Coder: Let the Code Write Itself
High-Resolution Image Synthesis with Latent Diffusion Models
An experimental version of DeepSeek model
Phi-3.5 for Mac: Locally-run Vision and Language Models
Python SDK for Claude Agent
State-of-the-art (SoTA) text-to-video pre-trained model
The official repo of Qwen chat & pretrained large language model
Easy Docker setup for Stable Diffusion with user-friendly UI
Hackable and optimized Transformers building blocks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Code for running inference with the SAM 3D Body Model 3DB
Official implementation of Watermark Anything with Localized Messages
MOSS‑TTS Family open‑source speech and sound generation model
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Sharp Monocular Metric Depth in Less Than a Second
code for Mesh R-CNN, ICCV 2019
Uncommon Objects in 3D dataset
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Generating Immersive, Explorable, and Interactive 3D Worlds
Fast-stable-diffusion + DreamBooth
Hunyuan Translation Model Version 1.5