AirLLM 70B inference with single 4GB GPU
"Big Model" trains a visual multimodal VLM with 26M parameters
LLM training in simple, raw C/CUDA
Tool for exploring and debugging transformer model behaviors
NVR with realtime local object detection for IP cameras
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Autonomous multi-session AI coding
Self-hosted AI accounting app. LLM analyzer for receipts
This repository contains the official implementation of FastVLM
We write your reusable computer vision tools
Skills for Real Engineers. Straight from my .claude directory
Chinese Little Black Weird Text Illustration Generation Skill
Diversity-driven optimization and large-model reasoning ability
Open-source GEO content production system with AI tasks
Sunfish: a Python Chess Engine in 111 lines of code
Reactive library to enable quick and easy development of bots
A light-weight and powerful meta-prompting, context engineering
Qwen2.5-VL is the multimodal large language model series
Personal AI Notebooks. Organize files & webpages and generate notes
The simplest self-building coding agent
Set of comprehensive computer vision & machine intelligence libraries
Python-free Rust inference server
Toolkit for making machine learning and data analysis applications
The official implementation of RAPTOR
Z80-μLM is a 2-bit quantized language model