Detecting silent model failure. NannyML estimates performance
Implementation of Imagen, Google's Text-to-Image Neural Network
Definitions for AI/ML tasks like dataset creation
Multi-agent autonomous startup system for Claude Code
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
SimpleMem: Efficient Lifelong Memory for LLM Agents
Improve human sleep through scientifically
Collection of reference environments, offline reinforcement learning
LLM training in simple, raw C/CUDA
Less Code, Lower Barrier, Faster Deployment
Code release for Cut and Learn for Unsupervised Object Detection
CLIP, Predict the most relevant text snippet given an image
Technical principles related to large models
An API standard for single-agent reinforcement learning environments
An Easy-to-use, Scalable and High-performance RLHF Framework
A code-first agent framework for seamlessly planning analytics tasks
LLM abstractions that aren't obstructions
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster
General proxy performance testing tool based on Clash using Telegram
MiniSom is a minimalistic implementation of the Self Organizing Maps
Standalone, small, language-neutral
TFX is an end-to-end platform for deploying production ML pipelines
Pretrained (Language) Models for Probabilistic Time Series Forecasting
SkyPilot: Run AI and batch jobs on any infra
GPU environment management and cluster orchestration