CLIP, Predict the most relevant text snippet given an image
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster
OpenDAN is an open source Personal AI OS
Open Source Document Management System for Digital Archives
A game theoretic approach to explain the output of ml models
An Efficient Agentic Model for Computer Use
Phi-3.5 for Mac: Locally-run Vision and Language Models
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Dough is a open source tool for steering AI animations with precision
Defang CLI and sample projects
Experimental, AI/ML-powered and open sourced Marketing Mix Modeling
Documentation for Google's Gen AI site - including Gemini API & Gemma
Talk to ChatGPT via any Matrix client
Real-World Centric Foundation GUI Agents
AI discovers 520000 stable inorganic crystal structures for research
Generating Immersive, Explorable, and Interactive 3D Worlds
Implementation of Video Diffusion Models
Master Claude Code Hooks
Your Fully-Automated Personal AI Assistant
AI-Researcher: Autonomous Scientific Innovation
Tiny vision language model
A very simple framework for state-of-the-art NLP
State-of-the-art Parameter-Efficient Fine-Tuning
Implementation of Imagen, Google's Text-to-Image Neural Network
Detecting silent model failure. NannyML estimates performance