Practice implementing softmax, attention, GPT-2 and more
Powering Amazon custom machine learning chips
GLM-4 series: Open Multilingual Multimodal Chat LMs
Deep learning library
An open-source Chinese font derived from Fontworks' Klee One
Concatenate a directory full of files into a single prompt
An MCP server for interacting with Google Colab
Local RAG engine for private multimodal knowledge search on devices
Collection of Kaggle Solutions and Ideas
An agentless approach to automatically solve software development
The SOTA Open-Source Browser Agent
Follow along with my AI Agents Masterclass videos
A best practices guide for day 2 operations
A JAX-native LLM Post-Training Library
4M: Massively Multimodal Masked Modeling
PyTorch code and models for V-JEPA self-supervised learning from video
Code to accompany "A Method for Animating Children's Drawings"
The Unified Machine Learning Framework
On-device Speech-to-Intent engine powered by deep learning
dj-stripe automatically syncs your Stripe Data to your local database
Interpretable prompting and models for NLP
Making ALL Software Agent-Native
How to optimize some algorithm in cuda
All-in-one AI framework & toolkit for Claude Code & Cursor
Official code for StoryMem: Multi-shot Long Video Storytelling