Code for running inference with the SAM 3D Body Model 3DB
AI agents running research on single-GPU nanochat training
Helps scientists define testable, modular, self-documenting dataflow
Custom Home Assistant configuration with automations and scripts setup
Generate Any 3D Scene in Seconds
Quick illustration of how one can easily read books together with LLMs
AI agents running research on single-GPU nanochat training
Generate high-definition story short videos with one click using AI
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
An agentic Machine Learning Engineer
Multilingual Document Layout Parsing in a Single Vision-Language Model
Ultimate meta-skill for generating best-in-class Claude Code skills
Play couplet with seq2seq model
Inference script for Oasis 500M
Open source codebase for Scale Agentex
Automatically translates the text of a video based on a subtitle file
A subtitle generator for Japanese Adult Videos.
Code for the paper Language Models are Unsupervised Multitask Learners
The PyTorch-based audio source separation toolkit for researchers
Distributed training framework for TensorFlow, Keras, PyTorch, etc.
A walk along memory lane
Domain Agnostic Prompts for Savvy Professionals
Discord bot and Interface for Stable Diffusion
BCI: Breast Cancer Immunohistochemical Image Generation
Learning to Act by Watching Unlabeled Online Videos