4M: Massively Multimodal Masked Modeling
A Customizable Image-to-Video Model based on HunyuanVideo
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code
Provides CTP stock options and Zhongtai Securities XTP
RGBD video generation model conditioned on camera input
Official Implementation for "Recursive Multi-Agent Systems"
AI Agent Source Code Deep Research Report
Fast State-of-the-Art Static Embeddings
Easily compute CLIP embeddings and build a CLIP retrieval system
A comprehensive quantitative trading system with AI-powered analysis
Sandbox for training deep learning networks
All course materials for the Zero to Mastery Machine Learning
Curated list of data science interview questions and answers
A curated list of applied machine learning and data science notebooks
Bridging LLM and Recommender System
Open-source evaluation toolkit of large multi-modality models (LMMs)
Ultimate meta-skill for generating best-in-class Claude Code skills
Block Diffusion for Ultra-Fast Speculative Decoding
Video understanding codebase from FAIR for reproducing video models
A Multi-Modal World Model for Reconstruction, Generation, and Simulation
Unifying 3D Mesh Generation with Language Models
AI Agent designed to interact with ROS1- and ROS2-based robotics systems
PyTorch code and models for VJEPA2 self-supervised learning from video
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Achieving 3+ generation speedup on reasoning tasks