RL research on Android devices
Large Multimodal Models for Video Understanding and Editing
MCP integration platforms for AI agents to use tools at any scale
4M: Massively Multimodal Masked Modeling
Guiding Instruction-based Image Editing via Multimodal Large Language
PyTorch code and models for the DINOv2 self-supervised learning
PPTAgent: Generating and Evaluating Presentations
Concatenate a directory full of files into a single prompt
Provider-agnostic, open-source evaluation infrastructure
A Powerful Native Multimodal Model for Image Generation
Audiocraft is a library for audio processing and generation
Fundamentals of Machine Learning and Deep Learning
The official PyTorch implementation of Google's Gemma models
PyTorch code and models for VJEPA2 self-supervised learning from video
Code for the "Language models can explain neurons in language models" paper
Set of tools to assess and improve LLM security
Provides CTP stock options and Zhongtai Securities XTP
Examples and guides for using the OpenAI API
Collection of reference environments, offline reinforcement learning
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Utilities intended for use with Llama models
Flexible Photo Recrafting While Preserving Your Identity
Inference script for Oasis 500M
Framework for building neural networks
Implementation of Vision Transformer, a simple way to achieve SOTA