Diversity-driven optimization and large-model reasoning ability
Collection of CVPR 2025 papers and open source projects
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Laravel-focused MCP server for augmenting AI powered local development
A simple, open format for guiding coding agents
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Curated list of datasets and tools for post-training
The official Meta Llama 3 GitHub site
Fundamentals of Machine Learning and Deep Learning
Code release for Cut and Learn for Unsupervised Object Detection
CLIP, Predict the most relevant text snippet given an image
Implementation of Vision Transformer, a simple way to achieve SOTA
Generate 3D objects conditioned on text or images
A Powerful Native Multimodal Model for Image Generation
New set of lightweight state-of-the-art, open foundation models
A collection of various deep learning architectures, models, and tips
A MCP for Claude Desktop / Claude Code / Windsurf / Cursor
Open source full-stack AI webapp generator
4M: Massively Multimodal Masked Modeling
Guiding Instruction-based Image Editing via Multimodal Large Language
PyTorch code and models for V-JEPA self-supervised learning from video
PyTorch code and models for the DINOv2 self-supervised learning
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
The official PyTorch implementation of Google's Gemma models
tiktoken is a fast BPE tokeniser for use with OpenAI's models