Pushing the Limits of Mathematical Reasoning in Open Language Models
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Research code artifacts for Code World Model (CWM)
Uncommon Objects in 3D dataset
Provides convenient access to the Anthropic REST API from any Python 3
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Inference framework for 1-bit LLMs
Chat & pretrained large vision language model
GLM-4 series: Open Multilingual Multimodal Chat LMs
Inference code for scalable emulation of protein equilibrium ensembles
The Clay Foundation Model - An open source AI model and interface
Release for Improved Denoising Diffusion Probabilistic Models
Implementation of "MobileCLIP" CVPR 2024
Official implementation of Watermark Anything with Localized Messages
Video understanding codebase from FAIR for reproducing video models
Towards Real-World Vision-Language Understanding
Fast and Universal 3D reconstruction model for versatile tasks
Real-time behaviour synthesis with MuJoCo, using Predictive Control
A PyTorch library for implementing flow matching algorithms
Memory-efficient and performant finetuning of Mistral's models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
The ChatGPT Retrieval Plugin lets you easily find personal documents
Diversity-driven optimization and large-model reasoning ability
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning