INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Inference script for Oasis 500M
Open-weight, large-scale hybrid-attention reasoning model
Chinese and English multimodal conversational language model
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
High-resolution models for human tasks
Access to Anthropic's safety-first language model APIs
CLIP, Predict the most relevant text snippet given an image
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
4M: Massively Multimodal Masked Modeling
FAIR Sequence Modeling Toolkit 2
A Production-ready Reinforcement Learning AI Agent Library
Official DeiT repository
Foundational Models for State-of-the-Art Speech and Text Translation
Analyze computation-communication overlap in V3/R1
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Repo of Qwen2-Audio chat & pretrained large audio language model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
DeepMind model for tracking arbitrary points across videos & robotics
Global weather forecasting model using graph neural networks and JAX
code for Mesh R-CNN, ICCV 2019
Language modeling in a sentence representation space
An AI-powered security review GitHub Action using Claude