Official inference framework for 1-bit LLMs
Accurate × Fast × Comprehensive
An efficient forwarding service designed for LLMs
Multi-agent autonomous startup system for Claude Code
High-performance Inference and Deployment Toolkit for LLMs and VLMs
slime is an LLM post-training framework for RL Scaling
The best ChatGPT that $100 can buy
FAIR Sequence Modeling Toolkit 2
An implementation of a deep learning recommendation model (DLRM)
Hackable and optimized Transformers building blocks
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Multi-Agent daTa geneRation Infra and eXperimentation framework
Tensor search for humans
Large language model and vision-language model based on Linear Attention
Official DeiT repository
A fast, powerful, and simple hierarchical vision transformer
A collection of reference Jupyter notebooks and demo AI/ML applications
Serving multiple LoRA-finetuned LLMs as one
Building Mixture-of-Experts from LLaMA with Continual Pre-training
Framework for Accelerating LLM Generation with Multiple Decoding Heads
This repository contains the official implementation of research
Fast Forward Computer Vision (and other ML workloads!)
Feature selection and deep learning modeling for omic biomarker study
Reference implementation of the Transformer architecture optimized
Generate embeddings from large-scale graph-structured data