FAIR Sequence Modeling Toolkit 2
An implementation of a deep learning recommendation model (DLRM)
Hackable and optimized Transformers building blocks
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
A GPU-accelerated library containing highly optimized building blocks
QVAC Fabric: cross-platform LLM inference and fine-tuning
Mooncake is the serving platform for Kimi
Multi-Agent daTa geneRation Infra and eXperimentation framework
Tensor search for humans
Simple and distributed Machine Learning
Large-language-model & vision-language-model based on Linear Attention
Official DeiT repository
A fast, powerful, and simple hierarchical vision transformer
A collection of reference Jupyter notebooks and demo AI/ML application
Lightweight inference library for ONNX files, written in C++
Serving multiple LoRA finetuned LLM as one
Building Mixture-of-Experts from LLaMA with Continual Pre-training
Framework for Accelerating LLM Generation with Multiple Decoding Heads
This repository contains the official implementation of research
Fast Forward Computer Vision (and other ML workloads!)
Transformer related optimization, including BERT, GPT
A High Performance Library for Sequence Processing and Generation
Feature selection and deep learning modeling for omic biomarker study
Efficient approximate nearest neighbor search algorithm collections
Fast and user-friendly runtime for transformer inference