Unifying 3D Mesh Generation with Language Models
Neural Network architecture based on ideas of the original LSTM
CV, NLP, LLM project applications, and advanced engineering deployment
An Open-source Framework for Data-centric Language Agents
The first AI agent that builds permissionless integrations
MobileLLM Optimizing Sub-billion Parameter Language Models
User toolkit for analyzing and interfacing with Large Language Models
Implementation for MatMul-free LM
Accessible large language models via k-bit quantization for PyTorch
Implementation of model parallel autoregressive transformers on GPUs
Training and serving large-scale neural networks
An implementation of model parallel GPT-2 and GPT-3-style models