Neural Network architecture based on ideas of the original LSTM
CV, NLP, LLM project applications, and advanced engineering deployment
An Open-source Framework for Data-centric Language Agents
The first AI agent that builds permissionless integrations
Implementation for MatMul-free LM
User toolkit for analyzing and interfacing with Large Language Models
Accessible large language models via k-bit quantization for PyTorch
MobileLLM Optimizing Sub-billion Parameter Language Models
Training and serving large-scale neural networks