Unifying 3D Mesh Generation with Language Models
Neural Network architecture based on ideas of the original LSTM
CV, NLP, LLM project applications, and advanced engineering deployment
An Open-source Framework for Data-centric Language Agents
Distributed LLM and StableDiffusion inference
The first AI agent that builds permissionless integrations
UCCL is an efficient communication library for GPUs
MobileLLM Optimizing Sub-billion Parameter Language Models
User toolkit for analyzing and interfacing with Large Language Models
Real-time NVIDIA GPU dashboard
A secure low code honeypot framework
Implementation for MatMul-free LM
Accessible large language models via k-bit quantization for PyTorch
High-speed Large Language Model Serving for Local Deployment
Production ready toolkit to run AI locally
Your fully private, open-source, on-device AI assistant
Manages Unified Access to Generative AI Services
An e-book about the real-world application of LLM
Implementation of model parallel autoregressive transformers on GPUs
Training and serving large-scale neural networks
An implementation of model parallel GPT-2 and GPT-3-style models