C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
The deep learning toolkit for speech-to-text
LLM training code for MosaicML foundation models
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
State-of-the-art Parameter-Efficient Fine-Tuning
Create HTML profiling reports from pandas DataFrame objects
Single-cell analysis in Python
Replace OpenAI GPT with another LLM in your app
Uncover insights, surface problems, monitor, and fine tune your LLM
An Open-Source Programming Framework for Agentic AI
Fast inference engine for Transformer models
Build Production-ready Agentic Workflow with Natural Language
A Pythonic framework to simplify AI service building
Prem provides a unified environment to develop AI applications
A RWKV management and startup tool, full automation, only 8MB
A library for accelerating Transformer models on NVIDIA GPUs
State-of-the-art diffusion models for image and audio generation
Implementation of model parallel autoregressive transformers on GPUs
CPU/GPU inference server for Hugging Face transformer models
Visual Instruction Tuning: Large Language-and-Vision Assistant
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Operating LLMs in production
A scalable inference server for models optimized with OpenVINO
Superduper: Integrate AI models and machine learning workflows