Fast inference engine for Transformer models
C++ image processing and machine learning library with using of SIMD
oneAPI Deep Neural Network Library (oneDNN)
BitNet: Scaling 1-bit Transformers for Large Language Models
AIMET is a library that provides advanced quantization and compression
Official inference framework for 1-bit LLMs
Z80-μLM is a 2-bit quantized language model
Oobabooga - The definitive Web UI for local AI, with powerful features
A state-of-the-art open visual language model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
FlashMLA: Efficient Multi-head Latent Attention Kernels
Accessible large language models via k-bit quantization for PyTorch
A scientific machine learning (SciML) wrapper for the FEniCS
A Powerful Native Multimodal Model for Image Generation
Low-code framework for building custom LLMs, neural networks
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
100–200× Acceleration for Video Diffusion Models
PhantomBot is an actively developed open source interactive Twitch bot
Official implementation of Watermark Anything with Localized Messages
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
The leading agent orchestration platform for Claude
OSS standalone ChatGPT client
Open Source Document Management System for Digital Archives
PyTorch library of curated Transformer models and their components
A library for accelerating Transformer models on NVIDIA GPUs