Fast inference engine for Transformer models
C++ image processing and machine learning library using SIMD
oneAPI Deep Neural Network Library (oneDNN)
Official inference framework for 1-bit LLMs
AIMET is a library that provides advanced quantization and compression
BitNet: Scaling 1-bit Transformers for Large Language Models
Z80-μLM is a 2-bit quantized language model
Oobabooga - The definitive Web UI for local AI, with powerful features
Accessible large language models via k-bit quantization for PyTorch
A state-of-the-art open visual language model
A scientific machine learning (SciML) wrapper for the FEniCS
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
FlashMLA: Efficient Multi-head Latent Attention Kernels
A library for accelerating Transformer models on NVIDIA GPUs
PhantomBot is an actively developed open source interactive Twitch bot
PyTorch library of curated Transformer models and their components
Package that makes it trivial to create and evaluate machine learning
A Powerful Native Multimodal Model for Image Generation
High-performance Inference and Deployment Toolkit for LLMs and VLMs
An innovative library for efficient LLM inference
Low-code framework for building custom LLMs, neural networks
Open Source Document Management System for Digital Archives
The leading agent orchestration platform for Claude
ChatGLM3 series: Open Bilingual Chat LLMs