Official inference framework for 1-bit LLMs
BitNet: Scaling 1-bit Transformers for Large Language Models
Z80-μLM is a 2-bit quantized language model
Fast inference engine for Transformer models
Oobabooga - The definitive Web UI for local AI, with powerful features
AIMET is a library that provides advanced quantization and compression
C++ image processing and machine learning library with using of SIMD
Accessible large language models via k-bit quantization for PyTorch
A scientific machine learning (SciML) wrapper for the FEniCS
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Open Source Document Management System for Digital Archives
PhantomBot is an actively developed open source interactive Twitch bot
100–200× Acceleration for Video Diffusion Models
NeurIPS2025 Spotlight] Quantized Attention
Official implementation of Watermark Anything with Localized Messages
Package that makes it trivial to create and evaluate machine learning
Low-code framework for building custom LLMs, neural networks
High-performance Inference and Deployment Toolkit for LLMs and VLMs
A library for accelerating Transformer models on NVIDIA GPUs
oneAPI Deep Neural Network Library (oneDNN)
A state-of-the-art open visual language model
Capable of understanding text, audio, vision, video
OSS standalone ChatGPT client
TechNews365 OS Admin AI intègre un Assistant Vocal IA 100% local !