Sparsity-aware deep learning inference runtime for CPUs
OpenVINO™ Toolkit repository
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Large Language Model Text Generation Inference
LLM.swift is a simple and readable library
OpenAI-style API for open large language models
Libraries for applying sparsification recipes to neural networks
Neural Network Compression Framework for enhanced OpenVINO
Efficient few-shot learning with Sentence Transformers
A Unified Library for Parameter-Efficient Learning
Bolt is a deep learning library with high performance
Build Production-ready Agentic Workflows with Natural Language
Bring the notion of Model-as-a-Service to life
Data manipulation and transformation for audio signal processing
Ready-to-use OCR with 80+ supported languages
The free, Open Source alternative to OpenAI, Claude and others
Library for OCR-related tasks powered by Deep Learning
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser
OpenAI Swift async text-to-image for SwiftUI apps using OpenAI
An easy-to-use LLM quantization package with user-friendly APIs
A real-time inference engine for temporal logical specifications
Self-contained Machine Learning and Natural Language Processing lib
Framework that is dedicated to making neural data processing
Database system for building simpler and faster AI-powered applications
Framework for Accelerating LLM Generation with Multiple Decoding Heads