Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Openai style api for open large language models
Libraries for applying sparsification recipes to neural networks
Neural Network Compression Framework for enhanced OpenVINO
Efficient few-shot learning with Sentence Transformers
A Unified Library for Parameter-Efficient Learning
Bring the notion of Model-as-a-Service to life
Data manipulation and transformation for audio signal processing
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
An easy-to-use LLMs quantization package with user-friendly apis
Framework that is dedicated to making neural data processing
Database system for building simpler and faster AI-powered application
Framework for Accelerating LLM Generation with Multiple Decoding Heads