Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
OpenVINO™ Toolkit repository
LLM.swift is a simple and readable library
Neural Network Compression Framework for enhanced OpenVINO
Libraries for applying sparsification recipes to neural networks
Efficient few-shot learning with Sentence Transformers
Openai style api for open large language models
Bolt is a deep learning library with high performance
A Unified Library for Parameter-Efficient Learning
Build Production-ready Agentic Workflow with Natural Language
Data manipulation and transformation for audio signal processing
The free, Open Source alternative to OpenAI, Claude and others
An easy-to-use LLMs quantization package with user-friendly apis
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
OpenAI swift async text to image for SwiftUI app using OpenAI
A real time inference engine for temporal logical specifications
Self-contained Machine Learning and Natural Language Processing lib
Framework that is dedicated to making neural data processing
Database system for building simpler and faster AI-powered application
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Fast and user-friendly runtime for transformer inference