Port of OpenAI's Whisper model in C/C++
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
User-friendly AI Interface
Data manipulation and transformation for audio signal processing
ONNX Runtime: cross-platform, high performance ML inferencing
OpenVINO™ Toolkit repository
High-performance neural network inference framework for mobile
The free, Open Source alternative to OpenAI, Claude and others
Everything you need to build state-of-the-art foundation models
Protect and discover secrets using Gitleaks
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Pure C++ implementation of several models for real-time chatting
A high-throughput and memory-efficient inference and serving engine
Open-Source AI Camera. Empower any camera/CCTV
The official Python client for the Huggingface Hub
Standardized Serverless ML Inference Platform on Kubernetes
Replace OpenAI GPT with another LLM in your app
Run serverless GPU workloads with fast cold starts on bare-metal
Training and deploying machine learning models on Amazon SageMaker
Fast inference engine for Transformer models
Easiest and laziest way for building multi-agent LLMs applications
State-of-the-art diffusion models for image and audio generation
A Pythonic framework to simplify AI service building
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference