OCR expert VLM powered by Hunyuan's native multimodal architecture
State-of-the-art diffusion models for image and audio generation
A lightweight vision library for performing large object detection
Integrate, train and manage any AI models and APIs with your database
Low-latency AI inference engine optimized for mobile devices
Unified KV Cache Compression Methods for Auto-Regressive Models
Accessible large language models via k-bit quantization for PyTorch
Inference script for Oasis 500M
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
Fast stable diffusion on CPU and AI PC
Multilingual speech recognition and audio understanding model
PyTorch library of curated Transformer models and their components
A Pythonic framework to simplify AI service building
The repository provides code for running inference with SAM 2
Pruna is a model optimization framework built for developers
Lightweight Python library for adding real-time multi-object tracking
Redundancy-aware KV Cache Compression for Reasoning Models
Offline inference engine for art, real-time voice conversations
Simplifies the local serving of AI models from any source
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Library for OCR-related tasks powered by Deep Learning
A program that can do anything to earn money without human operators
Utilities intended for use with Llama models