compression free download

AIMET

AIMET is a library that provides advanced quantization and compression

...QuIC has a mission to help migrate the ecosystem toward fixed-point inference. With this goal, QuIC presents the AI Model Efficiency Toolkit (AIMET) - a library that provides advanced quantization and compression techniques for trained neural network models. AIMET enables neural networks to run more efficiently on fixed-point AI hardware accelerators. Quantized inference is significantly faster than floating point inference. For example, models that we’ve run on the Qualcomm® Hexagon™ DSP rather than on the Qualcomm® Kryo™ CPU have resulted in a 5x to 15x speedup. ...

Downloads: 5 This Week

Last Update: 1 day ago

See Project

FlexLLMGen

Running large language models on a single GPU

...The system focuses on high-throughput generation workloads where large batches of text must be processed quickly, such as large-scale data extraction or document analysis tasks. Instead of requiring expensive multi-GPU systems, the framework uses techniques such as memory offloading, compression, and optimized batching to run large models on commodity hardware. The architecture distributes computation and memory usage across the GPU, CPU, and disk in order to maximize the number of tokens processed during inference. This design allows organizations to deploy powerful language models for high-volume tasks without the infrastructure costs typically associated with large-scale AI systems. ...

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

CLIP-as-service

Embed images and sentences into fixed-length vectors

...No learning curve, minimalist design on client and server. Intuitive and consistent API for image and sentence embedding. Async client support. Easily switch between gRPC, HTTP, WebSocket protocols with TLS and compression. Smooth integration with neural search ecosystem including Jina and DocArray. Build cross-modal and multi-modal solutions in no time.

Downloads: 0 This Week

Last Update: 2023-12-20

See Project

Neural Network Intelligence

AutoML toolkit for automate machine learning lifecycle

Neural Network Intelligence is an open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning. NNI (Neural Network Intelligence) is a lightweight but powerful toolkit to help users automate feature engineering, neural architecture search, hyperparameter tuning and model compression. The tool manages automated machine learning (AutoML) experiments, dispatches and runs experiments' trial jobs generated by tuning algorithms to search the best neural architecture and/or hyper-parameters in different training environments like Local Machine, Remote Servers, OpenPAI, Kubeflow, FrameworkController on K8S (AKS etc.) ...

Downloads: 2 This Week

Last Update: 2023-09-13

See Project

Minkowski Engine

Auto-diff neural network library for high-dimensional sparse tensors

...To run the examples, please install the package and run the command in the package root directory. Compressing a neural network to speed up inference and minimize memory footprint has been studied widely. One of the popular techniques for model compression is pruning the weights in convnets, is also known as sparse convolutional networks. Such parameter-space sparsity used for model compression compresses networks that operate on dense tensors and all intermediate activations of these networks are also dense tensors.

Downloads: 0 This Week

Last Update: 2022-08-11

See Project

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD

This repository provides pre-trained encoder-decoder models and its related optimization techniques developed by Alibaba's MinD (Machine IntelligeNce of Damo) Lab. Pre-trained models for natural language understanding (NLU). We extend BERT to a new model, StructBERT, by incorporating language structures into pre-training. Specifically, we pre-train StructBERT with two auxiliary tasks to make the most of the sequential order of words and sentences, which leverage language structures at the...

Downloads: 0 This Week

Last Update: 2022-08-17

See Project

Search Results for "compression"

Showing 6 open source projects for "compression"

AIMET

FlexLLMGen

CLIP-as-service

Neural Network Intelligence

Minkowski Engine

AliceMind

Search Results for "compression"

Showing 6 open source projects for "compression"

AIMET

FlexLLMGen

CLIP-as-service

Neural Network Intelligence

Minkowski Engine

AliceMind

Related Searches

Related Categories