NNCF (Neural Network Compression Framework) is an optimization toolkit for deep learning models, designed to apply quantization, pruning, and other techniques to improve inference efficiency.

Features

  • Supports quantization and pruning for model compression
  • Open-source and customizable for AI model optimization
  • Works with TensorFlow, PyTorch, and OpenVINO frameworks
  • Reduces model size while maintaining accuracy
  • Enables mixed precision training for optimized performance
  • Compatible with hardware acceleration (Intel CPUs, GPUs, and VPUs)

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow NNCF

NNCF Web Site

Other Useful Business Software
Full-stack observability with actually useful AI | Grafana Cloud Icon
Full-stack observability with actually useful AI | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Create free account
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of NNCF!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Frameworks, Python Natural Language Processing (NLP) Tool, Python LLM Inference Tool

Registered

2025-01-24