LiteRT

LiteRT is Google's next-generation on-device machine learning framework and the successor to TensorFlow Lite, designed for high-performance AI and generative AI deployment across edge devices. It provides efficient model conversion, optimization, and runtime execution while leveraging hardware acceleration from CPUs, GPUs, and NPUs. LiteRT supports a wide range of platforms, including Android, iOS, Linux, macOS, Windows, web environments, and IoT devices. The framework simplifies on-device AI development through automated accelerator selection, asynchronous execution, and optimized memory handling. It also includes specialized support for large language models and generative AI workloads through LiteRT-LM and related tooling. With broad hardware compatibility and advanced performance optimizations, LiteRT enables developers to build fast, scalable, and efficient AI applications that run directly on user devices.

Features

Cross-Platform AI Deployment – Supports Android, iOS, Linux, macOS, Windows, web, and IoT environments from a unified framework.
Advanced Hardware Acceleration – Optimizes inference using CPUs, GPUs, and NPUs from leading chipset providers, including Google Tensor, Qualcomm, MediaTek, and Intel.
Compiled Model API – Automates accelerator selection, enables asynchronous execution, and improves I/O buffer management for streamlined development.
Generative AI Optimization – Provides dedicated tools and runtimes for deploying large language models, diffusion models, and other GenAI workloads on-device.
Model Conversion & Quantization – Converts and optimizes PyTorch and other machine learning models for efficient edge deployment and reduced resource usage.
High-Performance Runtime Engine – Delivers low-latency inference with zero-copy buffer interoperability, advanced GPU acceleration, and efficient model execution.

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow LiteRT

LiteRT Web Site

Other Useful Business Software

Ship Agents Faster

Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free

Rate This Project

User Reviews

Be the first to post a review of LiteRT!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

C++

Related Categories

C++ Artificial Intelligence Software

Registered

2025-06-04

Similar Business Software

LiteRT

LiteRT (Lite Runtime), formerly known as TensorFlow Lite, is Google's high-performance runtime for on-device AI. It enables developers to deploy machine learning models across various platforms and microcontrollers. LiteRT supports models from TensorFlow, PyTorch, and JAX, converting them into...

See Software
Google Compute Engine

Compute Engine is Google's infrastructure as a service (IaaS) platform for organizations to create and run cloud-based virtual machines. Computing infrastructure in predefined or custom machine sizes to accelerate your cloud transformation. General purpose (E2, N1, N2, N2D) machines provide a...

See Software
Dialpad Support

Dialpad Support is a next-generation Agentic AI contact center platform. An AI-native platform that reasons, resolves, and delivers quality CX at scale. AI agents autonomously handle routine inquiries while freeing human agents to focus on complex, high-value interactions. Built-in connected...

See Software
Gemini Enterprise Agent Platform

Gemini Enterprise Agent Platform is a comprehensive solution from Google Cloud designed to help organizations build, scale, govern, and optimize AI agents. It represents the evolution of Vertex AI, combining advanced model development with new capabilities for agent orchestration and...

See Software
LM-Kit.NET

LM-Kit.NET is a complete local AI runtime for .NET that lets engineering teams ship AI-powered features without cloud dependencies, per-token costs, or data leaving the network. Most .NET AI integrations stop at inference. LM-Kit.NET covers the full range of capabilities production...

See Software
Runpod

Runpod offers a cloud-based platform designed for running AI workloads, focusing on providing scalable, on-demand GPU resources to accelerate machine learning (ML) model training and inference. With its diverse selection of powerful GPUs like the NVIDIA A100, RTX 3090, and H100, Runpod supports...

See Software