Lightweight, standalone C++ inference engine for Google's Gemma models
On-device AI framework for PyTorch across mobile, embedded, and edge devices
Open standard for machine learning interoperability
C++ library for high-performance inference on NVIDIA GPUs
High-performance neural network inference framework for mobile platforms
A GPU-accelerated library containing highly optimized building blocks