hardware free download

ONNX Runtime

ONNX Runtime: cross-platform, high performance ML inferencing

...Support for a variety of frameworks, operating systems and hardware platforms. Built-in optimizations that deliver up to 17X faster inferencing and up to 1.4X faster training.

Downloads: 29 This Week

Last Update: 2026-06-22

See Project

hls4ml is an open-source framework that enables machine learning models to be implemented directly on hardware such as FPGAs and ASICs using high-level synthesis techniques. The system converts trained neural network models from common machine learning frameworks into hardware description code suitable for ultra-low-latency inference. This approach allows machine learning algorithms to run directly on specialized hardware, making them suitable for applications that require extremely fast response times and minimal power consumption. ...

Downloads: 0 This Week

Last Update: 2026-03-20

See Project

dm_control

DeepMind's software stack for physics-based simulation

...DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo physics. The MuJoCo Python bindings support three different OpenGL rendering backends: EGL (headless, hardware-accelerated), GLFW (windowed, hardware-accelerated), and OSMesa (purely software-based). At least one of these three backends must be available in order render through dm_control. Hardware rendering with a windowing system is supported via GLFW and GLEW. On Linux these can be installed using your distribution's package manager. ...

Downloads: 3 This Week

Last Update: 2026-06-22

See Project

LiteRT-LM

LiteRT-LM is Google's production-ready inference framework

LiteRT-LM is Google’s open-source inference framework for deploying large language models on edge devices. It is built for production-oriented local LLM execution across Android, iOS, desktop, web, embedded, and IoT environments. The framework focuses on performance, hardware acceleration, and efficient model serving close to the user instead of relying only on remote cloud inference. It supports CPU execution across major platforms and adds GPU or NPU acceleration where available. LiteRT-LM is especially relevant for developers building private, low-latency AI features on phones, laptops, Raspberry Pi-style devices, and other edge hardware. ...

Downloads: 4 This Week

Last Update: 2 days ago

See Project

tvm

Open deep learning compiler stack for cpu, gpu, etc.

Apache TVM is an open source machine learning compiler framework for CPUs, GPUs, and machine learning accelerators. It aims to enable machine learning engineers to optimize and run computations efficiently on any hardware backend. The vision of the Apache TVM Project is to host a diverse community of experts and practitioners in machine learning, compilers, and systems architecture to build an accessible, extensible, and automated open-source framework that optimizes current and emerging machine learning models for any hardware platform. Compilation of deep learning models in Keras, MXNet, PyTorch, Tensorflow, CoreML, DarkNet and more. ...

Downloads: 0 This Week

Last Update: 2026-06-19

See Project

ggml

Tensor library for machine learning

...Written primarily in C and C++, the library provides low-level tensor operations and automatic differentiation that allow developers to implement machine learning algorithms and neural networks efficiently. The project emphasizes portability and performance, enabling machine learning inference across a wide range of hardware environments including CPUs and specialized accelerators. It is widely used as a foundational component in projects that run large language models locally, including tools that perform inference for transformer-based models. The library also implements optimization algorithms and computation graph functionality so developers can build training and inference workflows directly on top of its tensor operations.

Downloads: 2 This Week

Last Update: 2026-06-26

See Project

DeepVariant

DeepVariant is an analysis pipeline that uses a deep neural networks

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data. DeepVariant is a deep learning-based variant caller that takes aligned reads (in BAM or CRAM format), produces pileup image tensors from them, classifies each tensor using a convolutional neural network, and finally reports the results in a standard VCF or gVCF file. DeepTrio is a deep learning-based trio variant caller built on top of DeepVariant. DeepTrio...

Downloads: 3 This Week

Last Update: 2026-03-05

See Project

FISSURE

The RF and reverse engineering framework for everyone

...The platform supports workflows related to signal discovery, demodulation, packet inspection, fuzzing, and attack simulation, making it useful for both defensive research and controlled lab testing. Its architecture is oriented toward extensibility, so users can integrate additional hardware, signal-processing components, and protocol-specific modules depending on their needs.

Downloads: 2 This Week

Last Update: 2026-03-12

See Project

Model Zoo

Please do not feed the models

...The examples serve both as educational tools for learning Flux and as practical starting points for building new models. GPU acceleration is supported for most models through CUDA integration, enabling efficient training on compatible hardware. With community contributions encouraged, the Model Zoo acts as a hub for sharing and exploring diverse machine learning applications in Julia.

Downloads: 15 This Week

Last Update: 4 days ago

See Project

HeavyDB

HeavyDB (formerly MapD/OmniSciDB)

...The database compiles queries into optimized machine code that executes efficiently on GPU hardware, significantly accelerating analytical workloads. It supports hybrid deployment environments where queries can run on both CPU and GPU architectures depending on the available resources.

Downloads: 1 This Week

Last Update: 2026-03-11

See Project

Humanoid-Gym

Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real

Humanoid-Gym is a reinforcement learning framework designed to train locomotion and control policies for humanoid robots using high-performance simulation environments. The system is built on top of NVIDIA Isaac Gym, which allows large-scale parallel simulation of robotic environments directly on GPU hardware. Its primary goal is to enable efficient training of humanoid robots in simulation while enabling policies to transfer effectively to real-world hardware without additional training. The framework emphasizes the concept of zero-shot sim-to-real transfer, meaning that behaviors learned in simulation can be deployed directly on physical robots with minimal adjustment. ...

Downloads: 0 This Week

Last Update: 2026-03-15

See Project

GPU Puzzles

Solve puzzles. Learn CUDA

...Instead of presenting traditional lecture-style explanations, the project immerses learners directly in hands-on programming tasks that demonstrate how GPU computation works. The exercises are implemented using Python with the Numba CUDA interface, which allows Python code to compile into GPU kernels that run on CUDA-enabled hardware. By solving progressively more complex puzzles, learners gain a practical understanding of how parallel algorithms operate on graphics processing units. The project emphasizes experimentation and problem solving, encouraging learners to discover GPU programming techniques through trial and exploration. It can be run in cloud environments such as Google Colab, making it easy for beginners to start experimenting without configuring local GPU hardware.

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

ANE Training

Training neural networks on Apple Neural Engine via APIs

...It is primarily intended as a research and educational proof of concept rather than a production library, highlighting what is technically possible with undocumented hardware access.

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

Machine Learning Systems

Introduction to Machine Learning Systems

Machine Learning Systems is an open educational repository that serves as the source and learning stack for the Machine Learning Systems textbook, a project focused on teaching how to engineer AI systems that work reliably in real-world environments. Rather than concentrating only on model training, the material emphasizes the broader discipline of AI engineering, covering efficiency, reliability, deployment, and evaluation across the full lifecycle of intelligent systems. The repository...

Downloads: 11 This Week

Last Update: 2026-06-24

See Project

OpenVINO Notebooks

Jupyter notebook tutorials for OpenVINO

...The repository provides practical tutorials that guide developers through various AI workflows including computer vision, natural language processing, and generative AI tasks. Each notebook demonstrates how to run pre-trained models, optimize inference performance, and deploy models across hardware such as CPUs, GPUs, and specialized accelerators. The tutorials also illustrate how OpenVINO integrates with models from frameworks like PyTorch, TensorFlow, and ONNX to accelerate inference workloads. Many notebooks include end-to-end examples that show how to prepare input data, load optimized models, run inference, and visualize results. ...

Downloads: 0 This Week

Last Update: 2026-06-23

See Project

FlexLLMGen

Running large language models on a single GPU

FlexLLMGen is an open-source inference engine designed to run large language models efficiently on limited hardware resources such as a single GPU. The system focuses on high-throughput generation workloads where large batches of text must be processed quickly, such as large-scale data extraction or document analysis tasks. Instead of requiring expensive multi-GPU systems, the framework uses techniques such as memory offloading, compression, and optimized batching to run large models on commodity hardware.

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

Colossal-AI

Making large AI models cheaper, faster and more accessible

The Transformer architecture has improved the performance of deep learning models in domains such as Computer Vision and Natural Language Processing. Together with better performance come larger model sizes. This imposes challenges to the memory wall of the current accelerator hardware such as GPU. It is never ideal to train large models such as Vision Transformer, BERT, and GPT on a single GPU or a single machine. There is an urgent demand to train models in a distributed environment. However, distributed training, especially model parallelism, often requires domain expertise in computer systems and architecture. ...

Downloads: 0 This Week

Last Update: 2025-05-28

See Project

ONNX

Open standard for machine learning interoperability

...It defines an extensible computation graph model, as well as definitions of built-in operators and standard data types. Currently we focus on the capabilities needed for inferencing (scoring). ONNX is widely supported and can be found in many frameworks, tools, and hardware. Enabling interoperability between different frameworks and streamlining the path from research to production helps increase the speed of innovation in the AI community.

Downloads: 9 This Week

Last Update: 2026-06-15

See Project

Burn

Burn is a new comprehensive dynamic Deep Learning Framework

Burn is a new comprehensive dynamic Deep Learning Framework from Tracel AI built using Rust with extreme flexibility, compute efficiency and portability as its primary goals. Burn emphasizes performance, flexibility, and portability for both training and inference. Developed in Rust, it is designed to empower machine learning engineers and researchers across industry and academia.

Downloads: 1 This Week

Last Update: 2026-05-07

See Project

OpenVINO

OpenVINO™ Toolkit repository

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference. Boost deep learning performance in computer vision, automatic speech recognition, natural language processing and other common tasks. Use models trained with popular frameworks like TensorFlow, PyTorch and more. Reduce resource demands and efficiently deploy on a range of Intel® platforms from edge to cloud. This open-source version includes several components: namely Model Optimizer, OpenVINO™ Runtime,...

Downloads: 15 This Week

Last Update: 2026-06-09

See Project

BitNet

BitNet: Scaling 1-bit Transformers for Large Language Models

...By limiting weight precision while maintaining efficient scaling and normalization strategies, the architecture aims to retain competitive performance while significantly reducing hardware requirements.

Downloads: 1 This Week

Last Update: 2026-03-12

See Project

Intel Extension for PyTorch

A Python package for extending the official PyTorch

Intel® Extension for PyTorch* extends PyTorch* with up-to-date features optimizations for an extra performance boost on Intel hardware. Optimizations take advantage of Intel® Advanced Vector Extensions 512 (Intel® AVX-512) Vector Neural Network Instructions (VNNI) and Intel® Advanced Matrix Extensions (Intel® AMX) on Intel CPUs as well as Intel Xe Matrix Extensions (XMX) AI engines on Intel discrete GPUs. Moreover, Intel® Extension for PyTorch* provides easy GPU acceleration for Intel discrete GPUs through the PyTorch* xpu device.

Downloads: 2 This Week

Last Update: 2025-08-08

See Project

ExecuTorch

On-device AI across mobile, embedded and edge for PyTorch

ExecuTorch is an end-to-end solution for enabling on-device inference capabilities across mobile and edge devices including wearables, embedded devices and microcontrollers. It is part of the PyTorch Edge ecosystem and enables efficient deployment of PyTorch models to edge devices.

Downloads: 1 This Week

Last Update: 2026-05-28

See Project

MLX Engine

LM Studio Apple MLX engine

MLX Engine is the Apple MLX-based inference backend used by LM Studio to run large language models efficiently on Apple Silicon hardware. Built on top of the mlx-lm and mlx-vlm ecosystems, the engine provides a unified architecture capable of supporting both text-only and multimodal models. Its design focuses on high-performance on-device inference, leveraging Apple’s MLX stack to accelerate computation on M-series chips. The project introduces modular VisionAddOn components that allow image embeddings to be integrated seamlessly into language model workflows. ...

Downloads: 2 This Week

Last Update: 2026-06-24

See Project

TensorFlow Lite for Microcontrollers

Infrastructure to enable deployment of ML models

TensorFlow Lite for Microcontrollers is a TensorFlow Lite runtime designed for running machine learning models on tiny embedded devices. It targets microcontrollers, DSPs, and other resource-constrained hardware where memory, compute, and power are limited. The project enables on-device inference without depending on an operating system, standard C or C++ libraries, or dynamic memory allocation. It is useful for applications such as wake-word detection, sensor analysis, gesture recognition, anomaly detection, and small vision or audio models. ...

Downloads: 1 This Week

Last Update: 3 days ago

See Project

Search Results for "hardware"

Showing 61 open source projects for "hardware"

ONNX Runtime

hls4ml

dm_control

LiteRT-LM

tvm

ggml

DeepVariant

FISSURE

Model Zoo

HeavyDB

Humanoid-Gym

GPU Puzzles

ANE Training

Machine Learning Systems

OpenVINO Notebooks

FlexLLMGen

Colossal-AI

ONNX

Burn

OpenVINO

BitNet

Intel Extension for PyTorch

ExecuTorch

MLX Engine

TensorFlow Lite for Microcontrollers

Search Results for "hardware"

Showing 61 open source projects for "hardware"

ONNX Runtime

hls4ml

dm_control

LiteRT-LM

tvm

ggml

DeepVariant

FISSURE

Model Zoo

HeavyDB

Humanoid-Gym

GPU Puzzles

ANE Training

Machine Learning Systems

OpenVINO Notebooks

FlexLLMGen

Colossal-AI

ONNX

Burn

OpenVINO

BitNet

Intel Extension for PyTorch

ExecuTorch

MLX Engine

TensorFlow Lite for Microcontrollers

Related Searches

Related Categories