Search Results for "hardware" - Page 20

Showing 2172 open source projects for "hardware"

1

llm.c

LLM training in simple, raw C/CUDA

...Its compact design makes it easy to trace execution, profile hotspots, and understand the cost of each operation. Portability is a goal: it aims to compile with common toolchains and run on modest hardware for small experiments. Rather than delivering a production-grade stack, it serves as a reference and learning scaffold for people who want to “see the metal” behind LLMs.

Downloads: 0 This Week

Last Update: 2025-10-15
See Project
2

MobileCLIP

Implementation of "MobileCLIP" CVPR 2024

...The repo provides training, inference, and evaluation code for MobileCLIP models trained on DataCompDR, and for newer MobileCLIP2 models trained on DFNDR. It includes an iOS demo app and Core ML artifacts to showcase practical, offline photo search and classification on iPhone-class hardware. Project notes highlight latency/accuracy trade-offs, with MobileCLIP2 variants matching or surpassing larger baselines at notably lower parameter counts and runtime on mobile devices. A companion “mobileclip-dr” repository details large-scale, distributed data-generation pipelines used to reinforce datasets across billions of samples on thousands of GPUs. ...

Downloads: 0 This Week

Last Update: 2026-04-15
See Project
3

EPLB

Expert Parallelism Load Balancer

EPLB is DeepSeek’s open implementation of a load balancing algorithm designed for expert parallelism (EP) settings in MoE architectures. In EP, different “experts” are mapped to different GPUs or nodes, so load imbalance becomes a performance bottleneck if certain experts are invoked much more often. EPLB solves this by duplicating heavily used experts (redundancy) and then placing those duplicates across GPUs to even out computational load. It uses policies like hierarchical load balancing...
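The replicate-then-place idea in this description can be illustrated with a toy greedy sketch. This is not EPLB's actual code or API: the function names `replicate_experts` and `place_replicas`, the assumption that an expert's load splits evenly across its replicas, and the lack of a per-GPU slot limit are all simplifications made for illustration.

```python
import heapq


def replicate_experts(loads, num_slots):
    """Give every expert one replica, then hand each spare slot to the
    expert whose per-replica load is currently the highest."""
    if num_slots < len(loads):
        raise ValueError("need at least one slot per expert")
    replicas = [1] * len(loads)
    # Max-heap (via negated keys) ordered by per-replica load.
    heap = [(-load, e) for e, load in enumerate(loads)]
    heapq.heapify(heap)
    for _ in range(num_slots - len(loads)):
        _, e = heapq.heappop(heap)
        replicas[e] += 1
        heapq.heappush(heap, (-loads[e] / replicas[e], e))
    return replicas


def place_replicas(loads, replicas, num_gpus):
    """Greedy placement: heaviest replica first, always onto the GPU
    with the smallest accumulated load. Hypothetical simplification:
    no per-GPU slot limit and no node hierarchy."""
    shares = sorted(
        ((loads[e] / replicas[e], e)
         for e in range(len(loads))
         for _ in range(replicas[e])),
        reverse=True,
    )
    gpus = [(0.0, g) for g in range(num_gpus)]  # (accumulated load, gpu id)
    heapq.heapify(gpus)
    placement = [[] for _ in range(num_gpus)]
    for share, e in shares:
        total, g = heapq.heappop(gpus)
        placement[g].append(e)
        heapq.heappush(gpus, (total + share, g))
    return placement


# One "hot" expert and three lightly used ones, 6 slots, 3 GPUs.
loads = [90, 10, 10, 10]
replicas = replicate_experts(loads, num_slots=6)
placement = place_replicas(loads, replicas, num_gpus=3)
print(replicas, placement)
```

In this made-up workload the hot expert ends up with three replicas, one per GPU, so each GPU carries the same estimated load.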

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
4

OpenAI Realtime Embedded

Instructions on how to use the Realtime API on Microcontrollers

openai-realtime-embedded is a repository that provides resources, SDKs, and example links for using OpenAI’s Realtime API on embedded hardware platforms (e.g. microcontrollers). The goal is to enable low-latency conversational agents (e.g. voice-based assistants) running directly on constrained devices, by leveraging WebRTC and streaming APIs to communicate with OpenAI systems. The repo includes pointers to an ESP32 implementation (maintained as esp32 branch) and documentation that Espressif offers an official example in openai_demo. ...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
5

Selkies-GStreamer

Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop

selkies-gstreamer is a GStreamer-based media streaming component used in the Selkies project, a cloud-native platform designed for interactive desktop and application streaming. This module acts as a high-performance media pipeline that captures video, encodes it with low latency, and streams it via WebRTC to client browsers. It is optimized for GPU-accelerated encoding and integrates with Kubernetes-based deployments to enable scalable, real-time remote desktop sessions. This component...

Downloads: 0 This Week

Last Update: 2025-03-27
See Project
6

ImplicitGlobalGrid.jl

Distributed parallelization of stencil-based GPU and CPU applications

...It renders the distributed parallelization of stencil-based GPU and CPU applications on a regular staggered grid almost trivial and enables close to ideal weak scaling of real-world applications on thousands of GPUs [1, 2, 3]. ImplicitGlobalGrid relies on the Julia MPI wrapper (MPI.jl) to perform halo updates close to the hardware limit and leverages CUDA-aware or ROCm-aware MPI for GPU applications. The communication can straightforwardly be hidden behind computation [1, 3]; ParallelStencil.jl can do this automatically, and a general approach particularly suited to CUDA C applications has also been described.

Downloads: 0 This Week

Last Update: 3 days ago
See Project
7

react-use

Component for React

...Re-renders component, while tweening a number from 0 to 1. Tracks long press gesture of some element. Tracks state of a CSS media query. Tracks state of connected hardware devices. Returns a callback, which re-renders component when called. Tracks state of device's motion sensor. Tracks state of mouse position. Tracks deltaY of scrolled mouse wheel. Provides way to read, update and delete a cookie. Tracks the state of browser's network connection. Query permission status for browser APIs.

Downloads: 0 This Week

Last Update: 2026-06-11
See Project
8

Phusion Passenger

A fast and robust web server and application server for Ruby

...Passenger was built with high performance and efficiency in mind. With Passenger’s zero-copy architecture, turbocaching and support for multiple concurrency models, you can count on Passenger to get the most out of your hardware. Passenger auto-manages your apps’ processes, restarting them, and itself, if they ever crash. Passenger also keeps resource-hogging apps in check, giving your web app maximum uptime.

Downloads: 1 This Week

Last Update: 2026-06-27
See Project
9

Gogs

A painless self-hosted Git service

...All you have to do is run the binary on any platform that Go supports: Linux, macOS and Windows. You may also install from source, from packages, or ship with Docker or Vagrant. Gogs is very lightweight with minimal hardware requirements, running on Raspberry Pi and even on NAS devices. Gogs offers plenty of great features, including various access repositories, repository and organization webhooks, repository Git hooks, repository management and so much more. It also offers software, service and product support for various areas such as project management (Kanboard, Taiga), DevOps (Fabric8) and team communication (BearyChat).

Downloads: 1 This Week

Last Update: 2026-06-07
See Project
10

Style-Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles

...For those who only need synthesis, the project is published as a Python library (pip install style-bert-vits2) and can run on CPU without an NVIDIA GPU, though training still requires GPU hardware.

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
11

TensorFlow Quantum

Open-source Python framework for hybrid quantum-classical machine learning

...The library also supports high-performance simulation of quantum circuits, enabling researchers to test and evaluate quantum models even without direct access to quantum hardware.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
12

LEANN

Local RAG engine for private multimodal knowledge search on devices

...It focuses on dramatically reducing the storage overhead typically required for vector search and embedding indexes, enabling efficient large-scale knowledge retrieval on consumer hardware. LEANN introduces a storage-efficient approximate nearest neighbor index combined with on-the-fly embedding recomputation to avoid storing large embedding vectors. By recomputing embeddings during queries and using compact graph-based indexing structures, LEANN can maintain high search accuracy while minimizing disk usage. It aims to act as a unified personal knowledge layer that connects different types of data such as documents, code, images, and other local files into a searchable context for language models.

Downloads: 0 This Week

Last Update: 2026-03-13
See Project
13

UCCL

UCCL is an efficient communication library for GPUs

...It supports a variety of communication patterns including collective operations such as all-reduce as well as peer-to-peer transfers that are commonly used in modern machine learning architectures. UCCL is designed to work with heterogeneous hardware environments, allowing GPUs from different vendors and network interfaces to communicate efficiently without vendor lock-in. The system also supports specialized workloads such as reinforcement learning weight transfers, key-value cache sharing, and expert parallelism for mixture-of-experts models. Its architecture emphasizes flexibility and extensibility so that developers can implement custom communication protocols tailored to specific machine learning workloads.

Downloads: 0 This Week

Last Update: 2026-05-10
See Project
14

GPU Hot

Real-time NVIDIA GPU dashboard

GPU Hot is an open-source, lightweight monitoring dashboard designed to provide real-time visibility into NVIDIA GPU performance across single machines or entire clusters. The project offers a self-hosted web interface that streams hardware metrics directly from GPU servers, enabling developers, ML engineers, and system administrators to observe GPU utilization and system behavior in real time through a browser. The dashboard collects and displays a wide range of performance metrics including temperature, memory usage, power consumption, clock speeds, fan speed, and active processes. ...

Downloads: 0 This Week

Last Update: 2026-05-28
See Project
15

MaxText

A simple, performant and scalable JAX LLM

...The project acts as both a reference implementation and a practical training library that demonstrates best practices for building and scaling transformer-based language models on modern accelerator hardware. It is optimized to run efficiently on Google Cloud TPUs and GPUs, enabling researchers and engineers to train models ranging from small experiments to extremely large distributed workloads. The framework focuses on simplicity while still supporting advanced techniques such as model sharding, distributed computation, and high-throughput training pipelines. ...

Downloads: 0 This Week

Last Update: 2026-06-12
See Project
16

MatMul-Free LM

Implementation for MatMul-free LM

...Since matrix multiplication is one of the most computationally expensive components of modern language models, the project explores alternative computational strategies that reduce hardware requirements while maintaining comparable performance. The architecture relies on quantization-aware training and lightweight operations to replace conventional dense matrix multiplications with more efficient alternatives. These optimizations can significantly reduce memory consumption and potentially improve computational efficiency during both training and inference. ...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
17

Chat with LLMs Everywhere

Run PyTorch LLMs locally on servers, desktop and mobile

...TorchChat supports running models through Python interfaces as well as integrating them directly into native applications written in languages such as C or C++. The project also demonstrates how modern LLMs like LLaMA-style models can be deployed locally while maintaining good performance across different hardware platforms.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
18

CUDA Agent

Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

CUDA Agent is a research-driven agentic reinforcement learning system designed to automatically generate and optimize high-performance CUDA kernels for GPU workloads. The project addresses the long-standing challenge that efficient CUDA programming typically requires deep hardware expertise by training an autonomous coding agent capable of iterative improvement through execution feedback. Its architecture combines large-scale data synthesis, a skill-augmented CUDA development environment, and long-horizon reinforcement learning to build intrinsic optimization capability rather than relying on simple post-hoc tuning. ...

Downloads: 0 This Week

Last Update: 2026-03-03
See Project
19

MongoDB Rust Driver

The official MongoDB Rust Driver

...Because it’s asynchronous by design, it works well with Rust async runtimes like Tokio and async-std, letting developers build highly concurrent networked services that efficiently use modern multicore hardware. The crate also includes BSON encoding and decoding that maps cleanly to Rust types, so developers can work with rich document structures while retaining Rust’s performance guarantees.

Downloads: 0 This Week

Last Update: 2026-05-21
See Project
20

Granite 3.0 Language Models

New set of lightweight state-of-the-art, open foundation models

This repository introduces Granite 3.0 language models as lightweight, state-of-the-art open foundation models built to natively support multilinguality, coding, reasoning, and tool usage. A central goal is efficient deployment, including the potential to run on constrained compute resources while remaining useful for a broad span of enterprise tasks. The repo positions the models for both research and commercial use under an Apache-2.0 license, signaling permissive adoption paths....

Downloads: 0 This Week

Last Update: 2025-10-08
See Project
21

DreamCraft3D

Official implementation of DreamCraft3D

...The repository includes model code, inference scripts, sample prompts, and possibly dataset preparation pipelines. It may integrate rendering or post-processing modules (e.g. mesh smoothing, texturing) to make the outputs easier to use downstream. Because 3D generation is hardware-intensive, the repository likely also includes optimizations like quantization, pruning, or inference accelerations (e.g. using FlashMLA or DeepEP) to make the generation pipeline faster or more efficient. DreamCraft3D may also support style or attribute control (e.g. “make this object metallic,” “add textures”) via prompt conditioning or guides.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
22

Perf Book

The book "Performance Analysis and Tuning on Modern CPU"

...It explains how caches, TLBs, prefetchers, branch predictors, and out-of-order execution influence real program speed, then connects those concepts to concrete optimization strategies. Readers learn how to design trustworthy benchmarks, avoid measurement traps (warmup, turbo, frequency scaling), and interpret hardware performance counters. The book walks through vectorization, memory layout, data-oriented design, and algorithmic choices, illustrating when compiler flags, intrinsics, or hand-rolled assembly make sense. It also demonstrates tool-driven workflows—using profilers and PMU events—to locate true bottlenecks and validate that changes actually help. ...

Downloads: 0 This Week

Last Update: 2025-09-23
See Project
23

Coolify

An open-source & self-hostable Heroku / Netlify / Vercel alternative

Coolify is an open-source & self-hostable alternative to Heroku / Netlify / Vercel / etc. It helps you manage your servers, applications, and databases on your own hardware; you only need an SSH connection. You can manage VPS, bare metal, Raspberry Pis, and anything else. Imagine having the ease of a cloud but with your own servers. That is Coolify. No vendor lock-in, which means that all the configurations for your applications/databases/etc are saved to your server. So, if you decide to stop using Coolify, you could still manage your running resources. ...

Downloads: 0 This Week

Last Update: 2026-06-04
See Project
24

CUDA.jl

CUDA programming in Julia

...JuliaGPU is a GitHub organization created to unify the many packages for programming GPUs in Julia. With its high-level syntax and flexible compiler, Julia is well-positioned to productively program hardware accelerators like GPUs without sacrificing performance. The latest development version of CUDA.jl requires Julia 1.8 or higher. If you are using an older version of Julia, you need to use a previous version of CUDA.jl. This will happen automatically when you install the package using Julia's package manager.

Downloads: 0 This Week

Last Update: 2026-06-11
See Project
25

Pipy

Pipy is a programmable proxy for the cloud, edge and IoT

...Pipy comes with a built-in JavaScript engine that allows easy custom logic implementation with the simple JS syntax, greatly reducing the complexity in high-performance network programming. It is suitable for a variety of hardware architectures including x86, ARM64, Loongson and RISC-V. It is also compatible with various other operating systems besides Linux.

Downloads: 0 This Week

Last Update: 2025-09-18
See Project