pixel-arm64 free download

Showing 45 open source projects for "pixel-arm64"

1

Final2x

2^x Image Super-Resolution

...The tool is available in English, Chinese, and Japanese, and runs on Windows x64/arm64, macOS x64/arm64, and Linux x64.

Downloads: 17 This Week

Last Update: 2025-10-05
2

Sprite Fusion Pixel Snapper

A tool to snap pixels to a perfect grid

Sprite Fusion Pixel Snapper is a utility designed to eliminate sub-pixel rendering issues that often arise in pixel art, UI icons, and 2D sprite graphics when displayed on screens with high DPI or during motion animations. The tool works by adjusting sprite rendering coordinates and texture sampling so that every pixel aligns cleanly to the screen’s pixel grid, avoiding blurring, distortion, or unintended smoothing artifacts.

Downloads: 2 This Week

Last Update: 2026-02-05
3

pixelmatch

The smallest, simplest JavaScript pixel-level image comparison library

The smallest, simplest and fastest JavaScript pixel-level image comparison library, originally created to compare screenshots in tests. Features accurate anti-aliased pixels detection and perceptual color difference metrics. Inspired by Resemble.js and Blink-diff. Unlike these libraries, pixelmatch is around 150 lines of code, has no dependencies, and works on raw typed arrays of image data, so it's blazing fast and can be used in any environment (Node or browsers).

Downloads: 0 This Week

Last Update: 2025-02-21
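
As a rough illustration of what pixel-level comparison over raw image buffers involves, the following Python sketch counts differing pixels between two flat RGBA byte buffers. It is not pixelmatch's algorithm or API: the function name count_diff_pixels and the tolerance parameter are hypothetical, and it uses a plain per-channel difference rather than the perceptual color metric and anti-aliasing detection mentioned in the description.

```python
def count_diff_pixels(img1, img2, width, height, tolerance=0):
    """Count pixels whose RGBA channels differ by more than `tolerance`.

    img1 and img2 are flat byte sequences (4 bytes per pixel, RGBA order).
    Hypothetical helper for illustration only.
    """
    expected = width * height * 4
    if len(img1) != expected or len(img2) != expected:
        raise ValueError("buffer size does not match width * height * 4")

    diff = 0
    # Step through the buffers one pixel (4 bytes) at a time.
    for i in range(0, expected, 4):
        if any(abs(img1[i + c] - img2[i + c]) > tolerance for c in range(4)):
            diff += 1
    return diff


# Usage: two 2x2 images that differ in the red channel of the second pixel.
a = bytes([0, 0, 0, 255] * 4)
b = bytearray(a)
b[4] = 200
mismatched = count_diff_pixels(a, b, 2, 2)
```

Working on flat buffers keeps the comparison free of any image-decoding dependency, which matches the "raw typed arrays" idea in the description.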
4

JEPA

PyTorch code and models for V-JEPA self-supervised learning from video

...Because the objective is non-autoregressive and operates in embedding space, JEPA tends to be compute-efficient and stable at scale. The approach has become a strong alternative to contrastive or pixel-reconstruction methods for representation learning.

Downloads: 1 This Week

Last Update: 2025-10-07
5

Porcupine

On-device wake word detection powered by deep learning

...Arm Cortex-M, STM32, PSoC, Arduino, and i.MX RT. Raspberry Pi, NVIDIA Jetson Nano, and BeagleBone. Android and iOS. Chrome, Safari, Firefox, and Edge. Linux (x86_64), macOS (x86_64, arm64), and Windows (x86_64). Scalable. It can detect multiple always-listening voice commands with no added runtime footprint. Self-service. Developers can train custom wake word models using Picovoice Console. Porcupine is the right product if you need to detect one or a few static (always-listening) voice commands. If you want to create voice experiences similar to Alexa or Google, see the Picovoice platform.

Downloads: 6 This Week

Last Update: 2025-12-11
6

vJEPA-2

PyTorch code and models for VJEPA2 self-supervised learning from video

...Instead of reconstructing pixels, it predicts the missing high-level embeddings of masked space-time regions using a context encoder and a slowly updated target encoder. This objective encourages the model to learn semantics, motion, and long-range structure without the shortcuts that pixel-level losses can invite. The architecture is designed to scale: spatiotemporal ViT backbones, flexible masking schedules, and efficient sampling let it train on long clips while remaining stable. Trained representations transfer well to downstream tasks such as action recognition, temporal localization, and video retrieval, often with simple linear probes or light fine-tuning. ...

Downloads: 0 This Week

Last Update: 2026-03-23
7

EasyOCR

Ready-to-use OCR with 80+ supported languages

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc. EasyOCR is a Python module for extracting text from images. It is a general OCR that can read both natural scene text and dense text in documents. We currently support 80+ languages and are expanding. Second-generation models: multiple times smaller size, multiple times faster inference, additional characters and comparable accuracy to the first...

Downloads: 21 This Week

Last Update: 2024-09-24
8

CTranslate2

Fast inference engine for Transformer models

...The model serialization and computation support weights with reduced precision: 16-bit floating points (FP16), 16-bit integers (INT16), and 8-bit integers (INT8). The project supports x86-64 and AArch64/ARM64 processors and integrates multiple backends that are optimized for these platforms: Intel MKL, oneDNN, OpenBLAS, Ruy, and Apple Accelerate.

Downloads: 1 This Week

Last Update: 2026-02-04
9

AndroidEnv

RL research on Android devices

android_env is a reinforcement learning (RL) environment developed by Google DeepMind that enables agents to interact with Android applications directly as a learning environment. It provides a standardized API for training agents to perform tasks on Android apps, supporting tasks ranging from games to productivity apps, making it suitable for research in real-world RL settings.

Downloads: 1 This Week

Last Update: 2025-03-13
10

JiT

PyTorch implementation of JiT

JiT is an open-source PyTorch implementation of a state-of-the-art image diffusion model designed around a minimalist yet powerful architecture for pixel-level generative modeling, based on the paper Back to Basics: Let Denoising Generative Models Denoise. Rather than predicting noise, JiT models directly predict clean image data, which the research suggests aligns better with the manifold structure of natural images and leads to stronger generative performance at high resolution. ...

Downloads: 0 This Week

Last Update: 2026-02-05
11

Simd Library

C++ image processing and machine learning library using SIMD

The Simd Library is a free open-source image processing and machine learning library, designed for C and C++ programmers. It provides many useful high-performance algorithms for image processing such as pixel format conversion, image scaling and filtration, extraction of statistical information from images, motion detection, object detection and classification, and neural networks. The algorithms are optimized using various SIMD CPU extensions. In particular, the library supports the following CPU extensions: SSE, AVX, AVX-512, and AMX for x86/x64, and NEON for ARM. ...

Downloads: 0 This Week

Last Update: 4 days ago
12

StarVector

StarVector is a foundation model for SVG generation

...This approach allows StarVector to create scalable graphics that maintain visual quality regardless of resolution, which is especially useful for design tools and illustration workflows. Because the model produces SVG code rather than pixel images, the output can be edited programmatically or integrated directly into web and design environments.

Downloads: 1 This Week

Last Update: 2026-03-05
13

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion

Grounded-Segment-Anything is a research-oriented project that combines powerful open-set object detection with pixel-level segmentation and subsequent creative workflows, effectively enabling detection, segmentation, and high-level vision tasks guided by free-form text prompts. The core idea behind the project is to pair Grounding DINO — a zero-shot object detector that can locate objects described by natural language — with Segment Anything Model (SAM), which can produce detailed masks for objects once they are localized. ...

Downloads: 0 This Week

Last Update: 2026-02-03
14

Map-Anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

...Instead of stitching together many task-specific models, it uses a single architecture that supports a wide range of 3D tasks—multi-image structure-from-motion, multi-view stereo, monocular metric depth, registration, depth completion, and more. The model flexibly accepts different input combinations (images, intrinsics, poses, sparse or dense depth) and produces a rich set of outputs including per-pixel 3D points, camera intrinsics, camera poses, ray directions, confidence maps, and validity masks. Its inference path is fully feed-forward with optional mixed-precision and memory-efficient modes, making it practical to scale to long image sequences while keeping latency predictable.

Downloads: 1 This Week

Last Update: 2026-03-23
15

LISA

LISA: Reasoning Segmentation via Large Language Model

LISA is an open-source multimodal AI system designed to enable language models to perform pixel-level reasoning and segmentation tasks on images. The project introduces a framework where a large language model can interpret natural language instructions and produce segmentation masks that highlight relevant regions in an image. Instead of relying solely on predefined object categories, the model is capable of reasoning about complex textual queries and translating them into visual segmentation outputs. ...

Downloads: 0 This Week

Last Update: 2026-03-06
16

Color Thief

Grab the color palette from an image using just JavaScript

...When run in Node, this argument expects a path to the image. quality is an optional argument that must be an Integer of value 1 or greater, and defaults to 10. The number determines how many pixels are skipped before the next one is sampled. We rarely need to sample every single pixel in the image to get good results. The bigger the number, the faster a value will be returned. Gets a palette from the image by clustering similar colors. The palette is returned as an array containing colors, each color itself an array of three integers.

Downloads: 0 This Week

Last Update: 2026-03-04
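
The effect of the quality argument described above can be sketched in Python. This is an illustration under stated assumptions, not Color Thief's code: sample_pixels is a hypothetical helper, and it assumes quality acts as a sampling stride over a flat list of RGB pixels, so that a value of 1 samples every pixel.

```python
def sample_pixels(pixels, quality=10):
    """Return every `quality`-th pixel from a flat list of (r, g, b) tuples.

    Hypothetical helper: treats `quality` as a stride, so larger values
    sample fewer pixels and return faster.
    """
    if not isinstance(quality, int) or quality < 1:
        raise ValueError("quality must be an integer of value 1 or greater")
    return pixels[::quality]


# Usage: 25 gray pixels sampled with the default quality of 10.
pixels = [(i, i, i) for i in range(25)]
sampled = sample_pixels(pixels)
```

A palette step (clustering similar colors) would then run on the sampled subset only, which is why a larger quality value trades accuracy for speed.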
17

Sa2VA

Official Repo For Sa2VA: Marrying SAM2 with LLaVA

...With minimal instruction tuning (often one-shot), Sa2VA can handle tasks such as “segment the main subject,” “what are the objects in this scene?”, or “track this object through the video,” outputting pixel-perfect masks or spoken/textual answers as appropriate.

Downloads: 0 This Week

Last Update: 2025-12-02
18

PyDenseCRF

Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs

...Conditional Random Fields are probabilistic graphical models used to model contextual relationships between neighboring pixels or features, improving prediction consistency across images. By implementing a fully connected CRF model with Gaussian edge potentials, the library enables efficient inference across all pixel pairs in an image rather than only local neighborhoods. The Python wrapper is implemented using Cython, allowing high-performance CRF computations while maintaining a Python-friendly interface for experimentation and development.

Downloads: 0 This Week

Last Update: 2026-03-12
19

Recurrent Interface Network (RIN)

Implementation of Recurrent Interface Network (RIN)

...The big surprise is that the generations can reach this level of fidelity. Will need to verify this on my own machine. Additionally, we will try adding an extra linear attention on the main branch as well as self-conditioning in the pixel space. The insight of being able to self-condition on any hidden state of the network as well as the newly proposed sigmoid noise schedule are the two main findings.

Downloads: 0 This Week

Last Update: 2024-02-14
20

AudioMuse-AI

...Using tools such as Librosa and ONNX, it performs sonic analysis on your audio files locally, allowing you to curate playlists for any mood or occasion without relying on external APIs. Deploy it easily on your local machine with Docker Compose or Podman, or scale it in a Kubernetes cluster (supports AMD64 and ARM64). It integrates with the APIs of the main music servers, such as Jellyfin, Navidrome, LMS, Lyrion, and Emby. More integrations may be added in the future. AudioMuse-AI lets you explore your music library in innovative ways: start with an initial analysis, and you’ll unlock features such as Clustering, Instant Playlist, Music Playlist, and many more.

Downloads: 2 This Week

Last Update: 2026-02-01
21

FLUX.1 Krea

Powerful open source image generation model

FLUX.1 Krea [dev] is an open-source 12-billion parameter image generation model developed collaboratively by Krea and Black Forest Labs, designed to deliver superior aesthetic control and high image quality. It is a rectified-flow model distilled from the original Krea 1, providing enhanced sampling efficiency through classifier-free guidance distillation. The model supports generation at resolutions between 1024 and 1280 pixels with recommended inference steps between 28 and 32 for optimal...

1 Review

Downloads: 7 This Week

Last Update: 2025-08-05
22

Dead Deer 3.14.82.2025

3D modeler, 3D game maker, 3D demo maker

...Support for: Direct3D9 (SM3), Direct3D10 (SM4), Direct3D11 (SM5), Direct3D12 (SM5), OpenGL and GLSL, OpenGLES 2/3, Apple Metal. Retina, UHD. Intel x86/64, ARMv7/ARM64, RISC-V. Linux (Ubuntu/wxWidgets (Gtk3)), iOS/iPadOS (with Xcode) (GLES20/Metal), Windows Phone, Windows VR (Steam/Oculus), WebAsm/WebGL, UWP Windows/Xbox, SDL2 Linux ARM 32/64 RISC-V, OpenXR (Quest?/Pico). 3.14.82.2025

6 Reviews

Downloads: 9 This Week

Last Update: 4 days ago
23

CoTracker

CoTracker is a model for tracking any point (pixel) on a video

...By reasoning about all tracks together, it can maintain temporal consistency, handle mutual occlusions, and reduce identity swaps when trajectories cross. The model takes sparse point queries on one frame and predicts their sub-pixel locations and a visibility score for every subsequent frame, producing long, coherent trajectories. Its transformer-style architecture aggregates information both along time and across points, allowing it to recover tracks even after brief disappearances. The repository ships with inference scripts, pretrained weights, and simple interfaces to seed points, run tracking, and export trajectories for downstream tasks. ...

Downloads: 0 This Week

Last Update: 2025-10-12
24

video-subtitle-remover

AI-based tool for removing hardsubs and text-like watermarks

Video-subtitle-remover (VSR) is AI-based software that removes hardcoded subtitles from videos or pictures.

Downloads: 52 This Week

Last Update: 2024-01-09
25

PIFuHD

High-Resolution 3D Human Digitization from A Single Image

PIFuHD (Pixel-Aligned Implicit Function for 3D human reconstruction at high resolution) is a method and codebase to reconstruct high-fidelity 3D human meshes from a single image. It extends prior PIFu work by increasing resolution and detail, enabling fine geometry in cloth folds, hair, and subtle surface features. The method operates by learning an implicit occupancy / surface function conditioned on the image and camera projection; at inference time it queries dense points to reconstruct a mesh via marching cubes. ...

Downloads: 5 This Week

Last Update: 2025-10-06