Showing 45 open source projects for "avx"

  • 1
    Thorium

    High-performance, privacy-focused browser built as a fork of Chromium

    Thorium is a high-performance, privacy-focused web browser built as a fork of Chromium. It is designed to deliver faster browsing through aggressive compiler optimizations, including builds that target instruction sets such as SSE4.2 and AVX. These low-level enhancements are intended to let Thorium outperform standard Chrome and Chromium builds in responsiveness and efficiency. The browser removes much of Google’s built-in bloat to create a leaner browsing experience. Thorium also includes additional privacy features that reduce tracking and unnecessary background activity. Bug fixes and compatibility patches improve stability across Linux and other supported platforms. ...
    Downloads: 187 This Week
  • 2
    Mercury Browser

    Privacy-focused web browser fork of Firefox

    Mercury Browser is an optimized, privacy-focused web browser that is a fork of Mozilla Firefox. Its builds enable the AVX and AES instruction sets and use compiler optimizations such as link-time optimization (LTO) and profile-guided optimization (PGO) to enhance performance and security. With features derived from projects like LibreWolf, Waterfox, and Ghostery, Mercury disables telemetry and debugging elements by default, ensuring a more private browsing experience. It also includes usability patches that bring back features like the classic top bar and supports unsigned extensions for added flexibility. ...
    Downloads: 74 This Week
  • 3
    SIMD

    C++ wrappers for SIMD intrinsics

    ...However, differences across CPU architectures and compilers make direct usage complex, which xsimd addresses by offering a unified API that maps efficiently to underlying hardware capabilities. The library supports a wide range of instruction sets, including SSE, AVX, NEON, and WebAssembly SIMD, ensuring portability across platforms. It provides vectorized implementations of common mathematical operations, allowing developers to operate on batches of values using familiar syntax. xsimd is widely adopted in performance-critical applications.
    Downloads: 4 This Week
  • 4
    R3nzSkin

    Skin changer for League of Legends (LOL)

    R3nzSkin is an internal skin changer for League of Legends. Change the skin of your champion, your ward, other champions, towers, minions, and jungle monsters in the game. If your CPU supports the AVX, AVX2, or AVX-512 instruction sets, you can enable them in the project settings. This should result in more performant code, optimized for your CPU. Currently, SSE2 instructions are selected in the project settings. Change skins at any time, as often as you like, in a single game. Supports all popular languages in the world.
    Downloads: 29 This Week
  • 5
    Simd Library

    C++ image processing and machine learning library using SIMD

    ...It provides many useful high-performance algorithms for image processing such as pixel format conversion, image scaling and filtration, extraction of statistical information from images, motion detection, object detection and classification, and neural networks. The algorithms are optimized using various SIMD CPU extensions. In particular, the library supports the following CPU extensions: SSE, AVX, AVX-512, and AMX for x86/x64, and NEON for ARM. The Simd Library has a C API and also contains useful C++ classes and functions to facilitate access to the C API. The library supports dynamic and static linking, 32-bit and 64-bit Windows and Linux, MSVS, G++ and Clang compilers, MSVS projects, and CMake build systems.
    Downloads: 0 This Week
  • 6
    whisper.cpp

    Port of OpenAI's Whisper model in C/C++

    whisper.cpp is a lightweight, C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model—designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples....
    Downloads: 374 This Week
  • 7
    FEX

    A fast usermode x86 and x86-64 emulator for Arm64

    ...FEX presents a Linux 5.0 interface to the guest, and supports both AArch64 and x86-64 as hosts. FEX is very much a work in progress, so expect things to change. FEX has been tested to build and run on ARMv8.0, ARMv8.1+, and x86-64 (AVX or newer) hardware. ARMv7 and older x86 hardware will not work. The expected operating system is Linux. FEX has been tested with Ubuntu 20.04, 20.10, and 21.04, as well as Arch Linux. AArch64 hosts require a rootfs for running applications.
    Downloads: 50 This Week
  • 8
    ispc

    Intel SPMD Program Compiler

    ...Under the SPMD model, the programmer writes a program that generally appears to be a regular serial program, though the execution model is actually that a number of program instances execute in parallel on the hardware. ispc compiles a C-based SPMD programming language to run on the SIMD units of CPUs and GPUs; it frequently provides a 3x or more speedup on architectures with 4-wide vector SSE units and 5x-6x on architectures with 8-wide AVX vector units, without any of the difficulty of writing intrinsics code. Parallelization across multiple cores is also supported by ispc, making it possible to write programs whose performance scales with both the number of cores and the vector unit size. One stated goal is to build a small set of extensions to the C language that would deliver excellent performance to performance-oriented programmers.
    Downloads: 38 This Week
  • 9
    Intel Extension for PyTorch

    A Python package for extending the official PyTorch

    Intel® Extension for PyTorch* extends PyTorch* with up-to-date feature optimizations for an extra performance boost on Intel hardware. Optimizations take advantage of Intel® Advanced Vector Extensions 512 (Intel® AVX-512) Vector Neural Network Instructions (VNNI) and Intel® Advanced Matrix Extensions (Intel® AMX) on Intel CPUs as well as Intel Xe Matrix Extensions (XMX) AI engines on Intel discrete GPUs. Moreover, Intel® Extension for PyTorch* provides easy GPU acceleration for Intel discrete GPUs through the PyTorch* xpu device.
    Downloads: 1 This Week
  • 10
    Tullio.jl

    Tullio is a very flexible einsum macro

    Tullio is a very flexible einsum macro. It understands many array operations written in index notation -- not just matrix multiplication and permutations, but also convolutions, stencils, scatter/gather, and broadcasting. Used by itself the macro writes ordinary nested loops much like Einsum.@einsum. One difference is that it can parse more expressions, and infer ranges for their indices. Another is that it will use multi-threading (via Threads.@spawn) and recursive tiling, on large enough arrays.
    Downloads: 1 This Week
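    As a rough illustration of the index-notation idea, the following is a minimal Python sketch (not Tullio's Julia API) of the nested loops that an expression such as C[i, j] = A[i, k] * B[k, j] expands to, with the index ranges inferred from the array shapes. The function name `einsum_matmul` is hypothetical, and plain lists stand in for arrays.

```python
def einsum_matmul(a, b):
    """Nested-loop expansion of C[i, j] = sum over k of A[i, k] * B[k, j].

    Hypothetical helper for illustration only. Index ranges are inferred
    from the shapes of the inputs, and mismatched ranges are rejected.
    """
    n_i, n_k = len(a), len(a[0])
    if len(b) != n_k:
        raise ValueError("range of index k differs between A and B")
    n_j = len(b[0])
    c = [[0] * n_j for _ in range(n_i)]
    for i in range(n_i):
        for j in range(n_j):
            for k in range(n_k):  # k appears only on the right, so it is summed
                c[i][j] += a[i][k] * b[k][j]
    return c


product = einsum_matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]])
print(product)
```

    The sketch covers only the plain-loop case; the multi-threading and tiling mentioned above are not modeled.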
  • 11
    HLSL++

    Math library using HLSL syntax with multiplatform SIMD support

    ...It provides vector, matrix, and math operations with a syntax identical or very similar to HLSL, allowing seamless transition between shader code and application code. The library is optimized for performance and supports SIMD instructions across multiple architectures, including SSE, AVX, AVX2, AVX512, and ARM NEON, ensuring high efficiency on modern hardware. It also extends beyond standard HLSL capabilities by introducing additional features such as quaternion support, advanced matrix operations, and extended vector types like float8. The library is particularly valuable for game developers who need consistency between CPU and GPU computations, reducing errors and improving maintainability.
    Downloads: 1 This Week
  • 12
    XNNPACK

    High-efficiency floating-point neural network inference operators

    ...Rather than serving as a standalone ML framework, XNNPACK provides high-performance computational primitives—such as convolutions, pooling, activation functions, and arithmetic operations—that are integrated into higher-level frameworks like TensorFlow Lite, PyTorch Mobile, ONNX Runtime, TensorFlow.js, and MediaPipe. The library is written in C/C++ and designed for maximum portability, efficiency, and performance, leveraging platform-specific instruction sets (e.g., NEON, AVX, SIMD) for optimized execution. It supports NHWC tensor layouts and allows flexible striding along the channel dimension to efficiently handle channel-split and concatenation operations without additional cost.
    Downloads: 5 This Week
  • 13
    CPU Features

    A cross platform C99 library to get cpu features at runtime

    cpu_features is a cross-platform C library developed by Google that provides a simple and efficient way to detect available CPU features at runtime across a wide range of architectures and operating systems. It enables applications to determine which instruction sets (such as SSE, AVX, or NEON) are supported on the host machine, allowing developers to optimize performance dynamically. The library supports numerous architectures—including x86, ARM, AArch64, MIPS, POWER, RISCV, LoongArch, and s390x—and works on major operating systems like Linux, macOS, Windows, FreeBSD, Android, and iOS. Implemented in portable C99, it is thread-safe, has no memory allocations, and raises no exceptions, making it suitable even for use in low-level system libraries. ...
    Downloads: 1 This Week
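    The runtime-dispatch pattern described above can be sketched in Python. This is not the cpu_features API: it assumes Linux-style `/proc/cpuinfo` text, where x86 kernels list features on a `flags` line and ARM kernels on a `Features` line. The names `parse_cpu_flags` and `pick_kernel` are hypothetical.

```python
def parse_cpu_flags(cpuinfo_text):
    """Return the set of feature flags found in /proc/cpuinfo-style text.

    Uses the first "flags" line (x86) or "Features" line (ARM).
    """
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() in ("flags", "Features"):
            return set(value.split())
    return set()


def pick_kernel(flags):
    """Choose the widest code path the detected flags allow (illustrative names)."""
    if "avx2" in flags:
        return "avx2"
    if "avx" in flags:
        return "avx"
    if "sse2" in flags:
        return "sse2"
    return "scalar"


# Sample text in the assumed format; on Linux the real file could be read instead.
sample = "processor\t: 0\nflags\t\t: fpu sse sse2 avx\n"
choice = pick_kernel(parse_cpu_flags(sample))
print(choice)
```

    Checking the widest instruction set first and falling back to a scalar path keeps the program usable on hosts that lack the newer extensions.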
  • 14
    OpenGL Mathematics

    Highly Optimized Graphics Math (glm) for C

    Highly optimized 2D|3D math library, also known as OpenGL Mathematics (glm) for `C`. cglm provides many utilities that make math operations fast and quick to write. It is community-friendly, so feel free to report any issues or bugs you encounter. Almost all functions (inline versions) and parameters are documented inside the corresponding headers. OpenGL-related functions are dropped to make this lib platform/third-party independent. Make sure you have the latest version and feel free to report...
    Downloads: 1 This Week
  • 15
    bitResurrector

    Bitcoin private key recovery tool with Bloom Filter & CUDA

    ...The engine exploits specific entropy vulnerabilities (CVE-2013-7372) and uses a 9-echelon statistical filter to segregate mathematical "corpses" from potentially valid keys. Technological Stack: - Zero-Latency Bloom Filter: Real-time matching against 58M+ active addresses (Loyce Club data). - Turbo Core: C++/AVX-512 optimization with processor affinity. - GPU Mode: Massive parallel computation via NVIDIA CUDA. - Memory Management: Zero disk I/O lag using mmap (Memory-Mapped Files). BitResurrector is a closed-source, proprietary tool for researchers and professionals.
    Downloads: 42 This Week
  • 16
    SpectrumViewer for Windows

    Continuous display of audio spectrum from PC input or wave file

    Reads audio from Windows audio interface or wave file and creates a continuous spectrum display. Command line support for generating sine, square, triangle and sawtooth wave files. Uses biquad IIR digital filters for frequency detection, plus digital lowpass filters for output extraction. Highly configurable. 60+ FPS with suitable hardware. Manual or automatic gain control.
    Downloads: 10 This Week
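    To show how a biquad IIR filter can detect energy at one frequency, here is a minimal Python sketch. SpectrumViewer's actual filter design is not given here, so the band-pass coefficients follow the widely used Audio EQ Cookbook formulas as an assumption, and all function names are hypothetical.

```python
import math


def bandpass_coefficients(center_hz, sample_rate_hz, q):
    """Band-pass biquad coefficients (0 dB peak gain), normalized so a0 = 1."""
    w0 = 2.0 * math.pi * center_hz / sample_rate_hz
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b = (alpha / a0, 0.0, -alpha / a0)
    a = (-2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0)
    return b, a


def biquad_filter(samples, b, a):
    """Direct form I: y[n] = b0*x[n] + b1*x[n-1] + b2*x[n-2] - a1*y[n-1] - a2*y[n-2]."""
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in samples:
        y = b[0] * x + b[1] * x1 + b[2] * x2 - a[0] * y1 - a[1] * y2
        x2, x1 = x1, x
        y2, y1 = y1, y
        out.append(y)
    return out


def rms(values):
    """Root-mean-square level of a sequence of samples."""
    return math.sqrt(sum(v * v for v in values) / len(values))


def tone(freq_hz, sample_rate_hz, count):
    """Unit-amplitude sine wave."""
    return [math.sin(2.0 * math.pi * freq_hz * n / sample_rate_hz) for n in range(count)]


RATE = 8000
b, a = bandpass_coefficients(1000.0, RATE, q=5.0)
level_on = rms(biquad_filter(tone(1000.0, RATE, 4000), b, a))   # tone at the center
level_off = rms(biquad_filter(tone(200.0, RATE, 4000), b, a))   # tone far below it
print(level_on > level_off)
```

    A spectrum display could run one such filter per frequency bin and plot the output levels; that arrangement is a guess at the general approach, not a description of this program.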
  • 17
    QuickMAN is a Mandelbrot fractal generator with multicore support. ASM-optimized code can reach over 100 billion iterations per second. Features an easy-to-use GUI, realtime pan/zoom, multiple palettes, image logging, and saving in PNG format.
    Downloads: 5 This Week
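    The "iterations" counted above refer to the escape-time loop at the heart of any Mandelbrot renderer. A minimal Python sketch of that loop follows; it is not QuickMAN's ASM-optimized code, and the function name `escape_iterations` is hypothetical.

```python
def escape_iterations(c, max_iterations):
    """Count iterations of z -> z*z + c (starting at z = 0) before |z| exceeds 2.

    Returns max_iterations when the point never escapes within the budget,
    which is how points inside the set are usually identified.
    """
    z = 0j
    for n in range(max_iterations):
        if abs(z) > 2.0:
            return n
        z = z * z + c
    return max_iterations


inside = escape_iterations(-1 + 0j, 50)   # orbit alternates between -1 and 0
outside = escape_iterations(2 + 2j, 50)   # leaves the radius-2 disk immediately
print(inside, outside)
```

    Each pixel is independent of the others, which is why this workload suits both SIMD units and multiple cores.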
  • 18
    Vector Pascal is a language targeted at SIMD multi-core instruction sets such as AVX, SSE2, or x86-64-v3. It has a SIMD compiler that supports parallel vector operations, loop unrolling, common subexpression removal, etc. It is implemented in Java.
    Downloads: 4 This Week
  • 19

    UniSIMD-assembler

    SIMD macro assembler unified for ARM, MIPS, PPC and x86

    ...The assembler itself isn't a separate tool, but rather a collection of C/C++ header files, which applications need to include directly in order to use. At present, Intel SSE/SSE2/SSE4 and AVX/AVX2/AVX-512 (32/64-bit x86 ISAs), ARMv7 NEON/NEONv2, ARMv8 AArch32 and AArch64 NEON, SVE (32/64-bit ARM ISAs), MIPS 32/64-bit r5/r6 MSA and POWER 32/64-bit VMX/VSX (little/big-endian ISAs) are mostly implemented (with horizontal reductions), although scalar improvements and wider SIMD vectors with zeroing/merging predicates in 3/4-operand instructions are planned as extensions to the current 2/3-operand SPMD-driven vertical SIMD ISA. ...
    Downloads: 5 This Week
  • 20

    y-cruncher

    y-cruncher is a program that can compute Pi and other constants

    y-cruncher is a program that can compute Pi and other constants to trillions of digits. It is the first of its kind that is multi-threaded and scalable to multi-core systems. Ever since its launch in 2009, it has become a common benchmarking and stress-testing application for overclockers and hardware enthusiasts.
    Downloads: 52 This Week
  • 21
    powerMAX

    powerMAX is a CPU and GPU burn-in test

    ...It helps users uncover stability issues, cooling weaknesses, and power delivery problems by applying maximum, sustained stress to both the processor and graphics card. The utility supports dedicated CPU tests—SSE or AVX—and a demanding GPU 3D rendering test, with the option to run both simultaneously for full-system power load evaluation. Because it does not generate scores or benchmarks, powerMAX is strictly a stress-testing tool rather than a performance comparison utility. The software requires no installation, leaves no system traces, and runs on any Windows system from Windows 7 onward. ...
    Downloads: 22 This Week
  • 22
    Asm-Dude

    Visual Studio extension for syntax highlighting assembly

    ...If assembly is too much of a hassle but you still want access to specific machine instructions, consider Intrinsics-Dude. Descriptions are provided for the x86 and x64 instruction sets, as well as SSE, AVX, AVX2, and Xeon-Phi (Knights Corner) instructions. Most of the regularly used Masm directives are supported, along with some Nasm directives. If you are not happy with the highlighting or the descriptions, mnemonics and descriptions can be added and changed by updating the AsmDudeData.xml file that will be stored next to the binaries when installing the plugin (.vsix).
    Downloads: 3 This Week
  • 23
    Linux Studio Plugins Project

    LSP (Linux Studio Plugins) is a collection of open-source plugins currently compatible with LADSPA, LV2 and LinuxVST formats. Standalone plugins for JACK are provided since version 1.0.8. Experimental support of ARMv7 was added in version 1.1.4 and experimental support of AArch64 in version 1.1.9. Decomposition of modules and a new UI were introduced in 1.2.0. CLAP support was added in 1.2.5 and VST3 support in 1.2.15. The basic idea is to fill the lack of good and useful plugins under the...
    Downloads: 44 This Week
  • 24

    QuadRay-engine

    Realtime raytracer using SIMD on ARM, MIPS, PPC and x86

    ...The rendering core of the engine is written in a unified SIMD assembler allowing single assembler code to be compatible with different processor architectures, thus reducing the need to maintain multiple parallel versions. At present, Intel SSE/SSE2/SSE4 and AVX/AVX2/AVX-512 (32/64-bit x86 ISAs), ARMv7 NEON/NEONv2, ARMv8 AArch32 and AArch64 NEON, SVE (32/64-bit ARM ISAs), MIPS 32/64-bit r5/r6 MSA and POWER 32/64-bit VMX/VSX (little/big-endian ISAs) are mostly implemented (with horizontal reductions), although scalar improvements and wider SIMD vectors with zeroing/merging predicates in 3/4-operand instructions are planned as extensions to the current 2/3-operand SPMD-driven vertical SIMD ISA. ...
    Downloads: 0 This Week
  • 25

    libsombrero

    Astronomical object/structure detection from 1D and 2D data sets.

    Sombrero is a fast wavelet image processing and object detection C library for astronomical images. Sombrero is named after the "Mexican Hat" shape of the wavelet masks used in image convolution and is released under the GNU LGPL.
    Downloads: 0 This Week
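    For readers unfamiliar with the "Mexican Hat" shape, the following Python sketch evaluates the standard 1D Mexican hat (Ricker) wavelet. It is only an illustration of the shape: libsombrero's actual mask coefficients are not given here, and the function name `mexican_hat` and the sample grid are assumptions.

```python
import math


def mexican_hat(t, sigma=1.0):
    """1D Mexican hat (Ricker) wavelet with unit L2 norm.

    psi(t) = 2 / (sqrt(3*sigma) * pi**0.25) * (1 - (t/sigma)**2) * exp(-t**2 / (2*sigma**2))
    """
    u = t / sigma
    norm = 2.0 / (math.sqrt(3.0 * sigma) * math.pi ** 0.25)
    return norm * (1.0 - u * u) * math.exp(-0.5 * u * u)


# Sample the wavelet on a small symmetric grid, as a convolution mask might.
mask = [mexican_hat(t) for t in range(-4, 5)]
print(["%.3f" % value for value in mask])
```

    The positive central peak flanked by negative lobes is what makes the wavelet respond to compact, roughly blob-shaped sources against a smooth background.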