Showing 71 open source projects for "simd"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Atera - an All-in-one platform for IT management Icon
    Atera - an All-in-one platform for IT management

    Ideal for IT departments and MSPs (managed service providers)

    Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!
    Try Atera now
  • 1
    sleef

    sleef

    Vectorized libm

    SLEEF stands for SIMD Library for Evaluating Elementary Functions. SLEEF implements vectorized versions of all C99 math functions, that utilize SIMD instructions of modern processors to make computation more efficient. The library also includes vectorized DFT subroutines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Vector Pascal is a language targeted at SIMD multi-core instruction-sets such as the AVX and SSE2 or x86-64-v3. It has a SIMD compiler which supports parallel vector operations, loop unrolling, common sub expression removal etc. It is implemented in Java.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    HighwayHash

    HighwayHash

    Fast strong hash functions: SipHash/HighwayHash

    HighwayHash is a fast, keyed hash function intended for scenarios where you need strong, DoS-resistant hashing without the full overhead of a general-purpose cryptographic hash. It’s designed to defeat hash-flooding attacks by mixing input with wide SIMD operations and a branch-free inner loop, so adversaries can’t cheaply craft many colliding keys. The implementation targets multiple CPU families with vectorized code paths while keeping a portable fallback, yielding high throughput across platforms. It exposes simple one-shot and streaming APIs, so you can hash short keys or long byte streams with the same function. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    GNSS-SDR

    GNSS-SDR

    An open source software-defined GNSS receiver

    An open source software-defined Global Navigation Satellite Systems (GNSS) receiver written in C++ and based on the GNU Radio framework.
    Leader badge
    Downloads: 1,589 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 5
    LinAsm

    LinAsm

    Collection of fast and optimized assembly libraries for x86-64 Linux

    LinAsm is collection of very fast and SIMD optimized assembly written libraries for x86-64 Linux. It implements many common and widely used algorithms for array manipulations: searching, sorting, arithmetic and vector operations, unit conversions; fast mathematical and statistic functions; numbers and time converting algorithms; finite impulse response (FIR) digital filters; spectrum analysis algorithms, Fast Hartley transformation; CPU cache friendly functions and extremely fast abstract data types (ADT) such as hash tables b-trees, and much more. ...
    Leader badge
    Downloads: 21 This Week
    Last Update:
    See Project
  • 6
    Armadillo

    Armadillo

    fast C++ library for linear algebra & scientific computing

    * Fast C++ library for linear algebra (matrix maths) and scientific computing * Easy to use functions and syntax, deliberately similar to Matlab / Octave * Uses template meta-programming techniques to increase efficiency * Provides user-friendly wrappers for OpenBLAS, Intel MKL, LAPACK, ATLAS, ARPACK, SuperLU and FFTW libraries * Useful for machine learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc. * Downloads:...
    Leader badge
    Downloads: 2,993 This Week
    Last Update:
    See Project
  • 7

    Fosite - advection problem solver

    numerical simulation code for solving transport equations in 1D/2D/3D

    ...Fosite is written with object-oriented patterns in Fortran 2003 and follows the Structure of Arrays (SoA) layout, operating on generic field datatypes. This allows for high performance on modern architectures (SIMD). It is parallelized and vectorized. The software is thereby optimized for the NEC SX-Aurora TSUBASA Vector Engine .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    libsombrero

    Astronomical object/structure detection from 1D and 2D data sets.

    Sombrero is a fast wavelet image processing and object detection C library for astronomical images. Sombrero is named after the "Mexican Hat" shape of the wavelet masks used in image convolution and is released under the GNU LGPL library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    libjpeg-turbo

    libjpeg-turbo

    SIMD-accelerated libjpeg-compatible JPEG codec library

    libjpeg-turbo is a JPEG image codec that uses SIMD instructions (MMX, SSE2, NEON, AltiVec) to accelerate baseline JPEG compression and decompression on x86, x86-64, ARM, and PowerPC systems. On such systems, libjpeg-turbo is generally 2-6x as fast as libjpeg, all else being equal. On other types of systems, libjpeg-turbo can still outperform libjpeg by a significant amount, by virtue of its highly-optimized Huffman coding routines.
    Leader badge
    Downloads: 43,119 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 10
    TurboPFor

    TurboPFor

    Fastest Integer Compression

    Fastest Integer Compression. ALL functions are available for AMD/Intel, 64-bit ARMv8 NEON Linux+MacOS/M1 & Power9 Altivec. 100% C (C++ headers), as simple as memcpy. OS:Linux amd64, arm64, Power9, MacOs (Amd/intel + Apple M1).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Forma

    Forma

    An efficient vector-graphics renderer

    ...The project aims to achieve portability, performance, simplicity, and small footprint through a streamlined four-stage rendering pipeline. Forma provides both CPU (software) and GPU (hardware) backends, relying on Rust’s SIMD auto-vectorization, Rayon for multithreading, and WebGPU (wgpu) for hardware acceleration. The renderer processes Bézier curves, line segments, and pixels through stages of flattening, rasterization, sorting, and painting, updating only changed tiles for efficiency. This design allows Forma to render complex vector scenes—such as large-scale SVGs—at interactive frame rates even on CPUs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    dispy

    Distributed and Parallel Computing with/for Python.

    dispy is a generic and comprehensive, yet easy to use framework for creating and using compute clusters to execute computations in parallel across multiple processors in a single machine (SMP), among many machines in a cluster, grid or cloud. dispy is well suited for data parallel (SIMD) paradigm where a computation (Python function or standalone program) is evaluated with different (large) datasets independently. dispy supports public / private / hybrid cloud computing, fog / edge computing.
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    Klogg

    Klogg

    Really fast log explorer based on glogg project

    Klogg is an open source multi-platform GUI application to search through all kinds of text log files using regular expressions. It has started as fork of glogg project created by Nicolas Bonnefon and has evolved into a separate project with a lot of new features and improvements.
    Downloads: 75 This Week
    Last Update:
    See Project
  • 14
    libfacedetection

    libfacedetection

    Library for face detection in images

    ...The source code does not depend on any other libraries. What you need is just a C++ compiler. You can compile the source code under Windows, Linux, ARM and any platform with a C++ compiler. SIMD instructions are used to speed up the detection. You can enable AVX2 if you use Intel CPU or NEON for ARM. The model file has also been provided in directory ./models/. The file examples/detect-image.cpp and examples/detect-camera.cpp show how to use the library. The library was trained by libfacedetection.train. You can copy the files in directory src/ into your project, and compile them as the other files in your project. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Simd

    Simd

    High performance image processing library in C++

    ...The algorithms are optimized with using of different SIMD CPU extensions. In particular the library supports following CPU extensions: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX-512 for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM. The Simd Library has C API and also contains useful C++ classes and functions to facilitate access to C API. The library supports dynamic and static linking, 32-bit and 64-bit Windows, Android and Linux, MSVS, G++ and Clang compilers, MSVS project and CMake build systems.
    Leader badge
    Downloads: 16 This Week
    Last Update:
    See Project
  • 16
    Pikkr

    Pikkr

    JSON parser to pick up values directly without performing tokenization

    ...Pikkr is a JSON parser that picks up values directly without performing tokenization in Rust. Creates an index which maps logical locations of queried fields to their physical locations by using SIMD instructions and bit manipulation. Finds values of queried fields by scanning a JSON record using the index created in the previous process and learns their logical locations (i.e. pattern of the JSON structure) in the early stages. Speculates logical locations of queried fields by using the learned result information, jumps directly to their physical locations and extracts values in the later stages. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    MathFu

    MathFu

    C++ math library developed primarily for games focused on simplicity

    ...It provides fixed-size vector, matrix, and quaternion types with APIs that mirror the operations you actually use in inner loops—dot, cross, normalize, transforms, and projections. Implementations are carefully tuned to leverage SIMD instruction sets such as SSE and NEON while preserving a clean fallback path for portability. Because types are fixed at compile time, the compiler can aggressively inline and unroll code, producing predictable performance and tiny binaries. The design emphasizes minimal dependencies and clear ownership so the library can be embedded into engines without dragging in heavyweight frameworks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    LightPCC

    Parallel pairwise correlation computation on Intel Xeon Phi clusters

    The first parallel and distributed library for pairwise correlation/dependence computation on Intel Xeon Phi clusters. This library is written in C++ template classes and achieves high speed by exploring the SIMD-instruction-level and thread-level parallelism within Xeon Phis as well as accelerator-level parallelism among multiple Xeon Phis. To facilitate balanced workload distribution, we have proposed a general framework for symmetric all-pairs computation by building provable bijective functions between job identifier and coordinate space for the first time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    smartIDS

    Lightweight intrusion detection for IoT and embedded devices.

    The aim of the project is a lightweight intrusion detection library for embedded devices which supports MSP430 and ARM Cortex based devices. Features include DSP/SIMD support, IoT and embedded protocols, distributed operation, event and history management, tool supported configuration and visualization. There is a Java port that supports less features.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    ...Yet for applications like metagenomics, where a large amount of data is involved, DIAMOND still takes too much time. We introduce AC-DIAMOND, which attempts to speed up DIAMOND via better SIMD parallelization and reference indexing. Experimental results show that AC-DIAMOND is about 4 times faster than DIAMOND on aligning DNA reads or contigs while retaining the essentially the same sensitivity.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    RandomLib

    Random number library

    RandomLib is a C++ interface to the Mersenne Twister random number generator MT19937 and to the SIMD-oriented Fast Mersenne Twister random number generator, SFMT19937. For documentation, visit http://randomlib.sf.net
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    jpegant

    jpegant

    Embedded JPEG encoder

    ...Release 1.1 source code and Windows executables are on the download page. For the latest release code look into the repository in 'release-1-0' branch. The SSE2 inplementation is present in 'simd.0' branch in the repository.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    DD-AVX

    DD-AVX

    DD-AVX: Library of high-precision operations accelerated by AVX

    ...author's page http://www.slis.tsukuba.ac.jp/~s1530534/index.html e-mail hishinuma@slis.tsukuba.ac.jp Double-Double (DD) precision operations are used to reduce rounding errors and improve the convergence of Krylov subspace methods. This library has Double-Double precision operations accelerated by AVX and AVX2. AVX and AVX2 are intel SIMD instructions. They operate four double precision operation simultaneously. This library needs to merge "Lis" for DD precision iterative solvers. Official support "Lis" version is "lis-1.4.58." After merge "lis" and "DD-AVX," the interface is same as "lis." Lis web page http://www.ssisc.org/lis/index.en.html This library is managed by git. old version can search "code" page.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    DeltaCalibrationSim

    Simulator for delta calibration parameters

    This is a program to simulate the errors in delta printers to help one understand how modifying any of the correction parameters will effect the height map. It can also display errors in x, and y directions, as well as the magnitude of xy, and xyz errors. Custom color gradients are supported, including using alpha, making it easy to find test points for a least squares calibration routine. Images can be exported to any format Qt supports (PNG, BMP, JPG, PDF, etc). Simulation parameters...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    SWAPHI-LS: Alignment on Xeon Phi Cluster

    Smith-Waterman long DNA sequence alignment on Xeon Phi clusters

    The first parallel Smith-Waterman algorithm exploiting Intel Xeon Phi clusters to accelerate the alignment of long DNA sequences. This algorithm is written in C++ (with a set of SIMD intrinsic extensions), OpenMP and MPI. The performance evaluation revealed that our algorithm achieves very stable performance, and yields a performance of up to 30.1 GCUPS on a single Xeon Phi and up to 111.4 GCUPS on four Xeon Phis sharing a host.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo