Download Latest Version OpenBLAS-0.3.30-woa64-64-static.zip (7.9 MB)
Email in envelope

Get an email when there's a new version of OpenBLAS

Home / v0.3.30
Name Modified Size InfoDownloads / Week
Parent folder
OpenBLAS-0.3.30-woa64-64-static.zip 2025-06-24 7.9 MB
OpenBLAS-0.3.30-woa64-dll.zip 2025-06-23 5.0 MB
OpenBLAS-0.3.30-woa64-static.zip 2025-06-23 8.0 MB
OpenBLAS-0.3.30-x64-64.zip 2025-06-19 40.1 MB
OpenBLAS-0.3.30-x64.zip 2025-06-19 40.6 MB
OpenBLAS-0.3.30-x86.zip 2025-06-19 22.2 MB
OpenBLAS-0.3.30.zip 2025-06-19 42.8 MB
OpenBLAS-0.3.30.tar.gz 2025-06-19 24.7 MB
OpenBLAS 0.3.30 version source code.tar.gz 2025-06-19 24.7 MB
OpenBLAS 0.3.30 version source code.zip 2025-06-19 43.1 MB
README.md 2025-06-19 7.7 kB
Totals: 11 Items   259.1 MB 14

general:

  • fixed an installation problem with the thread safety test in gmake builds
  • fixed spurious overwriting of an input array in complex GEMMT/GEMMTR
  • fixed naming of GEMMTR in error messages from XERBLA
  • fixed compilation of SBGEMMT/SBGEMMTR in CMake builds
  • fixed the implementation of ?NRM2 to handle INCX=0 correctly
  • removed tests for CSROT and ZDROT that relied on unspecified behavior
  • fixed a performance regression in multithreaded GEMM that was particularly serious on POWER targets
  • fixed linking issues when using LLVM's flang-new with gmake
  • fixed a potential thread safety problem with C11 atomic operations
  • further improved the workload partitioning in parallel GEMM
  • fixed omission of LAPACKE interfaces for CGESVDQ,CTRSYL3 and ?GEQPF in CMake builds
  • fixed mishandling of setting NO_LAPACK to FALSE, and incorrect dependencies for LAPACK function SPMV in CMake builds
  • added explicit CMake options for building LAPACKE and shared libraries
  • simplified and improved handling of OpenMP options in CMake builds
  • reworked Windows DLL generation in CMake builds to ensure correct symbol renaming (pre/postfixing) and optional generation of PDB files for debugging
  • updated the Perl script version of the gensymbol utility for use with Windows-on-Arm
  • Fixed building with (Mingw) gmake on Windows to ensure completeness of the LAPACK included in the static library (potential race condition due to the Windows version of the "ln" utility creating snapshot copies rather than links)
  • fixed unwanted deletion of the lapacke_mangling.h file by "make clean"
  • fixed potential duplication of a _64 suffix on library names in CMake builds
  • fixed compilation of the C fallback copies of the LAPACK code with GCC 15
  • included fixed from the Reference-LAPACK project:
  • fixed a truncated error message in the EIG part of the testsuite (Reference-LAPACK PR 1119)
  • fixed too strict check in LAPACKE_?gesdd_work (PR [#1126])
  • fixed memory corruption when calling ?GEEV with non-finite data (PR [#1128])
  • fixed missing initialization of a variable in C/GEQP3RK (PR [#1131])
  • fixed 2nd dimension chosen in C/ZUNMLQ transposition operation (PR [#1135])

x86_64:

  • fixed an error in the SBGEMV kernel for Cooper Lake/Sapphire Rapids
  • fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
  • improved the compiler identification code for flang-new
  • fixed a potential build issue in the ZSUM kernel
  • fixed "argument list too long" errors when building on MacOS
  • added cpu autodetection support for several new Arrow Lake models
  • fixed conditional inclusion of the fast path SGEMM kernel in DYNAMIC_ARCH
  • fixed compilation with the MinGW build of GCC 15

x86:

  • fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
  • worked around potential miscompilation of CDOT with very old binutils

arm64:

  • fixed cpu type detection of A64FX and some ThunderX models (broken in 0.3.29)
  • added support for the AmpereOne/1A cpus in DYNAMIC_ ARCH builds
  • added an optimized SBGEMM kernel for NEOVERSEV1
  • improved 1xN SBGEMM performance by forwarding to SBGEMV
  • introduced a stepwise increase of the thread count used for SGEMM and SGEMV on NEOVERSEV1/V2 in relation to problem size
  • introduced a stepwise increase of the thread count used for DGEMV on NEOVERSEV1 in relation to problem size
  • introduced a stepwise increase of the thread count used for SDOT and DDOT on NEOVERSEV1 in relation to problem size
  • worked around assembler limitations in LLVM for Windows-on-Arm
  • enabled cpu type autodetection from the registry on Windows-on-Arm
  • improved multithreading threshold for GEMV and GESV on Windows-on-Arm
  • fixed overoptimization issues with LLVM's flang in Windows-on-Arm
  • fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
  • added a fast path SGEMM kernel for small workloads on SME capable targets
  • improved performance of SGEMM and DGEMM kernels for small workloads
  • improved performance of SGEMV and DGEMV on SVE-capable targets
  • improved performance of SGEMV on NEOVERSEN1 and Apple M
  • added optimized SSYMV and DSYMV kernels for NEOVERSEN1, Apple M and all SVE capable targets
  • added optimized SBGEMV kernels for NEOVERSEV1/V2/N2
  • improved performance of SGEMM through faster NCOPY kernels
  • added compiler options for the NVIDIA HPC Compiler Suite
  • fixed compilation on OSX with XCode 16.3 and later
  • fixed cpu core type and cache size detection on Apple M4
  • updated GEMM parameter settings for Neoverse cpus in cross-builds with CMake
  • fixed default compiler options for NEOVERSEN1 and CORTEXX2 in CMake builds
  • fixed conditional inclusion of the fast path SGEMM kernel in DYNAMIC_ARCH
  • fixed potential miscompilation of the non-SVE SDOT kernel

arm:

  • fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
  • fixed unwanted generation of object files with a writable stack

riscv64:

  • added optimized SROTM and DROTM kernels for x280
  • fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
  • improved performance of GEMM_TCOPY on RVV1.0 targets with VLEN of 128 or 256
  • improved performance of OMATCOPY on targets with VLEN 256
  • greatly improved performance of SGEMV/DGEMV
  • improved performance of CGEMV and ZGEMV on C910V and all RVV targets with VLEN 256
  • improved performance of SAXPBY and DAXPBY on C910V and all RVV targets with VLEN 256
  • improved performance of AXPY and DOT on C910V and ZVL256B targets by falling back to non-vectorized code for very small N. (Thereby fixing poor performance of CHBMV/ZHBMV for very small K)
  • fixed CMake build failures of the TRMM kernels

loongarch64:

  • improved performance of the LSX versions of SSYMV/DSYMV
  • made the LASX versions of the DSYMV and SSYMV kernels compatible with hardware changes in LA664 and future targets
  • fixed inaccuracies in several LASX kernels
  • improved compatibility of LSX kernels with LA264 targets
  • fixed handling of deprecated target names in CMake builds
  • fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL

power:

  • fixed building for PPCG4 with CMake
  • fixed SSCAL/DSCAL on PPC970 running FreeBSD
  • fixed a potential alignment issue in the POWER8 SGEMV kernel
  • fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL

zarch:

  • fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
  • fixed unwanted generation of object files with a writable stack

sparc:

  • fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL

alpha:

  • fixed build failure caused by spurious Windows-only typecasts

cell:

  • fixed probable build issue caused by spurious Windows-only typecasts

md5sums 8db3d57f4d4485c6ae3f21ea465660e7 OpenBLAS-0.3.30.tar.gz a6075d90482591784da1dec6451d66aa OpenBLAS-0.3.30.zip 2cd46445f78509f48b6c7a541a4fd21b OpenBLAS-0.3.30-x86.zip 35035a89ea55065c867444983707b4f7 OpenBLAS-0.3.30-x64.zip d07457f9ae31a4e38713309491139e3b OpenBLAS-0.3.30-x64-64.zip 22a6c5f91a9a26180e86e829ddabc0d9 OpenBLAS-0.3.30-woa64-static.zip 63bcb67ac1c3b3d125f55af92690c5f3 OpenBLAS-0.3.30-woa64-dll.zip a96489fae4a9f746e57efc771dbf1935 OpenBLAS-0.3.30-woa64-64-static.zip

Download OpenBLAS

Source: README.md, updated 2025-06-19