| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| README.md | 2026-04-01 | 5.2 kB | |
| Simd v7.0.160 source code.tar.gz | 2026-04-01 | 4.8 MB | |
| Simd v7.0.160 source code.zip | 2026-04-01 | 6.5 MB | |
| Totals: 3 Items | | 11.4 MB | 1 |

Algorithms
New features
- Support for a non-constant B matrix in the SynetInnerProduct32f framework.
- Function SimdSynetInnerProduct32fExternalBufferSize.
- Parameter 'activation' to function SimdSynetInnerProduct16bInit.
- Parameter 'params' to function SimdSynetInnerProduct16bSetParams.
- Base implementation of class SynetGatherElements.
- Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function SynetNormalizeLayerForward16bV2.
- Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function SynetSoftmax16b.
- Support of HVX extension (Hexagon platform).
- HVX optimizations of function AbsDifference.
- HVX optimizations of function AbsDifferenceSum.
- HVX optimizations of function AbsGradientSaturatedSum.
- HVX optimizations of function AddFeatureDifference.
- HVX optimizations of function BgrToGray.
- HVX optimizations of function BgrToRgb.
- HVX optimizations of function FillBgra.
- HVX optimizations of function FillPixel.
- HVX optimizations of function AbsSecondDerivativeHistogram.
- HVX optimizations of function HistogramMasked.
- HVX optimizations of function HistogramConditional.
- HVX optimizations of function OperationBinary8u.
- HVX optimizations of function GetStatistic.
- HVX optimizations of function GetRowSums.
- HVX optimizations of function GetColSums.
- HVX optimizations of function GetAbsDyRowSums.
- HVX optimizations of function GetAbsDxColSums.
- HVX optimizations of function ValueSum.
- HVX optimizations of function SquareSum.
- HVX optimizations of function ValueSquareSum.
- HVX optimizations of function ValueSquareSums.
- HVX optimizations of function CorrelationSum.
- SSE4.1, AVX2, AVX-512BW optimizations of function BgrToHsl.
Improving
- AMX-BF16 optimizations of class SynetMergedConvolution16bCdc.
- AMX-BF16 optimizations of class SynetMergedConvolution16bCd.
- AMX-BF16 optimizations of class SynetMergedConvolution16bDc.
- AMX-BF16 optimizations of class SynetInnerProduct16bGemmNN.
Bug fixing
- Error in SSE4.1, AVX2, AVX-512BW optimizations of function SynetQuantizedPreluLayerForward (possible aligned read of unaligned memory).
- Error in SSE4.1, AVX2, AVX-512BW optimizations of function SynetQuantizedScaleLayerForward (possible aligned read of unaligned memory).
- Error in SSE4.1 optimizations of class ResizerFloatBilinear (possible aligned read of unaligned memory).
- Error in SSE4.1 optimizations of class ResizerBf16Bilinear (possible aligned read of unaligned memory).
- Error in SSE4.1, AVX2 optimizations of class ResizerByteBilinear (possible aligned read of unaligned memory).
- Error in SSE4.1 optimizations of class ResizerByteBilinear (possible aligned write to unaligned memory).
- Error in SSE4.1 optimizations of class ResizerFloatBilinear (possible aligned write to unaligned memory).
- Memory leak in function Simd::Detection::LoadStringXml.
- Possible crash in function Simd::ImageLoadFromFile.
- Possible crash in function Simd::Base64Decode.
- Possible crash in AVX-512BW optimization of function TransformImageRotate270.
- Performance bug in AVX-512BW optimization of class SynetMergedConvolution32fCdc.
- Performance bug in AMX-BF16 optimization of class SynetMergedConvolution16bCdc.
- Compiler error in assert conditions in function Simd::DeinterleaveBgra.
- Compiler error in assert conditions in function Simd::DeinterleaveRgb.
- Compiler error in assert conditions in function Simd::DeinterleaveRgba.
- Compiler error in assert conditions in function Simd::GetObjectMoments.
- MSVS compiler bug in Base implementation of function BgrToHsl (Release, x64).
- MSVS compiler bug in Base implementation of function SynetQuantizedScaleLayerForward (Release, Win32).
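Several of the fixes above concern aligned loads or stores issued on memory that is not guaranteed to be aligned. The following is a minimal sketch of that bug class and of the usual remedy, not the library's actual code: the helper names `IsAligned` and `SumAnyAlignment` are hypothetical, and the SSE path is only compiled when the compiler reports SSE support.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

#if defined(__SSE__) || defined(_M_X64) || (defined(_M_IX86_FP) && _M_IX86_FP >= 1)
#include <xmmintrin.h>
#define SKETCH_HAS_SSE 1
#else
#define SKETCH_HAS_SSE 0
#endif

// Hypothetical helper (not a Simd API): true if ptr is a multiple of `align` bytes.
inline bool IsAligned(const void* ptr, std::size_t align = 16)
{
    return reinterpret_cast<std::uintptr_t>(ptr) % align == 0;
}

// Hypothetical example routine: sums `size` floats starting at `src`.
// The aligned load (_mm_load_ps) is used only when the address has been
// checked to be 16-byte aligned; otherwise the unaligned load (_mm_loadu_ps)
// is used. Calling _mm_load_ps unconditionally on an arbitrary pointer is
// the kind of error described in the list above.
inline float SumAnyAlignment(const float* src, std::size_t size)
{
    assert(src != nullptr || size == 0);
    float sum = 0.0f;
    std::size_t i = 0;
#if SKETCH_HAS_SSE
    const bool aligned = IsAligned(src);
    __m128 acc = _mm_setzero_ps();
    for (; i + 4 <= size; i += 4)
    {
        // src + i stays 16-byte aligned when src is, because i advances by 4 floats.
        const __m128 v = aligned ? _mm_load_ps(src + i) : _mm_loadu_ps(src + i);
        acc = _mm_add_ps(acc, v);
    }
    float lanes[4];
    _mm_storeu_ps(lanes, acc); // unaligned store: `lanes` has no alignment guarantee
    sum = lanes[0] + lanes[1] + lanes[2] + lanes[3];
#endif
    for (; i < size; ++i) // scalar tail (or the whole range without SSE)
        sum += src[i];
    return sum;
}
```

Passing a pointer offset by one element from an `alignas(16)` buffer exercises the unaligned path. On most current x86 processors the unaligned load costs little or nothing extra when the address happens to be aligned, so using it unconditionally is a common and simpler alternative.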
Renaming
- Function SynetSoftmaxLayerForward to SynetSoftmax32f.
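For readers updating call sites after this rename, a scalar reference can be handy for checking results before and after the switch. The sketch below is a plain softmax over the middle axis of a tensor laid out as [outer, count, inner]. That layout, the parameter order, and the name `ReferenceSoftmax32f` are assumptions made for illustration; they are not taken from these release notes and this is not the library's implementation.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>

// Hypothetical reference routine (not a Simd API).
// Assumed layout: src and dst hold outer * count * inner floats, and the
// softmax is taken over the `count` axis for every (outer, inner) pair.
inline void ReferenceSoftmax32f(const float* src, std::size_t outer, std::size_t count,
                                std::size_t inner, float* dst)
{
    if (outer == 0 || count == 0 || inner == 0)
        return; // nothing to normalize
    assert(src != nullptr && dst != nullptr);
    for (std::size_t o = 0; o < outer; ++o)
    {
        const float* s = src + o * count * inner;
        float* d = dst + o * count * inner;
        for (std::size_t i = 0; i < inner; ++i)
        {
            // Subtract the maximum first so that exp() cannot overflow.
            float maxValue = s[i];
            for (std::size_t c = 1; c < count; ++c)
                maxValue = std::max(maxValue, s[c * inner + i]);
            float sum = 0.0f;
            for (std::size_t c = 0; c < count; ++c)
            {
                const float e = std::exp(s[c * inner + i] - maxValue);
                d[c * inner + i] = e;
                sum += e;
            }
            for (std::size_t c = 0; c < count; ++c)
                d[c * inner + i] /= sum;
        }
    }
}
```

Comparing this reference against the optimized function with a small tolerance, rather than for exact equality, is the safer check, since vectorized exponent approximations usually differ in the last bits.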
Test framework
New features
- Tests for verifying functionality of class SynetGatherElements.
- Tests for verifying functionality of function SynetNormalizeLayerForward16bV2.
- Tests for verifying functionality of function SynetSoftmax16b.
Improving
- Add thread-safe state to functions Test::Rand and Test::Srand.
- Add smoothing to function Test::CreateTestImage.
Bug fixing
- Wrong parsing of 'testStatistics' command line option.
- Wrong parsing of 'testRepeats' command line option.
- Overly long GitHub Actions test step in msbuild.yml.
- Overly long GitHub Actions test step in cmake.yml.
Documentation
Bug fixing
- Syntax and lexical errors in project documentation.