| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2025-11-10 | 1.6 kB | |
| Simd v6.2.155 source code.tar.gz | 2025-11-10 | 4.7 MB | |
| Simd v6.2.155 source code.zip | 2025-11-10 | 6.3 MB | |
| Totals: 3 Items | 11.0 MB | 0 | |
Algorithms
New features
- SSE4.1, AVX2, AVX-512BW optimizations of function SynetQuantizedScaleLayerForward.
- SSE4.1, AVX2, AVX-512BW optimizations of function SynetQuantizedPreluLayerForward.
- Arbitrary activation function in Base implementation of class SynetQuantizedConvolutionGemm.
- Arbitrary activation function in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8 optimizations of class SynetQuantizedConvolutionNhwcGemm.
- Arbitrary activation function in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8 optimizations of class SynetQuantizedConvolutionNhwcSpecV0.
- Arbitrary activation function in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI optimizations of class SynetQuantizedConvolutionNhwcDepthwiseV2.
- Arbitrary activation function in Base implementation, AVX-512VNNI optimizations of class SynetQuantizedConvolutionNhwcDepthwiseV3.
Improve
- AMX-BF16 optimizations of class SynetConvolution16bNhwcGemm (case of small srcC).
Bug fixing
- Performance bug in AMX-INT8 optimizations of class SynetQuantizedConvolutionNhwcGemm.
- Error in SSE4.1, AVX2, AVX-512BW, AVX-512VNNI optimizations of class SynetQuantizedInnerProductGemmNN.
- Error in SSE4.1 optimizations of class SynetQuantizedConvolutionNhwcSpecV0.
- Error in Base implementation of class SynetQuantizedConvolutionNhwcSpecV0.
- Error in Base implementation of class SynetQuantizedConvolutionNhwcGemm.