Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
README.md | 2025-08-01 | 1.1 kB | |
Simd v6.2.152 source code.tar.gz | 2025-08-01 | 4.6 MB | |
Simd v6.2.152 source code.zip | 2025-08-01 | 6.2 MB | |
Totals: 3 Items | 10.8 MB | 1 |
Algorithms
New features
- AVX2, AVX-512BW optimizations of class SynetQuantizedAddUniform.
- Base implementation of class SynetQuantizedInnerProductRef.
- Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8 optimizations of class SynetQuantizedInnerProductGemmNN.
- Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8 optimizations of class SynetQuantizedConvolutionNhwcSpecV0.
- Base implementation, SSE4.1, AVX2 optimizations of class SynetQuantizedConvolutionNhwcDepthwise.
Improve
- AMX-INT8 optimizations of class SynetQuantizedConvolutionNhwcGemm.
Bug fixing
- Error in NEON optimization of function Float32ToBFloat16.
- Error in Base implementation of class SynetQuantizedConvolutionNhwcGemm.
- Error in Base implementation of class SynetQuantizedConvolutionGemm.
Test framework
New features
- Tests for verifying functionality of SynetQuantizedInnerProduct framework.