| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| rocm-python-wheels-Linux.zip | 2026-02-03 | 284.1 MB | |
| rocm-python-wheels-Windows.zip | 2026-02-03 | 137.0 MB | |
| README.md | 2026-02-03 | 1.8 kB | |
| v4.7.0 source code.tar.gz | 2026-02-03 | 3.4 MB | |
| v4.7.0 source code.zip | 2026-02-03 | 3.6 MB | |
| Totals: 5 Items | 428.1 MB | 0 | |
New features
- Introduce AMD GPU support with ROCm HIP (#1989) @sssshhhhhh
- Compatibility with Transformers v5 (#1999) by @jordimas
Fixes and improvements
- Assume less about whisper vocab (#2000) by @sssshhhhhh
- Use LLVM ThreadSanitizer instead of Google (#1993) by @3manifold
- Optimize all builds with parallel execution (#1992) by @3manifold
- Remove unecessary zero init from conv1d (#1990) by @sssshhhhhh
- Integrate Clang AddressSanitizer in tests (#1903) by @3manifold
- Enable multiple of 16 padding for INT8 Tensor Cores (#1982) by @Purfview
- Add activation and dilation to conv1d (#1979) by @sssshhhhhh
- Minor refactor to CMakeLists.txt (#1980) by @sssshhhhhh
- Remove unnecessary check from wav2vec2 (#1977) by @plan9better
- Add optional residual add to gemm op (#1975) by @sssshhhhhh
- Implement cuda layernorm axis (#1971) by @sssshhhhhh
- Fix Eole conversion (#1998) by @vince62s
- Gemma 3 conversion improvements (#1991) by @sssshhhhhh
- Add causal flag to fa2 (#1976) by @sssshhhhhh
- Fixes cross attention tests and refactors code (#1974) by @jordimas
- Fix CUDA bf16 median filter (#1972) by @sssshhhhhh
- Fix various compiler warnings (#1970) by @sssshhhhhh