From: Michal B. (TAU) <mic...@tu...> - 2021-07-16 13:33:23
|
Hello, __private double8' (vector of 8 'double' values) without 'avx512f' enabled changes the ABI Can you make sure that LLVM inside the Conda environment doesn't misdetect the CPU ? Can be checked with llc command from LLVM package. Running llc --version should print something like: Optimized build. Default target: x86_64-pc-linux-gnu Host CPU: znver1 Finally, after all these warnings the code segfaults, which I did not see in the distribution provided pocl version. This definitely shouldn't happen even with the warnings. code can be generated that includes the type double16, but is never actually called as function in the kernel. Using wider vectors than CPU has should work properly, so i think that's fine. Can you try looking into doc/sphinx/source/debug.rst document, the "Debugging with GDB" section has a guide on compiling & running non-optimized -O0 version of your kernel. If it doesn't crash with -O0 code, it could be a bug somewhere in pocl's LLVM code. Regards, Michal |