Menu

#54 NEON GEMM for 3.9.x

developer
closed-accepted
5
2012-05-24
2011-12-14
Tom Wallace
No

Adds a single precision NEON GEMM kernel (SGEMM+CGEMM) to 3.9.x. Note that NEON is not fully IEEE 754 compliant, and so should be used only when specifically requested. This patch does not contain the changes to configure necessary to accomplish this, and will use NEON wherever it works when applied.

Discussion

  • Tom Wallace

    Tom Wallace - 2011-12-16

    The first patch failed to properly update scases.flg and ccases.flg to use the NEON kernel because the author of the patch had failed to consume sufficient caffeine to activate his brain. The corrected patch is neon-3.9.x-corrected.diff.

     
  • Tom Wallace

    Tom Wallace - 2011-12-16

    Corrected NEON patch

     
  • R. Clint Whaley

    R. Clint Whaley - 2012-01-16

    Tom,

    I notice you submitted your kernel with a non-standard license header that you hacked up. I'd like to use the standard ATLAS one instead. I attach what that would look like below, let me know if this is OK with you.

    Thanks,
    Clint

     
  • R. Clint Whaley

    R. Clint Whaley - 2012-01-16
     
  • Tom Wallace

    Tom Wallace - 2012-01-17

    Yes, that's the same file as the one I've been distributing, so it still has my header. I'll get you a revised copy next week when I get a few minutes. I also hope to have a few more NEON kernels ready in the next month or so.

     
  • R. Clint Whaley

    R. Clint Whaley - 2012-01-18

    Tom,

    This is currently blocking the next developer list. Can I just swap your header with the one I posted below? Otherwise, I will remove it so I am free to do the next release.

    Thanks,
    Clint

     
  • Tom Wallace

    Tom Wallace - 2012-01-18

    Please swap the header; sorry for delaying things.

     
  • R. Clint Whaley

    R. Clint Whaley - 2012-05-24
    • status: open --> closed-accepted
     

Log in to post a comment.