Skip to content
Snippets Groups Projects

Add -march=core-avx2 for AMD CPUs compiled using ICC

Merged Peter W. Draper requested to merge intel-amd-optimization into master

Speeds up self-gravity on my EAGLE_50 tests using Intel/2018 plus:

./configure --with-tbbmalloc --with-parmetis --enable-debug

full steps go from 155 down to 105 (x2 in log message included some I/O). These are MPI runs on a single node using all the cores (4x16=64).

Only known to work for EPYC at Durham, but patterns for other AMDs that claim AVX2 support are included. See:

https://www.amd.com/system/files/documents/compiler-options-guide-amd-epyc-7xx1-series-processors.pdf

Merge request reports

Loading
Loading

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
Please register or sign in to reply
Loading