MPI + affinity
We need some thinking about this issues. The current code does not allow for good pinning when combined with MPI.
Might be the reason for the rather poorer scaling seen on cosma-5 (that has hyper-threading) compared to the other systems (that don't).
@alepper You may have some experience or thoughts to share here.