Use MPI communicators to reduce the use of tags.
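It was pointed out at an IPCC meeting with Intel that each MPI communicator has its own independent set of tags, so messages sent on different communicators never match each other even when their tags are identical.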
As a result, we can reduce the maximum tag number by a factor of 6 by using one communicator per subtype.
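
The tag arithmetic behind this can be sketched roughly as follows. This is an illustration only, not the code in this merge request: `NUM_SUBTYPES`, `struct msg_route` and both functions are hypothetical names, the count of six subtypes is inferred from the factor of 6 above, and the way the subtype was previously folded into the tag is an assumption.

```c
#include <assert.h>

/* Assumption: six message subtypes, inferred from the factor of 6. */
#define NUM_SUBTYPES 6

/* Hypothetical description of how one message is routed. */
struct msg_route {
  int comm_index; /* Index of the communicator to send/receive on. */
  int tag;        /* Tag passed to the MPI send/receive call. */
};

/* Single shared communicator: the subtype has to be encoded in the tag,
 * so the largest tag grows with the number of subtypes. */
struct msg_route route_shared_comm(int base_tag, int subtype) {
  assert(base_tag >= 0);
  assert(subtype >= 0 && subtype < NUM_SUBTYPES);
  struct msg_route r = {0, base_tag * NUM_SUBTYPES + subtype};
  return r;
}

/* One communicator per subtype: the communicator identifies the subtype,
 * so the tag only needs to carry the base tag. */
struct msg_route route_per_subtype_comm(int base_tag, int subtype) {
  assert(base_tag >= 0);
  assert(subtype >= 0 && subtype < NUM_SUBTYPES);
  struct msg_route r = {subtype, base_tag};
  return r;
}
```

With a base tag of 1000 and subtype 5, the shared-communicator scheme needs tag 6005, while the per-subtype scheme needs only tag 1000 on communicator 5. In an actual MPI code the per-subtype communicators could be created once at start-up, for example by calling `MPI_Comm_dup` on `MPI_COMM_WORLD` once per subtype.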
Addresses part of the issues reported in #443 (closed).
Edited by Matthieu Schaller