Reduce the amount of swaps needed to sort the particles into the correct progeny, should be faster and scale better.