Parallel particle sorting
The previous particle (and gpart) sort relied on using OpenMP tasks. These have been removed in !28 (merged). Since the particle sorting is now probably the heaviest serial bit in the code, it would be nice to parallelize it with the current scheduler.