Change the level at which the sort tasks are set
This introduces the two changes I mentioned this morning:
- Move the
stars resort
to a deeper level. - Move the hydro and stars
sort
tasks to two levels below the super-level
This increases parallelism and hence reduces dead time.
Merge request reports
Activity
added 1 commit
- fbb2faf4 - Also clear the sub-sort flag in the lower-level cells when aborting the recursion in the sorts
All my tests have run. @pdraper would you be able to also check that this did not break anything? Thanks!
added 1 commit
- c413c53e - Activate the drifts at the super-level alongside the sorts
Seeing a problem I've not come across for quite some time:
[0010] [00061.9] engine.c:engine_addlink():160: Link table overflow. Increase the value of `Scheduler:links_per_tasks`.
This is for EAGLE_50/256 running on 32 ranks of COSMA5 and fullish physics:
--with-subgrid=EAGLE
:mpirun -np $SLURM_NTASKS ../../swift_mpi --pin --cooling --star-formation --feedback --stars --cosmology --hydro --self-gravity -v 1 --threads=16 eagle_50.yml
Trying again with this value raised to 50 from 25.
So had to increase this value to 200, from 25, so a bit of a leap. That shows:
Nr. of links: 5082759 allocated links: 6889600 ratio: 0.737744 memory use: 105 MB.
So not that far above the actual requirement, also not a lot of memory, but I'm confused by all this, the actual reports suggest that a value of 2 should do:
Actual usage: tasks/cell: 1.363218 links/task: 1.957144
Which is about right as the number of tasks is about half the number of links. Going to look closely at that code.
I see, so we are using a different count of the no. tasks for these two uses. The first time we have a task count, but still have a lot of work to do creating more, but we have to start creating the links. Anyway the usage report is incorrect as that should use the first task count, not the final one. If we correct that we see:
Nr. of links: 6880308 allocated links: 13294000 ratio: 0.517550 memory use: 202 MB. Actual usage: tasks/cell: 1.517805 links/initial task: 103.509972
Not obviously sensible, but no longer misleading.