- 04 Aug, 2017 13 commits
-
-
James Willis authored
-
James Willis authored
-
James Willis authored
-
James Willis authored
-
James Willis authored
-
James Willis authored
-
James Willis authored
-
James Willis authored
Conflicts: src/hydro/Gadget2/hydro_iact.h src/vector.h
-
James Willis authored
Conflicts: src/kernel_hydro.h
-
James Willis authored
Use intrinsics for vector operations as arithmetic overloading is not supported by the Intel compiler yet for AVX-512 vectors. Conflicts: src/hydro/Gadget2/hydro_iact.h
-
James Willis authored
Use intrinsics for vector operations as arithmetic overloading is not supported by the Intel compiler yet. Updated masking operations with new macros so that AVX-512 masking is supported. Conflicts: src/kernel_hydro.h
-
James Willis authored
Conflicts: src/vector.h
-
James Willis authored
Use intrinsics for vector operations as arithmetic overloading is not supported by the Intel compiler yet.
-
- 03 Aug, 2017 24 commits
-
-
Matthieu Schaller authored
-
Matthieu Schaller authored
Updated version number to 0.6.0 to reflect the large number of changes to the threadpool, drifts, MPI and scaling performance in the last few months.
-
Matthieu Schaller authored
Periodic gravity speed and accuracy improvements See merge request !393
-
Matthieu Schaller authored
-
Matthieu Schaller authored
-
Matthieu Schaller authored
-
Matthieu Schaller authored
Threadpoolize all the things These are all the remaining function from the list in #338. I've tested this as far as possible on my laptop, i.e. MPI and gravity *should* work. See merge request !384
-
Matthieu Schaller authored
-
Peter W. Draper authored
-
Peter W. Draper authored
-
Matthieu Schaller authored
Merge branch 'threadpoolize_all_the_things' of gitlab.cosma.dur.ac.uk:swift/swiftsim into threadpoolize_all_the_things
-
Pedro Gonnet authored
-
Pedro Gonnet authored
-
Pedro Gonnet authored
-
Pedro Gonnet authored
-
Pedro Gonnet authored
convert engine_make_self_gravity_tasks to use a threadpool_map. still have to create the grav_ghost tasks sequentially, but i think that's ok since there are 128 times fewer of them.
-
Pedro Gonnet authored
-
Matthieu Schaller authored
Engine barrier Following the results in the `threadpool_task_plots` branch, I've replaced the elaborate hand-crafted `engine_barrier` function by two `pthread_barrier`s. As a result, the runner threads should all start, and synchronize, faster. See merge request !386
-
Matthieu Schaller authored
-
Matthieu Schaller authored
-
Pedro Gonnet authored
engine_launch is only ever called with all threads, remove the option to limit the number of running threads.
-
Pedro Gonnet authored
engine_launch is only ever called with all threads, remove the option to limit the number of running threads.
-
Pedro Gonnet authored
-
Pedro Gonnet authored
-
- 02 Aug, 2017 3 commits
-
-
Matthieu Schaller authored
-
Matthieu Schaller authored
-
Matthieu Schaller authored
Added the EAGLE_6 test case. This is a very small cosmological volume not representative of the workload of large runs.
-