Commit f7b953c8 authored by Pedro Gonnet

updated ompss results.

parent 2fccc5da
%!PS-Adobe-3.0 EPSF-3.0
%%Creator: MATLAB, The MathWorks, Inc. Version 8.0.0.783 (R2012b). Operating System: Linux 3.8.0-30-generic #44-Ubuntu SMP Thu Aug 22 20:52:24 UTC 2013 x86_64.
%%Title: figures/QR_scaling.eps
-%%CreationDate: 11/05/2013 22:03:16
+%%CreationDate: 11/12/2013 22:26:21
%%DocumentNeededFonts: Helvetica
%%DocumentProcessColors: Cyan Magenta Yellow Black
%%LanguageLevel: 2
@@ -271,14 +271,14 @@ gs 282 344 2822 2757 rc
45 -51 44 -38 45 -33 45 -40 45 -40 44 -51 45 -42 45 -40
45 -39 45 -43 44 -37 45 -44 45 -41 45 -43 44 -42 282 3056 64 MP stroke
DA
-45 -363 45 32 45 306 45 -48 44 -202 45 130 45 -215 45 165
-44 175 45 -238 45 -28 45 481 45 -484 44 81 45 291 45 -184
-45 -32 44 -27 45 -50 45 225 45 -266 45 132 44 -38 45 28
-45 -53 45 -36 44 113 45 -17 45 46 45 -14 45 -61 44 -13
-45 -25 45 65 45 -57 44 -6 45 -40 45 -12 45 -32 45 -10
-44 -52 45 -31 45 -29 45 -19 44 -72 45 7 45 -54 45 -30
-45 -29 44 -53 45 -21 45 -50 45 -25 44 -47 45 -23 45 -46
-45 -34 45 -41 44 -33 45 -46 45 -31 45 -40 44 -37 282 3063 64 MP stroke
+45 -42 45 94 45 -40 45 -78 44 51 45 -83 45 -27 45 40
+44 -85 45 -59 45 9 45 42 45 -110 44 57 45 -113 45 -15
+45 -13 44 4 45 -5 45 -49 45 -33 45 -5 44 -49 45 -7
+45 -12 45 -45 44 -17 45 -45 45 -10 45 -78 45 0 44 -31
+45 -26 45 3 45 -43 44 -24 45 -7 45 -50 45 -12 45 -40
+44 -36 45 -47 45 -12 45 -47 44 -32 45 -37 45 -16 45 -48
+45 -32 44 -38 45 -24 45 -51 45 -21 44 -49 45 -24 45 -43
+45 -37 45 -36 44 -33 45 -46 45 -31 45 -44 44 -31 282 3059 64 MP stroke
gr
24 w
@@ -383,14 +383,14 @@ gs 3573 344 5643 2757 rc
90 -34 89 13 90 35 89 11 90 11 89 -52 90 0 90 8
89 25 90 -12 89 55 90 -19 89 33 90 -20 89 26 3573 594 64 MP stroke
DA
-90 -319 90 42 89 305 90 -29 89 -181 90 145 89 -202 90 186
-89 203 90 -233 90 -13 89 556 90 -522 89 114 90 368 89 -193
-90 -12 89 -7 90 -37 90 326 89 -324 90 211 89 -18 90 76
-89 -41 90 -18 89 218 90 17 90 125 89 28 90 -53 89 33
-90 11 89 186 90 -46 89 57 90 -10 90 48 89 8 90 61
-89 -40 90 13 89 21 90 48 89 -106 90 136 90 -56 89 22
-90 32 89 -64 90 69 89 -60 90 61 89 -52 90 90 90 -60
-89 31 90 -31 89 44 90 -104 89 86 90 -52 89 5 3573 952 64 MP stroke
+90 -12 90 115 89 -10 90 -47 89 79 90 -53 89 3 90 72
+89 -57 90 -31 90 41 89 81 90 -90 89 101 90 -96 89 18
+90 21 89 43 90 34 90 -22 89 -3 90 38 89 -25 90 36
+89 31 90 -19 89 24 90 -19 90 38 89 -78 90 57 89 4
+90 13 89 71 90 -19 89 22 90 58 90 -31 89 54 90 -10
+89 -3 90 -29 89 63 90 -30 89 10 90 -3 90 69 89 -40
+90 15 89 -8 90 56 89 -62 90 77 89 -66 90 86 90 -38
+89 3 90 14 89 45 90 -104 89 105 90 -159 89 290 3573 733 64 MP stroke
gr
24 w
@@ -1195,16 +1195,20 @@ The timings are for {\tt qsched\_run}, including the cost of
Setting up the scheduler, tasks, and resources took, in all
cases, an average of 7.2\,ms.
-The same decomposition was implementing using OmpSs v.\,1.99.0,
+The same decomposition was implemented using OmpSs v.\,1.99.0,
calling the kernels directly using {\tt \#pragma omp task}
-annotations with the respective dependencies.
-The scaling and efficiency relative to QuickSched are
+annotations with the respective dependencies, and
+the runtime parameters
+\begin{quote}
+\tt --disable-yield --schedule=socket --cores-per-socket=16 --num-sockets=4
+\end{quote}
+\noindent The scaling and efficiency relative to QuickSched are
shown in \fig{QRResults} as well.
The difference in timings is the result of the different
task scheduling policies, as well as a smaller lag between the
individual tasks, as shown in \fig{QRTasks},
-for a smaller $1024\times 1024$ matrix on 16 cores of the
-same hardware.
+which shows the assignment of the different tasks to cores for the
+64 core run.
The most visible difference between both schedulers is that
the DGEQRF tasks (in red) are scheduled as soon as they
become available in QuickSched, thus preventing bottlenecks
@@ -1226,7 +1230,7 @@ near the end of the computation.
\centerline{\epsfig{file=figures/tasks_qr.pdf,width=0.9\textwidth}}
\centerline{\epsfig{file=figures/tasks_qr_ompss.pdf,width=0.9\textwidth}}
\caption{Task scheduling in QuickSched (above) and OmpSs (below)
-for a $1024\times 1024$ matrix on 16 cores.
+for a $2048\times 2048$ matrix on 64 cores.
The task colors correspond to those in \fig{QR}.}
\label{fig:QRTasks}
\end{figure}