Flipper performance
ScaLAPACK xdlu
TIME      N   NB    P    Q    LU Time   Sol Time   MFLOP/S  Residual  CHECK
----  -----  ---  ---  ---  ---------  ---------  --------  --------  -------
WALL  21000   40    2    8    1395.52       2.09   4418.03  0.000213  PASSED
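As a rough cross-check, the reported MFLOP/S figure can be approximated from N and the two timings in the row above. The flop count used here is an assumption, not something the output states: about 2/3 N^3 - 1/2 N^2 operations for the LU factorization plus 2 N^2 for the solve, divided by the combined LU and solve time. The function name is hypothetical.

```python
# Sketch: estimate the MFLOP/s figure from N and the reported timings.
# Assumption (not stated in the benchmark output): the rate counts roughly
# 2/3*N^3 - 1/2*N^2 flops for the factorization plus 2*N^2 for the solve,
# and divides by the sum of the LU and solve wall-clock times.

def lu_solve_mflops(n: int, lu_time_s: float, sol_time_s: float) -> float:
    """Return an estimated rate in MFLOP/s for an LU factor-and-solve run."""
    flops = (2.0 / 3.0) * n**3 - 0.5 * n**2 + 2.0 * n**2
    return flops / (lu_time_s + sol_time_s) / 1.0e6

# Values taken from the WALL row: N = 21000, LU Time = 1395.52, Sol Time = 2.09.
estimate = lu_solve_mflops(21000, 1395.52, 2.09)
print(f"estimated {estimate:.2f} MFLOP/s (reported: 4418.03)")
```

Under that assumption the estimate lands within 1 MFLOP/s of the reported value, which suggests the rate covers the factorization and the solve together.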
Streams2
Smallest time delta is 0.00999999978

   Size  Iter    FILL    COPY   DAXPY     SUM
     30     5   800.0  1600.0  2400.0   320.0  33333.0
     43     5   800.0  1600.0  2400.0   320.0  23255.5
     61     5   800.0  1600.0  2399.9   320.0  16393.0
     88     5   800.0  1600.0  2400.0   320.0  11363.5
    126     5   800.0  3200.0  2400.0   320.0   7936.5
    180     5   533.3  3200.0  2400.0   320.0   3703.7
    258     5   799.9  3199.7  2399.7   320.0   3875.5
    368     5   799.9  3199.5  2399.7   320.0   2717.0
    527     5   800.0  3199.9  2400.0   320.0   1897.5
    754     5   533.2  1599.7  2399.5   319.9    884.0
   1079     5   799.8  1599.5  1599.5   319.9    926.5
   1545     5   799.7   639.8   799.7   319.9    647.0
   2210     5   532.8   639.3   799.1   319.7    301.3
   3163     5   533.1   639.7   799.6   319.8    210.7
   4525     5   399.1   638.6   798.2   319.3    110.3
   6475     5   398.9   638.2   797.7   319.1     77.0
   9266     5   531.3   637.5   796.9   318.7     71.7
  13258     5   530.3   636.4   681.8   318.2     50.0
  18971     5   531.2   531.2   597.6   318.7     35.0
  27146     5   396.3   396.3   475.6   317.1     18.3
  38844     5   396.2   317.0   365.7   317.0     12.7
  55582     5   389.1   259.4   291.8   222.3      8.7
  79532     5   265.1   227.2   280.7   198.8      4.2
 113802     5   221.1   221.1   273.1   154.8      2.4
 162840     5   223.3   208.4   275.9   156.3      1.7
 233008     5   213.0   213.0   279.6   149.1      1.1
 333411     5   222.3   205.2   285.8   148.2      0.8
 477079     5   218.1   218.1   269.4   152.7      0.6
 682653     5   218.5   198.6   273.1   136.5      0.4
 976810     5   195.4   208.4   275.8   156.3      0.3
1397720     5   186.4   203.3   279.5   159.7      0.2
2000000     5   200.0   200.0   266.7   145.5      0.1
Bandwidth
Ping test (P0 -> P1) --- blocking standard, typesize=2 ---------

Length(bytes)  elapsed(s)  rate(Mbytes/s)    latency  iterations
            2    3.217843           0.326   6.137549      524288
            4    3.055362           0.686   5.827640      524288
            8    1.529763           1.371   5.835582      262144
           16    2.160834           1.941   8.242929      262144
           32    0.961611           4.362   7.336510      131072
           64    1.424276           5.890  10.866364      131072
          128    0.808042          10.381   0.000000       65536
          256    0.975313          17.202   0.000000       65536
          512    0.659568          25.437   0.000000       32768
         1024    0.785272          42.730   0.000000       32768
         2048    0.592653          56.617   0.000000       16384
         4096    0.977825          68.631   0.000000       16384
         8192    0.881250          76.152   0.000000        8192
        16384    0.822966          81.545   0.000000        4096
        32768    0.793670          84.555   0.000000        2048
        65536    0.784351          85.560   0.000000        1024
       131072    0.779129          86.133   0.000000         512
       262144    0.910226          73.728   0.000000         256
       524288    0.837323          80.147   0.000000         128
      1048576    0.818512          81.989   0.000000          64
      2097152    0.804363          83.431   0.000000          32
      4194304    0.797825          84.115   0.000000          16
      8388608    0.795604          84.350   0.000000           8
     16777216    0.790831          84.859   0.000000           4

Elapsed(s) 0.602128  Ping latency(us) for 0 byte message 5.742378
Elapsed(s) 0.601759  Ping latency(us) for 0 byte message 5.738853
Elapsed(s) 1.344159  Ping_Pong/2 latency(us) for 0 byte message 6.409487
Elapsed(s) 1.344054  Ping_Pong/2 latency(us) for 0 byte message 6.408986

Ping_Pong test (P0 -> P1 -> P0) --- blocking standard, typesize=2 ---------

Length(bytes)  elapsed(s)  rate(Mbytes/s)    latency  iterations
            4    7.444004           0.282  14.198311      524288
            8    7.447085           0.563  14.204187      524288
           16    3.756296           1.117  14.329132      262144
           32    5.134555           1.634  19.586772      262144
           64    2.439353           3.439  18.610787      131072
          128    3.705299           4.528   0.000000      131072
          256    2.106637           7.964   0.000000       65536
          512    2.538962          13.216   0.000000       65536
         1024    1.737593          19.311   0.000000       32768
         2048    2.176442          30.834   0.000000       32768
         4096    1.534133          43.744   0.000000       16384
         8192    2.508369          53.508   0.000000       16384
        16384    2.316788          57.933   0.000000        8192
        32768    2.199389          61.025   0.000000        4096
        65536    2.135401          62.854   0.000000        2048
       131072    2.115161          63.455   0.000000        1024
       262144    2.100988          63.883   0.000000         512
       524288    1.793060          74.854   0.000000         256
      1048576    1.691116          79.366   0.000000         128
      2097152    1.647841          81.451   0.000000          64
      4194304    1.611978          83.263   0.000000          32
      8388608    1.596738          84.057   0.000000          16
     16777216    1.588280          84.505   0.000000           8
     33554432    1.582374          84.820   0.000000           4
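
The ping output does not say how its rate and latency columns are derived. The sketch below shows one reading that appears consistent with the figures listed: rate as length times iterations divided by elapsed time, with 1 Mbyte taken as 10^6 bytes, and latency (on the small-message rows that report a nonzero value) as elapsed time per iteration in microseconds. Both formulas and the function names are assumptions inferred from the numbers, not taken from the benchmark source.

```python
# Sketch: recompute two columns of the ping output from the other three.
# Assumptions (inferred from the listed figures, not from the benchmark code):
#   rate    = length * iterations / elapsed, with 1 Mbyte = 1e6 bytes
#   latency = elapsed / iterations, expressed in microseconds

def rate_mbytes_per_s(length_bytes: int, iterations: int, elapsed_s: float) -> float:
    """Bytes moved per second, in units of 1e6 bytes."""
    return length_bytes * iterations / elapsed_s / 1.0e6

def latency_us(iterations: int, elapsed_s: float) -> float:
    """Average time per iteration in microseconds."""
    return elapsed_s / iterations * 1.0e6

# Ping test, 16777216-byte row: 4 iterations in 0.790831 s (reported rate 84.859).
ping_rate = rate_mbytes_per_s(16777216, 4, 0.790831)

# Ping test, 2-byte row: 524288 iterations in 3.217843 s (reported latency 6.137549).
ping_latency = latency_us(524288, 3.217843)

# Ping_Pong test, 33554432-byte row: 4 iterations in 1.582374 s (reported rate 84.820).
pingpong_rate = rate_mbytes_per_s(33554432, 4, 1.582374)

print(f"ping rate      {ping_rate:.3f} Mbytes/s")
print(f"ping latency   {ping_latency:.6f} us")
print(f"ping-pong rate {pingpong_rate:.3f} Mbytes/s")
```

If this reading is right, the large-message rate of roughly 84 to 86 Mbytes/s in both tests is the sustained point-to-point bandwidth, and the small-message rows are dominated by the per-message latency of a few microseconds.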
Updated: 2024-11-01, 13:56