Gyro Timing Results on Teragrid

Application: Gyro

Platform: Teragrid at NCSA. | User Guide NCSA's TeraGrid cluster consists of 887 IBM cluster nodes: 256 nodes with dual 1.3 GHz Intel® Itanium® 2 processors (Phase 1) and 631 nodes with dual 1.5 GHz Intel® Itanium® 2 processors (Phase 2).

Simulation: B3-gtc

UnInstrumented version results:

Timing.out files: 64 | 128 | 192 | 256 | 320 | 384 | 448 | 512

Batch output files: 64 | 128 | 192 | 256 | 320 | 384 | 448 | 512

 

Graphs from the data extracted from the above results:

 

The consistent format being used for all the timing results:

Alternative representations:

 

 

The data that was used to generate the above graphs:(This is from the timing results for the first 50 timesteps; As can be seen from the scalability graph,there is no significant difference between the cumulative times for the first 50 and the next 50 timesteps)

  64 128 192 256 320 384 448 512
NL 6.34E+02 2.34E+02 1.59E+02 1.67E+02 9.79E+01 1.05E+02 9.34E+01 6.14E+01
NL_tr 4.66E+02 3.12E+02 2.59E+02 8.47E+01 9.35E+01 9.55E+01 6.97E+01 5.42E+01
Coll 0.00E+00 0.00E+00 0.00E+00 0.00E+00 0.00E+00 0.00E+00 0.00E+00 0.00E+00
Coll_tr 5.45E-01 2.49E-01 1.67E-01 8.67E-02 6.88E-02 3.49E-02 1.74E-02 1.54E-02
lin_RHS 4.67E+01 2.31E+01 1.60E+01 1.16E+01 9.37E+00 7.94E+00 7.21E+00 5.83E+00
field 5.53E+01 4.15E+01 2.24E+01 3.49E+01 2.14E+01 1.54E+01 1.99E+01 3.80E+01
extras 1.07E+01 4.67E+00 3.37E+00 2.35E+00 2.33E+00 6.06E+00 1.28E+00 1.27E+00
I/O 1.75E+00 1.12E+00 1.02E+00 8.50E-01 7.67E-01 7.52E-01 6.57E-01 7.41E-01

 

Processors

Timesteps/sec

Total Time

64

2.44E+03

0.04100041

128

1.23E+03

0.081234768

192

9.22E+02

0.108412836

256

6.03E+02

0.165727544

320

4.58E+02

0.218483723

384

4.48E+02

0.223463687

448

3.87E+02

0.258331181

512

3.23E+02

0.309693404