CPU2006 license: | 4 | Test date: | May-2009 |
---|---|---|---|
Test sponsor: | SGI | Hardware Availability: | Mar-2009 |
Tested by: | SGI | Software Availability: | Feb-2009 |
Hardware | |
---|---|
CPU Name: | Intel Xeon X5570 |
CPU Characteristics: | Quad Core, 2.93 GHz Intel Turbo Boost Technology up to 3.33 GHz |
CPU MHz: | 2933 |
FPU: | Integrated |
CPU(s) enabled: | 16 cores, 4 chips, 4 cores/chip, 2 threads/core |
CPU(s) orderable: | 1,2 chips per blade, 2-16384 blades |
Primary Cache: | 32 KB I + 32 KB D on chip per core |
Secondary Cache: | 256 KB I+D on chip per core |
L3 Cache: | 8 MB I+D on chip per chip |
Other Cache: | None |
Memory: | 96 GB (2 x 12*4GB DDR3-1066 CL7 RDIMMs) |
Disk Subsystem: | 13 TB Lustre Parallel Filesystem 1 Metadata Server and 6 Object Storage Servers 96 x 136 GB SAS (Seagate Cheetah 15000 rpm) |
Other Hardware: | None |
Software | |
---|---|
Operating System: | SUSE Linux Enterprise Server 10 (x86_64) SP2 with patch Linux kernel 20080917, Kernel 2.6.16.60-0.30-smp |
Compiler: | Intel C++ and Fortran Compiler 11.0 for Linux Build 20090131 Package ID: l_cproc_p_11.0.080, l_cprof_p_11.0.080 |
Auto Parallel: | No |
File System: | lustre v1.6.7 over DDR Infiniband |
System State: | Multi-user, run level 3 |
Base Pointers: | 64-bit |
Peak Pointers: | 32/64-bit |
Other Software: | SGI ProPack 6 for Linux Service Pack 2 Binutils 2.18.50.0.7.20080502 |
Benchmark | Base | Peak | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
410.bwaves | 32 | 1268 | 343 | 1273 | 342 | 1273 | 342 | 16 | 622 | 349 | 621 | 350 | 621 | 350 |
416.gamess | 32 | 1541 | 406 | 1542 | 406 | 1545 | 406 | 16 | 765 | 410 | 766 | 409 | 766 | 409 |
433.milc | 32 | 939 | 313 | 938 | 313 | 938 | 313 | 32 | 941 | 312 | 941 | 312 | 941 | 312 |
434.zeusmp | 32 | 713 | 408 | 707 | 412 | 713 | 408 | 32 | 703 | 414 | 677 | 430 | 704 | 413 |
435.gromacs | 32 | 584 | 391 | 583 | 392 | 580 | 394 | 32 | 566 | 404 | 569 | 401 | 563 | 406 |
436.cactusADM | 32 | 854 | 448 | 855 | 447 | 861 | 444 | 32 | 885 | 432 | 890 | 430 | 910 | 420 |
437.leslie3d | 32 | 1233 | 244 | 1238 | 243 | 1234 | 244 | 16 | 616 | 244 | 616 | 244 | 616 | 244 |
444.namd | 32 | 709 | 362 | 701 | 366 | 702 | 366 | 32 | 690 | 372 | 687 | 373 | 691 | 371 |
447.dealII | 32 | 651 | 562 | 645 | 568 | 647 | 566 | 32 | 602 | 608 | 622 | 588 | 604 | 606 |
450.soplex | 32 | 1003 | 266 | 1004 | 266 | 1005 | 266 | 16 | 474 | 281 | 474 | 282 | 474 | 281 |
453.povray | 32 | 320 | 532 | 320 | 531 | 320 | 532 | 32 | 269 | 633 | 267 | 639 | 266 | 640 |
454.calculix | 32 | 569 | 464 | 572 | 461 | 574 | 460 | 32 | 577 | 457 | 576 | 458 | 580 | 455 |
459.GemsFDTD | 32 | 1572 | 216 | 1571 | 216 | 1572 | 216 | 16 | 768 | 221 | 769 | 221 | 769 | 221 |
465.tonto | 32 | 773 | 407 | 782 | 402 | 781 | 403 | 32 | 769 | 409 | 740 | 425 | 745 | 422 |
470.lbm | 32 | 2053 | 214 | 2053 | 214 | 2053 | 214 | 16 | 990 | 222 | 991 | 222 | 991 | 222 |
481.wrf | 32 | 862 | 415 | 862 | 414 | 865 | 413 | 32 | 862 | 415 | 862 | 414 | 865 | 413 |
482.sphinx3 | 32 | 1615 | 386 | 1618 | 386 | 1618 | 385 | 32 | 1542 | 404 | 1543 | 404 | 1548 | 403 |
The config file option 'submit' was used. A submit.pl script was used to distribute benchmark copies across the 2 blades and to pin processes to cores using dplace. Each blade runs a separate instance of the operating system.
Adjacent cache line prefetch enabled System has 2 blades with 2 chips/blade.
icc |
icpc |
ifort |
icc ifort |
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
-xSSE4.2 -ipo -O3 -no-prec-div -static |
-xSSE4.2 -ipo -O3 -no-prec-div -static |
-xSSE4.2 -ipo -O3 -no-prec-div -static |
-xSSE4.2 -ipo -O3 -no-prec-div -static |
icc | |
482.sphinx3: | icc -m32 |
icpc | |
450.soplex: | icpc -m32 |
ifort | |
437.leslie3d: | ifort -m32 |
icc ifort |
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
433.milc: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -fno-alias |
470.lbm: | -xSSE4.2 -ipo -O3 -no-prec-div -static -opt-prefetch -auto-ilp32 |
482.sphinx3: | -xSSE4.2 -ipo -O3 -no-prec-div -static -unroll2 |
444.namd: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -fno-alias -auto-ilp32 |
447.dealII: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -ansi-alias -scalar-rep- |
450.soplex: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -opt-malloc-options=3 |
453.povray: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll4 -ansi-alias |
410.bwaves: | -xSSE4.2 -ipo -O3 -no-prec-div -static -opt-prefetch |
416.gamess: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -Ob0 -ansi-alias -scalar-rep- |
434.zeusmp: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) |
437.leslie3d: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -opt-malloc-options=3 -opt-prefetch |
459.GemsFDTD: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -Ob0 -opt-prefetch |
465.tonto: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll4 -auto |
435.gromacs: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -opt-prefetch -auto-ilp32 |
436.cactusADM: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -opt-prefetch -auto-ilp32 |
454.calculix: | -xSSE4.2 -ipo -O3 -no-prec-div -static -auto-ilp32 |
481.wrf: | basepeak = yes |