SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

IBM Corporation

IBM BladeCenter HX5 (Intel Xeon E7-4820)

CPU2006 license: 11
Test date: May-2011
Test sponsor: IBM Corporation
Hardware Availability: May-2011
Tested by: IBM Corporation
Software Availability: Jan-2011
Hardware
CPU Name: Intel Xeon E7-4820
CPU Characteristics: Intel Turbo Boost Technology up to 2.26 GHz
CPU MHz: 2000
FPU: Integrated
CPU(s) enabled: 32 cores, 4 chips, 8 cores/chip, 2 threads/core
CPU(s) orderable: 1,2,3,4 chips
Primary Cache: 32 KB I + 32 KB D on chip per core
Secondary Cache: 256 KB I+D on chip per core
L3 Cache: 18 MB I+D on chip per chip
Other Cache: None
Memory: 256 GB (32 x 8 GB 4Rx8 PC3-8500R-7, ECC, running at 978 MHz)
Disk Subsystem: 4 x 50 GB SSD, RAID 0
Other Hardware: None
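
As a quick consistency check on the hardware figures above, the following sketch multiplies out the CPU topology and the DIMM population. The variable names are illustrative and not part of the SPEC report; the only inputs are values listed in the Hardware section.

```python
# Values copied from the Hardware section of this report.
chips = 4
cores_per_chip = 8
threads_per_core = 2
dimm_count = 32
dimm_size_gb = 8

cores = chips * cores_per_chip                # 32 cores enabled
hardware_threads = cores * threads_per_core   # 64 hardware threads
memory_gb = dimm_count * dimm_size_gb         # 256 GB installed

print(cores, hardware_threads, memory_gb)
```

The 64 hardware threads match the 64 copies used for every benchmark in the results table.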
Software
Operating System: SUSE Linux Enterprise Server 11 SP1 (x86_64), Kernel 2.6.32.12-0.7-default
Compiler: Intel C++ and Fortran Intel 64 Compiler XE for applications running on Intel 64, Version 12.0.1.116 Build 20101116
Auto Parallel: No
File System: ext3
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Results appear in the order in which they were run. The median of each set of three measurements is the reported value (marked in bold underlined text in the original report).
Columns, left to right: Benchmark; Base Copies, then three Seconds/Ratio pairs; Peak Copies, then three Seconds/Ratio pairs.
410.bwaves 64 1919 453 1928 451 1916 454 64 1930 451 1919 453 1924 452
416.gamess 64 2357 532 2376 527 2362 531 64 2326 539 2322 540 2327 539
433.milc 64 1183 496 1184 496 1184 496 64 1136 517 1136 517 1137 517
434.zeusmp 64 1045 557 1050 555 1050 555 64 1045 557 1050 555 1050 555
435.gromacs 64 915 499 915 500 908 503 64 911 501 915 500 917 498
436.cactusADM 64 1358 563 1355 564 1355 564 64 1358 563 1355 564 1355 564
437.leslie3d 64 1844 326 1850 325 1853 325 64 1844 326 1850 325 1853 325
444.namd 64 1089 471 1092 470 1090 471 64 1060 484 1065 482 1067 481
447.dealII 64 819 894 821 892 824 888 64 819 894 821 892 824 888
450.soplex 64 1468 364 1469 363 1470 363 64 1387 385 1387 385 1388 384
453.povray 64 472 722 471 723 472 721 64 391 871 389 875 391 871
454.calculix 64 865 611 865 610 864 611 64 865 611 865 610 864 611
459.GemsFDTD 64 2186 311 2187 310 2183 311 64 2186 311 2187 310 2183 311
465.tonto 64 1021 617 1013 621 1018 619 64 1005 627 1000 630 997 632
470.lbm 64 1792 491 1792 491 1790 491 64 1279 687 1279 688 1280 687
481.wrf 64 1215 589 1218 587 1218 587 64 1215 589 1218 587 1218 587
482.sphinx3 64 2424 515 2423 515 2423 515 64 2320 538 2321 537 2321 538
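
The table can be summarized along the following lines: take the median of the three ratios for each benchmark, then combine medians with a geometric mean, which is how SPEC CPU2006 forms its overall metrics. This is a minimal sketch over three rows copied from the table, so the printed geometric mean covers only that subset and is not the published overall score. The function and variable names are illustrative.

```python
import math
from statistics import median

# Base and peak ratios for three benchmarks, copied from the table above.
ratios = {
    "410.bwaves": {"base": [453, 451, 454], "peak": [451, 453, 452]},
    "453.povray": {"base": [722, 723, 721], "peak": [871, 875, 871]},
    "470.lbm":    {"base": [491, 491, 491], "peak": [687, 688, 687]},
}

def median_ratio(values):
    """Return the median of the measured ratios for one benchmark."""
    return median(values)

def geometric_mean(values):
    """Geometric mean, computed via logarithms."""
    return math.exp(sum(math.log(v) for v in values) / len(values))

for name, runs in ratios.items():
    base = median_ratio(runs["base"])
    peak = median_ratio(runs["peak"])
    print(f"{name}: base {base}, peak {peak}, peak/base {peak / base:.2f}")

# Geometric mean over this three-benchmark subset only (not the full suite).
subset_base = geometric_mean([median_ratio(r["base"]) for r in ratios.values()])
print(f"geometric mean of base medians (subset): {subset_base:.1f}")
```

Extending the dictionary to all 17 benchmarks would give an estimate of the overall base and peak figures, subject to the rounding already present in the table.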

Submit Notes

The config file option 'submit' was used.
 numactl was used to bind copies to the cores
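
The report does not give the actual submit command, so the following is only a sketch of what binding copies to cores with numactl might look like. It assumes copy N is pinned to logical CPU N and that memory is allocated on the local node; both the mapping and the choice of `--localalloc` are assumptions, and `./benchmark_exe` is a hypothetical placeholder.

```python
import shlex

def bind_command(copy_number, benchmark_argv):
    """Build a numactl command line pinning one copy to one logical CPU.

    Assumption: copy N is pinned to logical CPU N. The report does not
    state the actual mapping or the numactl options that were used.
    """
    return ["numactl", f"--physcpubind={copy_number}", "--localalloc", *benchmark_argv]

# "./benchmark_exe" is a hypothetical placeholder, not a path from the report.
for copy in range(4):  # the run itself used 64 copies
    print(shlex.join(bind_command(copy, ["./benchmark_exe"])))
```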

Operating System Notes

 'ulimit -s unlimited' was used to set the stacksize to unlimited prior to run
 echo 1 > /proc/sys/vm/zone_reclaim_mode
 'mount -t hugetlbfs nodev /mnt/hugepages' was used to enable large pages
 echo 28800 > /proc/sys/vm/nr_hugepages
 export HUGETLB_MORECORE=yes
 export LD_PRELOAD=/usr/lib64/libhugetlbfs.so
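
To get a feel for how much memory the nr_hugepages setting reserves, the sketch below multiplies the page count by a page size. The report does not state the huge page size; 2 MiB is assumed here because it is the usual default on x86_64 Linux.

```python
nr_hugepages = 28800      # from the echo command above
hugepage_size_mib = 2     # assumption: default x86_64 huge page size

reserved_gib = nr_hugepages * hugepage_size_mib / 1024
print(f"{reserved_gib:.2f} GiB reserved for huge pages")
```

Under that assumption, the reservation is 56.25 GiB of the 256 GB installed.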

Platform Notes

 BIOS Settings:
 Turbo Boost Power Optimization set to Traditional

General Notes

 Binaries were compiled on RHEL5.5

Base Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks:

 icpc -m64 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

C++ benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

Fortran benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static 

Benchmarks using both Fortran and C:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

Peak Compiler Invocation

C benchmarks (except as noted below):

 icc -m64 
482.sphinx3:  icc -m32 

C++ benchmarks (except as noted below):

 icpc -m64 
450.soplex:  icpc -m32 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 

Peak Optimization Flags

C benchmarks:

433.milc:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static   -auto-ilp32 
470.lbm:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-malloc-options=3   -ansi-alias   -opt-prefetch   -static   -auto-ilp32 
482.sphinx3:  -xSSE4.2   -ipo   -O3   -no-prec-div   -unroll2 

C++ benchmarks:

444.namd:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -fno-alias   -auto-ilp32 
447.dealII:  basepeak = yes 
450.soplex:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-malloc-options=3   -B /usr/share/libhugetlbfs/ -Wl,-hugetlbfs-link=BDT 
453.povray:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll4   -ansi-alias   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 

Fortran benchmarks:

410.bwaves:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static 
416.gamess:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll2   -inline-level=0   -scalar-rep-   -static 
434.zeusmp:  basepeak = yes 
437.leslie3d:  basepeak = yes 
459.GemsFDTD:  basepeak = yes 
465.tonto:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll4   -auto   -inline-calloc   -opt-malloc-options=3   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 

Benchmarks using both Fortran and C:

435.gromacs:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-prefetch   -static   -auto-ilp32 
436.cactusADM:  basepeak = yes 
454.calculix:  basepeak = yes 
481.wrf:  basepeak = yes 
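
The "(pass 1)" and "(pass 2)" annotations describe a two-pass, profile-guided build: the first pass compiles with -prof-gen to produce an instrumented binary, a training run collects a profile, and the second pass recompiles with -prof-use. The sketch below splits one annotated flag line into per-pass flag lists. It assumes that flags without an annotation apply to both passes; the function name and regular expression are illustrative and not part of the SPEC tools.

```python
import re

# A flag, optionally followed by a "(pass N)" annotation.
FLAG = re.compile(r"(\S+?)(?:\(pass (\d)\))?(?=\s|$)")

def split_passes(flag_line):
    """Return {1: [...], 2: [...]} with the flags used in each pass.

    Assumption: a flag with no "(pass N)" annotation is used in both passes.
    """
    passes = {1: [], 2: []}
    for flag, pass_no in FLAG.findall(flag_line):
        targets = [int(pass_no)] if pass_no else [1, 2]
        for p in targets:
            passes[p].append(flag)
    return passes

# Peak flags for 410.bwaves, copied from the list above.
bwaves = ("-xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) "
          "-no-prec-div(pass 2) -prof-use(pass 2) -static")
for p, flags in split_passes(bwaves).items():
    print(f"pass {p}: ifort -m64 {' '.join(flags)}")
```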

The flags files that were used to format this result can be browsed at:
http://www.spec.org/cpu2006/flags/Intel-ic12.0-linux64-revB.html
http://www.spec.org/cpu2006/flags/IBM-platform-linux64-revA.html

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/Intel-ic12.0-linux64-revB.xml
http://www.spec.org/cpu2006/flags/IBM-platform-linux64-revA.xml