SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Supermicro

Supermicro A+ Server 4022G-6F
(H8DG6-F, AMD Opteron 6212)

CPU2006 license: 001176 Test date: Jun-2012
Test sponsor: Supermicro Hardware Availability: Nov-2011
Tested by: Supermicro Software Availability: Dec-2011
Benchmark results graph
Hardware
CPU Name: AMD Opteron 6212
CPU Characteristics: AMD Turbo CORE technology up to 3.20 GHz
CPU MHz: 2600
FPU: Integrated
CPU(s) enabled: 16 cores, 2 chips, 8 cores/chip
CPU(s) orderable: 1,2 chips
Primary Cache: 256 KB I on chip per chip,
64 KB I shared / 2 cores;
16 KB D on chip per core
Secondary Cache: 8 MB I+D on chip per chip, 2 MB shared / 2 cores
L3 Cache: 16 MB I+D on chip per chip, 8 MB shared / 4 cores
Other Cache: None
Memory: 128 GB (16 x 8 GB 2Rx4 PC3-12800R-11, ECC)
Disk Subsystem: 1 x 500 GB SATA, 7200 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 6.2,
Kernel 2.6.32-220.el6.x86_64
Compiler: C/C++/Fortran: Version 4.5.1 of x86 Open64
Compiler Suite (from AMD)
Auto Parallel: No
File System: ext4
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 16 820 265 819 266 820 265 16 741 293 741 293 741 293
416.gamess 16 1833 171 1827 171 1830 171 16 1741 180 1742 180 1746 179
433.milc 16 595 247 594 247 594 247 16 511 288 511 288 511 287
434.zeusmp 16 649 224 647 225 652 223 16 634 230 642 227 631 231
435.gromacs 16 573 199 572 200 572 200 16 432 264 435 263 436 262
436.cactusADM 16 700 273 703 272 701 273 16 755 253 655 292 656 292
437.leslie3d 16 800 188 800 188 799 188 16 783 192 792 190 785 192
444.namd 16 731 175 729 176 731 176 16 727 177 720 178 730 176
447.dealII 16 475 386 472 388 473 387 16 453 404 452 405 452 405
450.soplex 16 621 215 618 216 620 215 16 605 220 605 221 605 221
453.povray 16 382 223 381 223 379 225 16 343 248 343 248 344 247
454.calculix 16 455 290 452 292 453 291 16 435 304 437 302 440 300
459.GemsFDTD 16 869 195 867 196 871 195 16 818 208 818 208 818 208
465.tonto 16 817 193 815 193 815 193 16 754 209 753 209 755 209
470.lbm 16 718 306 717 307 725 303 16 719 306 707 311 706 311
481.wrf 16 656 272 650 275 646 277 16 650 275 654 273 644 278
482.sphinx3 16 1254 249 1254 249 1254 249 16 1139 274 1138 274 1141 273

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst

Set vm/nr_hugepages=14336 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "896"
LD_LIBRARY_PATH = "/home/test/cpu2006/amd1104-rate-libs-revC/32:/home/test/cpu2006/amd1104-rate-libs-revC/64"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Binaries were compiled on a system with 2x AMD Opteron 6274 chips + 64GB Memory using RHEL 6.1

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LP64   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=bdver1   -Ofast   -OPT:malloc_alg=1   -HP:bd=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso 

C++ benchmarks:

 -march=bdver1   -Ofast   -static   -CG:load_exe=0   -OPT:malloc_alg=1   -INLINE:aggressive=on   -HP:bd=2m:heap=2m   -D__OPEN64_FAST_SET 

Fortran benchmarks:

 -march=bdver1   -Ofast   -LNO:blocking=off   -OPT:rsqrt=2   -OPT:unroll_size=256   -HP:bd=2m:heap=2m   -mso 

Benchmarks using both Fortran and C:

 -march=bdver1   -Ofast   -OPT:malloc_alg=1   -HP:bd=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso   -LNO:blocking=off   -OPT:rsqrt=2   -OPT:unroll_size=256 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LP64   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=bdver1   -Ofast   -CG:movnti=1   -CG:locs_best=on   -HP:bdt=2m:heap=2m   -IPA:plimit=7000   -IPA:callee_limit=1200   -OPT:struct_array_copy=2   -OPT:alias=field_sensitive   -mso 
470.lbm:  -march=bdver1   -Ofast   -CG:cmp_peep=on   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -OPT:keep_ext=on   -HP:bdt=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso 
482.sphinx3:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -IPA:plimit=1000   -OPT:malloc_alg=2   -CG:cmp_peep=on   -CG:local_sched_alg=2   -CG:p2align=0   -INLINE:aggressive=on   -LNO:prefetch=2   -LNO:prefetch_ahead=4   -mso 

C++ benchmarks:

444.namd:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -IPA:plimit=3000   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=bdver1   -Ofast   -D__OPEN64_FAST_SET   -static   -INLINE:aggressive=on   -LNO:opt=0   -LNO:simd=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -CG:movext_icmp=off   -TENV:frame_pointer=off 
450.soplex:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:ignore_feedback=off   -INLINE:aggressive=on   -OPT:RO=1   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -fno-exceptions   -CG:p2align=0   -m32   -HP:bdt=2m:heap=2m   -WOPT:sib=on 
453.povray:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -CG:pre_local_sched=off   -CG:p2align=0   -CG:p2align_split=on   -CG:dsched=on   -INLINE:aggressive=on   -HP:bd=2m:heap=2m   -OPT:transform=2   -OPT:alias=disjoint   -WOPT:aggcm=0 

Fortran benchmarks:

410.bwaves:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:Ofast   -OPT:treeheight=on   -LNO:blocking=off   -LNO:ignore_feedback=off   -LNO:fu=4   -LNO:loop_model_simd=on   -LNO:simd_rm_unity_remainder=on   -WOPT:aggstr=0   -HP:bdt=2m:heap=2m   -CG:cmp_peep=on 
416.gamess:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:simd=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -OPT:unroll_times_max=2   -CG:local_sched_alg=1   -HP:bdt=2m:heap=2m   -WOPT:sib=on 
434.zeusmp:  -march=bdver1   -Ofast   -LNO:blocking=off   -LNO:interchange=off   -IPA:plimit=1500   -HP:bdt=2m:heap=2m 
437.leslie3d:  -march=bdver1   -Ofast   -CG:pre_minreg_level=2   -LNO:simd=0   -LNO:fusion=2   -HP:bdt=2m:heap=2m   -mso 
459.GemsFDTD:  -march=bdver1   -Ofast   -IPA:plimit=1500   -OPT:unroll_size=0   -LNO:fission=2   -CG:load_exe=0   -CG:local_sched_alg=2   -HP 
465.tonto:  -march=bdver1   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP:bdt=2m:heap=2m 

Benchmarks using both Fortran and C:

435.gromacs:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m   -CG:local_sched_alg=2   -GRA:unspill=ON   -CG:load_exe=3   -LNO:simd=3 
436.cactusADM:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:blocking=off   -LNO:prefetch=2   -HP   -CG:locs_shallow_depth=1   -CG:load_exe=0   -CG:dsched=on   -WOPT:sib=on 
454.calculix:  -march=bdver1   -Ofast   -OPT:unroll_size=256   -GRA:optimize_boundary=on   -CG:dsched=on   -HP:bdt=2m:heap=2m 
481.wrf:  -march=bdver1   -Ofast   -LNO:blocking=off   -LANG:copyinout=off   -IPA:callee_limit=5000   -GRA:prioritize_by_density=on   -CG:load_exe=1   -HP   -WOPT:sib=on 

The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-451-flags-rate-revC-I.html.

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/x86-open64-451-flags-rate-revC-I.xml.