PPEP: ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK
|
|
- Neal Grant
- 5 years ago
- Views:
Transcription
1 PPEP: ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK BO SU JUNLI GU LI SHEN WEI HUANG JOSEPH L. GREATHOUSE ZHIYING WANG NUDT AMD RESEARCH DECEMBER 17, 2014
2 BACKGROUND Dynamic Voltage and Frequency Scaling (DVFS) Widely employed Boost performance, lower power, and improve energy efficiency 2 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
3 BACKGROUND Dynamic Voltage and Frequency Scaling (DVFS) Widely employed Boost performance, lower power, and improve energy efficiency Good DVFS decisions require Accurate performance & power predictions across Voltage Frequency (VF) states 3 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
4 BACKGROUND Dynamic Voltage and Frequency Scaling (DVFS) Widely employed Boost performance, lower power, and improve energy efficiency Good DVFS decisions require Accurate performance & power predictions across Voltage Frequency (VF) states Challenges Brought by Modern Processors Cores and north bridge Different clock domains and power planes Off-chip Memory Own clock domain 4 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
5 TWO QUESTIONS Q1: How does the application s performance change with the VF state? Q2: How does the application s power change with the VF state? 5 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
6 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS Processor: AMD FX nd Generation Family 15h Piledriver FX-8320 CPU 6 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
7 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS CU CU Processor: AMD FX nd Generation Family 15h Piledriver 4 CUs (Compute Units) 2 Cores in a CU 5 VF (Voltage-Frequency) states FX-8320 CPU CU CU 7 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
8 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS Processor: AMD FX nd Generation Family 15h Piledriver 4 CUs (Compute Units) 2 Cores in a CU 5 VF (Voltage-Frequency) states The NB (North Bridge) Shared by all CUs L3$ and memory controller CU North Bridge CU FX-8320 CPU CU CU 8 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
9 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS Processor: AMD FX nd Generation Family 15h Piledriver 4 CUs (Compute Units) 2 Cores in a CU 5 VF (Voltage-Frequency) states The NB (North Bridge) Shared by all CUs L3$ and memory controller Power Measurement Pololu ACS711: Current sensor Power Supply CU North Bridge CU FX-8320 CPU ACS711 CU CU ASUS M5A97 9 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
10 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS CU CU Processor: AMD FX nd Generation Family 15h Piledriver 4 CUs (Compute Units) 2 Cores in a CU 5 VF (Voltage-Frequency) states The NB (North Bridge) Shared by all CUs L3$ and memory controller Power Measurement Pololu ACS711: Current sensor Arduino: Power reading Power Supply North Bridge CU FX-8320 CPU ACS711 CU ASUS M5A97 Arduino 10 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
11 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS CU CU Processor: AMD FX nd Generation Family 15h Piledriver 4 CUs (Compute Units) 2 Cores in a CU 5 VF (Voltage-Frequency) states The NB (North Bridge) Shared by all CUs L3$ and memory controller Power Measurement Pololu ACS711: Current sensor Arduino: Power reading Benchmark Suites Power Supply North Bridge FX-8320 CPU ACS711 SPEC CPU 2006, PARSEC, NAS Parallel Arduino Benchmarks CU CU ASUS M5A97 11 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
12 OUTLINE Background Experimental Setup Performance Prediction Across DVFS States Power Prediction Across DVFS States PPEP Framework Conclusion Q1: How does the application s performance change with the VF state? 12 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
13 Normalized CPI HOW CPI CHANGES WITH FREQUENCY (LOWER IS BETTER) Normalized Frequency 13 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
14 Normalized CPI HOW CPI CHANGES WITH FREQUENCY (LOWER IS BETTER) Estimate #1 CPI does not change when frequency increases Normalized Frequency bzip2 14 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
15 Normalized CPI HOW CPI CHANGES WITH FREQUENCY (LOWER IS BETTER) Estimate #1 Estimate #2 CPI does not change when frequency increases. CPI also increases when frequency increases mcf Normalized Frequency bzip2 15 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
16 Normalized CPI HOW CPI CHANGES WITH FREQUENCY (LOWER IS BETTER) Estimate #1 Estimate #2 CPI does not change when frequency increases. CPI also increases when frequency increases mcf Normalized Frequency GAP bzip2 16 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
17 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM Core clock cycles 17 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
18 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. 18 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
19 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. 19 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
20 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. Cycles stalled on Memory Accessing. Scale with Core Frequency. 20 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
21 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. Core Frequency: f -> f Cycles stalled on Memory Accessing. Scale with Core Frequency. 21 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
22 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. Core Frequency: f -> f Does not scale. Does not scale. Cycles stalled on Memory Accessing. Scale with Core Frequency. 22 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
23 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. Core Frequency: f -> f GAP Does not scale. Does not scale. Scales by f /f. Cycles stalled on Memory Accessing. Scale with Core Frequency. 23 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
24 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. Core Frequency: f -> f GAP Does not scale. Does not scale. Scales by f /f. Cycles stalled on Memory Accessing. Scale with Core Frequency. Key: Memory Stall Cycles estimation 24 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
25 Core clock cycles Core L3+DRAM 25 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
26 Core clock cycles Core L3+DRAM 26 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
27 Core clock cycles Core L3+DRAM 27 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
28 Core clock cycles Core L3+DRAM 28 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
29 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Core clock cycles Core L3+DRAM 29 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
30 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Core clock cycles Core L3+DRAM Leading Load 30 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
31 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Core clock cycles Core L3+DRAM Leading Load Not Leading 31 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
32 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Core clock cycles Core L3+DRAM Leading Load Leading Load Not Leading 32 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
33 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Memory Stall Cycle Core clock cycles Memory Stall Cycles Core L3+DRAM Leading Load Leading Load Not Leading Leading Loads Model: Memory stall cycles = cycles that leading loads are active 33 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
34 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Memory Stall Cycle Core clock cycles Memory Stall Cycles Core L3+DRAM Leading Load Leading Load Not Leading Leading Loads Model: Memory stall cycles = cycles that leading loads are active AMD hardware has MAB0 performance counter which can estimate Leading Loads cycles. [USENIX-ATC 14 Su+] 34 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
35 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles 35 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
36 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1 36 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
37 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E3 E1 37 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
38 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 E1 38 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
39 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 E1 39 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
40 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 40 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
41 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 41 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
42 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 E1-E3 42 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
43 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 E1-E3 E3*(f /f) 43 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
44 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 E1-E3 E3*(f /f) E1+E3*(f /f - 1) 44 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
45 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 E1-E3 E3*(f /f) Instruction number = E2 E1+E3*(f /f - 1) 45 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
46 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 E1-E3 E3*(f /f) Instruction number = E2 CPI(f ) = (E1+E3*(f /f - 1))/E2 E1+E3*(f /f - 1) 46 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
47 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 E1 E1-E3 E3*(f /f) E1+E3*(f /f - 1) Instruction number = E2 CPI(f) = E1/E2 LL-MAB CPI Predictor Instruction number = E2 CPI(f ) = (E1+E3*(f /f - 1))/E2 47 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
48 CPI PREDICTOR 3 HW events Step=200ms Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 E1 E1-E3 E3*(f /f) E1+E3*(f /f - 1) Instruction number = E2 CPI(f) = E1/E2 LL-MAB CPI Predictor Instruction number = E2 CPI(f ) = (E1+E3*(f /f - 1))/E2 48 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
49 CPI CPI PREDICTED CPI CURVE V.S. MEASURED CPI CURVE 1.7GHZ (VF2) -> 3.5GHZ (VF5) 5 Predicted CPI of 3.5GHz from Measurements Taken at 1.7 GHz (473.astar) % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 5 Measured CPI of 3.5GHz (473.astar) % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 49 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
50 CPI PREDICTION ERROR FX-8320 Processor All 52 benchmarks in SPEC, PARSEC, and NPB 1.7GHz (VF2) -> 3.5GHz (VF5) 10% 8% 1.7GHz->3.5GHz 15.5% 14.2% 6% 4% 3.0% 2% 0% 483.xalancbmk 482.sphinx3 481.wrf 473.astar 471.omnetpp 470.lbm 465.tonto 464.h264ref 462.libquantum 459.GemsFDTD 458.sjeng 456.hmmer 454.calculix 453.povray 450.soplex 447.dealII 445.gobmk 444.namd 437.leslie3d 436.cactusADM 435.gromacs 434.zeusmp 433.milc 429.mcf 416.gobmk 410.bwaves 403.gcc 401.bzip2 400.perlbench UA SP MG LU IS FT EP DC CG BT streamcluster dedup canneal x264 vips swaptions raytrace freqmine fluidanimate ferret facesim bodytrack blackscholes Average 50 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
51 OUTLINE Background Experimental Methodology Performance Prediction Across DVFS States Power Prediction Across DVFS States PPEP Framework Conclusion Q2: How does the application s power change with the VF state? 51 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
52 IDLE POWER MODEL + DYNAMIC POWER MODEL Cooling 52 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
53 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Power (Sampled Point) Temperature Cooling Step (200ms) PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
54 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Heating Power (Sampled Point) Temperature Cooling Step (200ms) PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
55 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Heating Power (Sampled Point) Temperature Cooling Step (200ms) PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
56 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Heating Power (Sampled Point) Temperature Cooling Step (200ms) PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
57 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Heating Power (Sampled Point) Temperature Relationship? Cooling Step (200ms) PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
58 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Heating Power (Sampled Point) Temperature Relationship? Cooling Step (200ms) VF State Voltage(V) Frequency(GHz) Error of P idle VF % VF % VF % VF % VF % 58 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
59 IDLE POWER MODEL + DYNAMIC POWER MODEL 59 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
60 IDLE POWER MODEL + DYNAMIC POWER MODEL # Event Code E4 Retired_uOPs 0xc1 E5 FPU_Pipe_Assignment 0x00 E6 L1I$_Fetches 0x80 E7 L1D$_Accesses 0x40 E8 L2$_Requests 0x7d E9 Retired_Branches 0xc2 E10 Retired_MisBranches 0xc3 E11 L2$_Misses 0x7e E12 Dispatch_Stalls 0xd1 (PS= Per-Second Count Value) 60 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
61 POWER MODEL VALIDATION EXPERIMENT Benchmark Combination Totally 152 combinations Thread number: 1, 2, 3, and 4 SPEC: single-/multi- programed PARSEC: single-/multi- threaded NPB: single-/multi- threaded Number SPEC PARSEC NPB Total 1-thread thread thread thread Total PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
62 POWER MODEL VALIDATION EXPERIMENT Benchmark Combination Totally 152 combinations Thread number: 1, 2, 3, and 4 SPEC: single-/multi- programed PARSEC: single-/multi- threaded NPB: single-/multi- threaded Number SPEC PARSEC NPB Total 1-thread thread thread thread Total fold cross validation Divide 152 benchmark combination in to 4 sets Train on all combinations of 3 sets Test on the remaining set 62 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
63 Validation Error of Chip Power POWER MODEL VALIDATION (LOWER IS BETTER) Error: 4.6% Standard Deviation: 2.8% 8% 7% 6% 5% 4% 3% 2% 1% Average Standard Deviation 0% ALL NPB PAR SPE ALL NPB PAR SPE ALL NPB PAR SPE ALL NPB PAR SPE ALL NPB PAR SPE VF5 VF4 VF3 VF2 VF1 63 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
64 PREDICTING POWER ACROSS VF STATES VF5 VF4 VF3 VF2 VF1 CPI(VF 5 ) 3 Events => CPI(VF 4 ) CPI(VF 3 ) CPI(VF 2 ) CPI(VF 1 ) LL-MAB T => P idle (VF 4 ) 9 Events => P dyn (VF 4 ) 64 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
65 PREDICTING POWER ACROSS VF STATES VF5 CPI(VF 5 ) P idle (VF 5 ) VF4 3 Events => CPI(VF 4 ) T => P idle (VF 4 ) 9 Events => P dyn (VF 4 ) VF3 CPI(VF 3 ) P idle (VF 3 ) VF2 CPI(VF 2 ) P idle (VF 2 ) VF1 CPI(VF 1 ) P idle (VF 1 ) LL-MAB 65 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
66 PREDICTING POWER ACROSS VF STATES VF5 CPI(VF 5 ) P idle (VF 5 ) VF4 3 Events => CPI(VF 4 ) T => P idle (VF 4 ) 9 Events => P dyn (VF 4 ) VF3 CPI(VF 3 ) P idle (VF 3 ) VF2 CPI(VF 2 ) P idle (VF 2 ) VF1 CPI(VF 1 ) P idle (VF 1 ) LL-MAB same T 66 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
67 PREDICTING POWER ACROSS VF STATES VF5 CPI(VF 5 ) P idle (VF 5 ) P dyn (VF 5 ) VF4 3 Events => CPI(VF 4 ) T => P idle (VF 4 ) 9 Events => P dyn (VF 4 ) VF3 CPI(VF 3 ) P idle (VF 3 ) P dyn (VF 3 ) VF2 CPI(VF 2 ) P idle (VF 2 ) P dyn (VF 2 ) VF1 CPI(VF 1 ) P idle (VF 1 ) P dyn (VF 1 ) LL-MAB same T 67 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
68 PREDICTING POWER ACROSS VF STATES VF5 VF4 VF3 VF2 VF1 CPI(VF 5 ) 3 Events => CPI(VF 4 ) P idle (VF 5 ) T => P idle (VF 4 ) P dyn (VF 5 ) 9 Events => P dyn (VF 4 ) CPI(VF 3 ) CPI(VF 2 ) CPI(VF 1 ) P idle (VF 3 ) P idle (VF 2 ) P idle (VF 1 ) P dyn (VF 3 ) P dyn (VF 2 ) P dyn (VF 1 ) LL-MAB same T HW event prediction? E PS(i) (VF5) E PS(i) (VF4) E PS(i) (VF3)? i=4,,12 E PS(i) (VF2) E PS(i) (VF1) 68 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
69 PREDICTING POWER ACROSS VF STATES VF5 VF4 VF3 VF2 VF1 CPI(VF 5 ) 3 Events => CPI(VF 4 ) P idle (VF 5 ) T => P idle (VF 4 ) P dyn (VF 5 ) 9 Events => P dyn (VF 4 ) CPI(VF 3 ) CPI(VF 2 ) CPI(VF 1 ) P idle (VF 3 ) P idle (VF 2 ) P idle (VF 1 ) P dyn (VF 3 ) P dyn (VF 2 ) P dyn (VF 1 ) LL-MAB same T HW event prediction? E PS(i) (VF5) E PS(i) (VF4) E PS(i) (VF3)? i=4,,12 E PS(i) (VF2) E PS(i) (VF1) 2 observations + LL-MAB CPI Predictor 69 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
70 OBSERVATION 1 + OBSERVATION 2 At any given point in the execution of a program, core-private HW event counts per instruction are independent of VF state. => To execute the same section of instructions in a program, the activities of core-private resources are independent of VF state. # Event Name E4 E5 E6 E7 E8 E9 E10 E11 Retired_uOPs FPU_Pipe_Assignment L1I$_Fetches L1D$_Accesses L2$_Requests Retired_Branches Retired_MisBranches L2$_Misses 70 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
71 OBSERVATION 1 + OBSERVATION 2 At any given point in the execution of a program, core-private HW event counts per instruction are independent of VF state. => To execute the same section of instructions in a program, the activities of core-private resources are independent of VF state. # Event Name E4 Retired_uOPs E5 FPU_Pipe_Assignment E6 L1I$_Fetches E7 L1D$_Accesses E8 L2$_Requests E9 Retired_Branches E10 Retired_MisBranches E11 L2$_Misses Error 0.6% 0.9% 0.7% 0.7% 5.0% 0.7% 1.3% 4.0% On FX-8320, between 3.5GHz (VF5) and 1.7GHz (VF2) 71 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
72 OBSERVATION 1 + OBSERVATION 2 # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls 72 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
73 OBSERVATION 1 + OBSERVATION 2 # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Gap Error 1.7% On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) 73 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
74 OBSERVATION 1 + OBSERVATION 2 # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Cycles retiring 4 instructions Cycles retiring 2 instructions Cycles without retiring Gap Error 1.7% Cycles retiring 3 instructions Cycles retiring 1 instruction On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) 74 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
75 OBSERVATION 1 + OBSERVATION 2 Freq = f # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Cycles retiring 4 instructions Cycles retiring 2 instructions Cycles without retiring Gap Error 1.7% Cycles retiring 3 instructions Cycles retiring 1 instruction On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) Start end Freq = f 75 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
76 OBSERVATION 1 + OBSERVATION 2 Freq = f # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Cycles retiring 4 instructions Cycles retiring 2 instructions Cycles without retiring Gap Error 1.7% Cycles retiring 3 instructions Cycles retiring 1 instruction On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) Start Frequency Independent end Freq = f 76 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
77 OBSERVATION 1 + OBSERVATION 2 Freq = f # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Cycles retiring 4 instructions Cycles retiring 2 instructions Cycles without retiring Gap Error 1.7% On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) Cycles retiring 3 instructions Cycles retiring 1 instruction Overheads of mispredicted branches and exceptions Start Frequency Independent end Freq = f 77 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
78 OBSERVATION 1 + OBSERVATION 2 Freq = f # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Cycles retiring 4 instructions Cycles retiring 2 instructions Cycles without retiring Gap Error 1.7% On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) Cycles retiring 3 instructions Cycles retiring 1 instruction Overheads of mispredicted branches and exceptions Dispatch Stalls Start Frequency Independent end Freq = f 78 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
79 DYNAMIC POWER PREDICTION FLOW HW Events Prediction Flow (Bridge: LL-MAB + 2 OBs) # Event E1 CPU_Clock_not_Halted E2 Retired_Instructions E3 MAB0_Wait_Cycles E4 Retired_uOPs E5 FPU_Pipe_Assignment E6 L1I$_Fetches E7 L1D$_Accesses E8 L2$_Requests E9 Retired_Branches E10 Retired_MisBranches E11 L2$_Misses E12 Dispatch_Stalls PS=per-second PI=per-instruction 79 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
80 DYNAMIC POWER PREDICTION FLOW HW Events Prediction Flow (Bridge: LL-MAB + 2 OBs) # Event E1 CPU_Clock_not_Halted From PMC E2 Retired_Instructions E3 MAB0_Wait_Cycles E4 Retired_uOPs E5 FPU_Pipe_Assignment E6 L1I$_Fetches E7 L1D$_Accesses E8 L2$_Requests E9 Retired_Branches E10 Retired_MisBranches E11 L2$_Misses E12 Dispatch_Stalls PS=per-second PI=per-instruction 80 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
81 DYNAMIC POWER PREDICTION FLOW HW Events Prediction Flow (Bridge: LL-MAB + 2 OBs) # Event E1 CPU_Clock_not_Halted From PMC E2 Retired_Instructions E3 MAB0_Wait_Cycles E4 Retired_uOPs E5 FPU_Pipe_Assignment E6 L1I$_Fetches E7 L1D$_Accesses E8 L2$_Requests E9 Retired_Branches E10 Retired_MisBranches E11 L2$_Misses E12 Dispatch_Stalls PS=per-second PI=per-instruction 81 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
82 DYNAMIC POWER PREDICTION FLOW HW Events Prediction Flow (Bridge: LL-MAB + 2 OBs) # Event E1 CPU_Clock_not_Halted From PMC E2 Retired_Instructions LL-MAB E3 MAB0_Wait_Cycles E4 Retired_uOPs E5 FPU_Pipe_Assignment E6 L1I$_Fetches E7 L1D$_Accesses E8 L2$_Requests E9 Retired_Branches E10 Retired_MisBranches E11 L2$_Misses E12 Dispatch_Stalls PS=per-second PI=per-instruction 82 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
83 DYNAMIC POWER PREDICTION FLOW HW Events # Event E1 E2 E3 E4 E5 E6 E7 E8 E9 E10 E11 E12 CPU_Clock_not_Halted Retired_Instructions MAB0_Wait_Cycles Retired_uOPs FPU_Pipe_Assignment L1I$_Fetches L1D$_Accesses L2$_Requests Retired_Branches Retired_MisBranches L2$_Misses Dispatch_Stalls PS=per-second PI=per-instruction Prediction Flow (Bridge: LL-MAB + 2 OBs) From PMC 83 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 LL-MAB Observation 1 Observation 2
84 DYNAMIC POWER PREDICTION FLOW HW Events # Event E1 E2 E3 E4 E5 E6 E7 E8 E9 E10 E11 E12 CPU_Clock_not_Halted Retired_Instructions MAB0_Wait_Cycles Retired_uOPs FPU_Pipe_Assignment L1I$_Fetches L1D$_Accesses L2$_Requests Retired_Branches Retired_MisBranches L2$_Misses Dispatch_Stalls PS=per-second PI=per-instruction Prediction Flow (Bridge: LL-MAB + 2 OBs) From PMC 84 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 LL-MAB Observation 1 Observation 2
85 DYNAMIC POWER PREDICTION FLOW HW Events # Event E1 E2 E3 E4 E5 E6 E7 E8 E9 E10 E11 E12 CPU_Clock_not_Halted Retired_Instructions MAB0_Wait_Cycles Retired_uOPs FPU_Pipe_Assignment L1I$_Fetches L1D$_Accesses L2$_Requests Retired_Branches Retired_MisBranches L2$_Misses Dispatch_Stalls PS=per-second PI=per-instruction Prediction Flow (Bridge: LL-MAB + 2 OBs) From PMC 85 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 LL-MAB Observation 1 Observation 2
86 Chip Power Prediction Error Across VF States PREDICTING ERROR OF AVERAGE CHIP POWER LOWER IS BETTER Error: 4.2% Standard Deviation: 3.6% 8% 7% 6% 5% 4% 3% 2% 1% Average Standard Deviation 0% VF1->VF1 VF1->VF2 VF1->VF3 VF1->VF4 VF1->VF5 VF2->VF1 VF2->VF2 VF2->VF3 VF2->VF4 VF2->VF5 VF3->VF1 VF3->VF2 VF3->VF3 VF3->VF4 VF3->VF5 VF4->VF1 VF4->VF2 VF4->VF3 VF4->VF4 VF4->VF5 VF5->VF1 VF5->VF2 VF5->VF3 VF5->VF4 VF5->VF5 86 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
87 OUTLINE Background Experimental Methodology Performance Prediction Across DVFS States Power Prediction Across DVFS States PPEP Framework Conclusion Q1: How does the application s performance change with the VF state? Q2: How does the application s power change with the VF state? 87 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
88 OUTLINE Background Experimental Methodology Performance Prediction Across DVFS States Power Prediction Across DVFS States PPEP Framework Conclusion 88 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
89 PPEP FRAMEWORK PMC based User Daemon Periodically Running All-in-one: PPEP PMC, VF 1 CPIs at all VFs Performance Prediction Model 2 CPI at all VFs Core 1's PPE Core n's PPE Dynamic Info: PMC, VF, Temperature 4 PMC VF Temperature, VF PMC at other VFs H/W Events Predictor 3 Dynamic Power Model Idle Power Model Dynamic Power at all VFs Energy Prediction Idle Power at all VFs 5 Core 0's PPE DVFS Exploring Space: Energy -Delay Scaling, Power Capping, etc. 6 VF Regulator DVFS Decision 89 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
90 PPEP FRAMEWORK PMC based User Daemon Periodically Running Performance Online Power Energy Prediction Commercial Processors All-in-one: PPEP PMC, VF 1 CPIs at all VFs Performance Prediction Model 2 CPI at all VFs Core 1's PPE Core n's PPE Dynamic Info: PMC, VF, Temperature 4 PMC VF Temperature, VF PMC at other VFs H/W Events Predictor 3 Dynamic Power Model Idle Power Model Dynamic Power at all VFs Energy Prediction Idle Power at all VFs 5 Core 0's PPE DVFS Exploring Space: Energy -Delay Scaling, Power Capping, etc. 6 VF Regulator DVFS Decision 90 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
91 CONCLUSION On Commercial Processor Demonstrated an across-vf CPI predictor LL-MAB According to the Leading Loads theory Demonstrated an across-vf power predictor Through the LL-MAB CPI predictor and 2 observations Combining them together: PPEP Framework Supplies performance, power, and energy information SW method w/o requiring HW or OS modification 91 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
92 THANK YOU! Questions 92 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
93 DISCLAIMER & ATTRIBUTION The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to product and roadmap changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes. AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION. AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. ATTRIBUTION 2014 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo and combinations thereof are trademarks of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. SPEC is a registered trademark of the Standard Performance Evaluation Corporation (SPEC). Other names are for informational purposes only and may be trademarks of their respective owners. 93 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
94 Backup Slides
95 PERFORMANCE & POWER PREDICTION Running VF state: VF4 Predicting performance, power of other VF states Core VF State Performance Power Higher VF 5 Running at: CPI(VF 5 ) Running at: Power(VF 5 ) VF 4 CPI(VF 4 ) Power(VF 4 ) VF 3 CPI(VF 3 ) Power(VF 3 ) VF 2 CPI(VF 2 ) Power(VF 2 ) Lower VF 1 Performance prediction? CPI(VF 1 ) CPU Power prediction? Power(VF 1 ) 95 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
96 EXPERIMENTAL METHODOLOGY SOFTWARE TOOLS Operating System Ubuntu LTS Desktop (kernel version ) Tools taskset: A2C mapping msr-tools: PMC control CPUFreq userspace governor: VF Scaling hwmon tree in sysfs: temperature measurement Benchmark Suites SPEC CPU 2006 v1.2 (29 benchmarks) PARSEC v2.1 (13 benchmarks) NAS Parallel Benchmarks v3.3.1 (10 benchmarks) 96 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
97 HOW TO ESTIMATE MEMORY STALL CYCLES? MODERN CORES MAKE THIS DIFFICULT Memory level parallelism Accesses overlap computation Variable latencies memory access counts do not estimate memory time accurately Core clock cycles Core Work L3+DRAM Variable Latencies Multiple Parallel Accesses Accesses Overlap Computation 97 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
98 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 98 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
99 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 99 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
100 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 100 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
101 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 101 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
102 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 102 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
103 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 103 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
104 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 104 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
105 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 105 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
106 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 106 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
107 MAB BASED LEADING LOADS MODEL: LL-MAB EXPERIMENT Benchmark Running on core 0. Manager Running on Core 7. Collects the counts every 200ms. CU 0 CU 1 CU 2 CU 3 Core 0 Core 1 Core 2 Core 3 Core 4 Core 5 Core 6 Core 7 Benchmark L2 Cache L2 Cache L2 Cache L2 Cache L3 Cache & Memory Controller Manager 200ms Core 0 Core 7 Measured CPI(f) CPI(f) CPI(f) CPI(f) CPI(f) CPI(f) CPI(f) CPI(f) Predicted CPI(f ) CPI(f ) CPI(f ) CPI(f ) CPI(f ) 107 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 CPI(f ) CPI(f ) CPI(f )
108 CPI CPI PREDICTED CPI V.S. MEASURED CPI 1.7GHZ -> 3.5GHZ 5 Predicted CPI of 3.5GHz from Measurements Taken at 1.7 GHz (473.astar) % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 5 Measured CPI of 3.5GHz (473.astar) % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 108 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
109 CPI CPI PREDICTED CPI V.S. MEASURED CPI 1.7GHZ -> 3.5GHZ % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) Measured CPI of 3.5GHz (473.astar) a) Divide each curve into 20 segments; Predicted CPI of 3.5GHz from Measurements Taken at 1.7 GHz (473.astar) 0 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 109 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 b) Compare cycle numbers in each segment-pair; c) Average absolute error => CPI Prediction Error.
110 CPI CPI PREDICTED CPI V.S. MEASURED CPI 1.7GHZ -> 3.5GHZ % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) Measured CPI of 3.5GHz (473.astar) a) Divide each curve into 20 segments; Predicted CPI of 3.5GHz from Measurements Taken at 1.7 GHz (473.astar) 0 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 110 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 b) Compare cycle numbers in each segment-pair; c) Average absolute error => CPI Prediction Error.
111 CPI CPI PREDICTED CPI V.S. MEASURED CPI 1.7GHZ -> 3.5GHZ % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) Measured CPI of 3.5GHz (473.astar) a) Divide each curve into 20 segments; Predicted CPI of 3.5GHz from Measurements Taken at 1.7 GHz (473.astar) 0 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 111 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 b) Compare cycle numbers in each segment-pair; c) Average absolute error => CPI Prediction Error.
112 CPI Prediction Error CPI PREDICTION ERROR V.S. MEMORYBOUNDEDNESS 1.7GHZ -> 3.5GHZ 20% 15% 10% 5% 0% Memoryboundedness (Cycles of 3.5G/Cycles of 1.7G) 112 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
113 CHIP IDLE POWER MODEL MODEL AND ERROR 113 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
114 CHIP DYNAMIC POWER MODEL EXPERIMENT App threads mapping & Manager CU 0 CU 1 CU 2 CU 3 Core 0 Core 1 Core 2 Core 3 Core 4 Core 5 Core 6 Core 7 App thread App thread App thread App thread Manager L2 Cache L2 Cache L2 Cache L2 Cache L3 Cache & Memory Controller Time Step = 200ms 1st half-step (100ms) 2nd half-step (100ms) power-step (20ms) x10 Manager sleeping Manager working 114 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
115 Validation Error of Dynamic Power FITTING ERROR: DYNAMIC POWER MODEL (LOWER IS BETTER) Error: 10.6% Standard Deviation: 5.8% 18% 16% 14% 12% 10% 8% 6% 4% 2% 0% Average ALL NPB PAR SPE Standard Deviation ALL NPB PAR SPE ALL NPB PAR SPE ALL NPB PAR SPE ALL NPB PAR SPE VF5 VF4 VF3 VF2 VF1 115 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
116 Dynamic Power Prediction Error Across VF States (LOWER IS BETTER) Error: 8.3% Standard Deviation: 6.9% 16% 14% Average Standard Deviation 12% 10% 8% 6% 4% 2% 0% VF1->VF1 VF1->VF2 VF1->VF3 VF1->VF4 VF1->VF5 VF2->VF1 VF2->VF2 VF2->VF3 VF2->VF4 VF2->VF5 VF3->VF1 VF3->VF2 VF3->VF3 VF3->VF4 VF3->VF5 VF4->VF1 VF4->VF2 VF4->VF3 VF4->VF4 VF4->VF5 VF5->VF1 VF5->VF2 VF5->VF3 VF5->VF4 VF5->VF5 116 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
117 THE IMPACT OF POWER GATING 117 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
118 THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle 118 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
119 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 119 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
120 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle P dyn (A)+P idle (Chip) 4CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 120 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
121 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle P dyn (A)+P idle (Chip) 3P dyn (A)+P idle (Chip) 4CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 121 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
122 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle P dyn (A)+P idle (Chip) 3P dyn (A)+P idle (Chip) 2P dyn (A)+P idle (Chip) 4CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 122 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
123 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle P dyn (A)+P idle (Chip) 3P dyn (A)+P idle (Chip) 2P dyn (A)+P idle (Chip) P dyn (A)+P idle (Chip) 4CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 123 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
124 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle P dyn (A)+P idle (Chip) 3P dyn (A)+P idle (Chip) 2P dyn (A)+P idle (Chip) P dyn (A)+P idle (Chip) P idle (Chip) 4CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 124 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014
PowerChop: Identifying and Managing Non-critical Units in Hybrid Processor Architectures
PowerChop: Identifying and Managing Non-critical Units in Hybrid Processor Architectures Michael A. Laurenzano, Yunqi Zhang, Jiang Chen, Lingjia Tang and Jason Mars Department of Electrical Engineering
More informationParallelism I: Inside the Core
Parallelism I: Inside the Core 1 The final Comprehensive Same general format as the Midterm. Review the homeworks, the slides, and the quizzes. 2 Key Points What is wide issue mean? How does does it affect
More informationPractical Resource Management in Power-Constrained, High Performance Computing
Practical Resource Management in Power-Constrained, High Performance Computing Tapasya Patki*, David Lowenthal, Anjana Sasidharan, Matthias Maiterth, Barry Rountree, Martin Schulz, Bronis R. de Supinski
More informationDrowsy Caches Simple Techniques for Reducing Leakage Power Krisztián Flautner Nam Sung Kim Steve Martin David Blaauw Trevor Mudge
Drowsy Caches Simple Techniques for Reducing Leakage Power Krisztián Flautner Nam Sung Kim Steve Martin David Blaauw Trevor Mudge krisztian.flautner@arm.com kimns@eecs.umich.edu stevenmm@eecs.umich.edu
More informationHigh Performance Cache Replacement Using Re-Reference Interval Prediction (RRIP)
High Performance Cache Replacement Using Re-Reference Interval Prediction (RRIP) 1 T H E A C M I E E E I N T E R N A T I O N A L S Y M P O S I U M O N C O M P U T E R A R C H I T E C T U R E ( I S C A
More informationHow Much Power Does your Server Consume? Estimating Wall Socket Power Using RAPL Measurements
How Much Power Does your Server Consume? Estimating Wall Socket Power Using RAPL Measurements Kashif Nizam Khan Zhonghong Ou, Mikael Hirki, Jukka K. Nurminen, Tapio Niemi 1 Motivation The Large Hadron
More informationEasy Weigh ADVANCED COUNTING SCALE MODEL: AC-100 OWNER S MANUAL VER 1.00
Easy Weigh OWNER S MANUAL MODEL: AC-100 ADVANCED COUNTING SCALE VER 1.00 TABLE OF CONTENTS SPECIFICATIONS... 1 NOMENCLATURE... 1 FUNCTION KEYS... 2 DISPLAY... 3 ACCURACY SPECIFICATIONS... 4 UNPACKING &
More informationUKSM: Swift Memory Deduplication via Hierarchical and Adaptive Memory Region Distilling
UKSM: Swift Memory Deduplication via Hierarchical and Adaptive Memory Region Distilling Nai Xia* Chen Tian* Yan Luo + Hang Liu + Xiaoliang Wang* *: Nanjing University +: University of Massachusetts Lowell
More informationAN RPM to TACH Counts Conversion. 1 Preface. 2 Audience. 3 Overview. 4 References
AN 17.4 RPM to TACH Counts Conversion 1 Preface 2 Audience 3 Overview 4 References This application note provides look up tables for the calculation of RPM to TACH Counts for use with the EMC2103, EMC2104,
More informationATN3580 Series: Fixed Attenuator Pads
DATA SHEET ATN3580 Series: Fixed Attenuator Pads Applications Attenuators Features Specified flat response to 40 GHz Return loss > 15 db Available at 1-10, 12, 15, 20, 30, and 40 db Power handling to 1
More informationAdvanced Superscalar Architectures. Speculative and Out-of-Order Execution
6.823, L16--1 Advanced Superscalar Architectures Asanovic Laboratory for Computer Science M.I.T. http://www.csg.lcs.mit.edu/6.823 Speculative and Out-of-Order Execution Branch Prediction kill kill Branch
More informationA Power-Aware, Application-Based, Performance Study Of Moder. Cluster Interconnection Networks
A Power-Aware, Application-Based, Performance Study Of Modern Commodity Cluster Interconnection Networks Open Systems Lab Indiana University Bloomington, USA CAC 09 - IPDPS 09 Rome, Italy May, 25th 2009
More informationINSTITUTO SUPERIOR TÉCNICO. Architectures for Embedded Computing
UNIVERSIDADE TÉCNICA DE LISBOA INSTITUTO SUPERIOR TÉCNICO Departamento de Engenharia Informática Architectures for Embedded Computing MEIC-A, MEIC-T, MERC Lecture Slides Version 3.0 - English Lecture 02
More informationZC706 MIG Design Creation November 2014
ZC706 MIG Design Creation November 2014 XTP244 Revision History Date Version Description 11/24/14 10.0 Regenerated for 2014.4. 10/08/14 9.0 Regenerated for 2014.3. 06/09/14 8.0 Regenerated for 2014.2.
More informationEnhancing Energy Efficiency of Database Applications Using SSDs
Seminar Energy-Efficient Databases 29.06.2011 Enhancing Energy Efficiency of Database Applications Using SSDs Felix Martin Schuhknecht Motivation vs. Energy-Efficiency Seminar 29.06.2011 Felix Martin Schuhknecht
More informationFixing the Hyperdrive: Maximizing Rendering Performance on NVIDIA GPUs
Fixing the Hyperdrive: Maximizing Rendering Performance on NVIDIA GPUs Louis Bavoil, Principal Engineer Booth #223 - South Hall www.nvidia.com/gdc Full-Screen Pixel Shader SM TEX L2 DRAM CROP SM = Streaming
More informationDecoupling Loads for Nano-Instruction Set Computers
Decoupling Loads for Nano-Instruction Set Computers Ziqiang (Patrick) Huang, Andrew Hilton, Benjamin Lee Duke University {ziqiang.huang, andrew.hilton, benjamin.c.lee}@duke.edu ISCA-43, June 21, 2016 1
More informationTEST REPORT. Test Report for Mating Cycle Validation of Mini Circuits QBL Series Quick-Lock Test Cables.
TITLE: Test Report for Mating Cycle Validation of Mini Circuits QBL Series Quick-Lock Test Cables. 1. Background & Introduction Mini-Circuits has introduced a QBL Series of test cables is based upon an
More informationLecture 14: Instruction Level Parallelism
Lecture 14: Instruction Level Parallelism Last time Pipelining in the real world Today Control hazards Other pipelines Take QUIZ 10 over P&H 4.10-15, before 11:59pm today Homework 5 due Thursday March
More informationFiorano ESB 2007 Oracle Enterprise Gateway Integration Guide
An Oracle White Paper June 2011 Fiorano ESB 2007 Oracle Enterprise Gateway Integration Guide 1 / 25 Disclaimer The following is intended to outline our general product direction. It is intended for information
More informationAPPLICATION NOTE. ATAK51003-V1 Quick Start Guide. Atmel ATAN0033. Kit Contents
APPLICATION NOTE ATAK51003-V1 Quick Start Guide Atmel ATAN0033 Kit Contents The Atmel ATAK51003-V1 Kit includes the following components: 1 Atmel ATAB5279-V1.2 LF antenna driver 2 LF antenna modules 1
More informationOut-of-order Pipeline. Register Read. OOO execution (2-wide) OOO execution (2-wide) OOO execution (2-wide) OOO execution (2-wide)
Out-of-order Pipeline Register Read When do instructions read the register file? Fetch Decode Rename Dispatch Buffer of instructions Issue Reg-read Execute Writeback Commit Option #: after select, right
More informationZC706 MIG Design Creation November 2015
ZC706 MIG Design Creation November 2015 XTP244 Revision History Date Version Description 11/24/15 14.0 Regenerated for 2015.4. 10/06/15 13.0 Regenerated for 2015.3. 06/30/15 12.0 Regenerated for 2015.2.
More informationMACP TB. 23 db Coupler MHz Rev. V1. Features. Functional Schematic. Description. Pin Configuration 4. Ordering Information 1,2
23 Coupler Features 23 Coupling Ratio Surface Mount Package Available on Tape and Reel Excellent Temperature Stability RoHS Compliant and Pb Free 260 C Reflow Compatible Description The MACP-0145 is a
More informationLecture 20: Parallelism ILP to Multicores. James C. Hoe Department of ECE Carnegie Mellon University
18 447 Lecture 20: Parallelism ILP to Multicores James C. Hoe Department of ECE Carnegie Mellon University 18 447 S18 L20 S1, James C. Hoe, CMU/ECE/CALCM, 2018 18 447 S18 L20 S2, James C. Hoe, CMU/ECE/CALCM,
More informationWHITE PAPER. Informatica PowerCenter 8 on HP Integrity Servers: Doubling Performance with Linear Scalability for 64-bit Enterprise Data Integration
WHITE PAPER Informatica PowerCenter 8 on HP Integrity Servers: Doubling Performance with Linear Scalability for 64-bit Enterprise Data Integration This document contains Confi dential, Proprietary and
More informationCS 152 Computer Architecture and Engineering
CS 152 Computer Architecture and Engineering Lecture 23 Synchronization 2006-11-16 John Lazzaro (www.cs.berkeley.edu/~lazzaro) TAs: Udam Saini and Jue Sun www-inst.eecs.berkeley.edu/~cs152/ 1 Last Time:
More informationCode Scheduling & Limitations
This Unit: Static & Dynamic Scheduling CIS 371 Computer Organization and Design Unit 11: Static and Dynamic Scheduling App App App System software Mem CPU I/O Code scheduling To reduce pipeline stalls
More information8-bit. Application Note. Microcontrollers. AVR601: Atmel Modular Evaluation Kits for Motor Control Applications. 1.
AVR601: Atmel Modular Evaluation Kits for Motor Control Applications 1. Introduction Following the success of the MC100 and MC200 motor control demonstration kits, Atmel is expanding the support for motor
More informationFinite Element Based, FPGA-Implemented Electric Machine Model for Hardware-in-the-Loop (HIL) Simulation
Finite Element Based, FPGA-Implemented Electric Machine Model for Hardware-in-the-Loop (HIL) Simulation Leveraging Simulation for Hybrid and Electric Powertrain Design in the Automotive, Presentation Agenda
More informationE-AMOM: AN ENERGY-AWARE MODELING AND OPTIMIZATION METHODOLOGY FOR SCIENTIFIC APPLICATIONS ON MULTICORE SYSTEMS
E-AMOM: AN ENERGY-AWARE MODELING AND OPTIMIZATION METHODOLOGY FOR SCIENTIFIC APPLICATIONS ON MULTICORE SYSTEMS A Dissertation by CHARLES WESLEY LIVELY III Submitted to the Office of Graduate Studies of
More informationARC-H: Adaptive replacement cache management for heterogeneous storage devices
Journal of Systems Architecture 58 (2012) ARC-H: Adaptive replacement cache management for heterogeneous storage devices Young-Jin Kim, Division of Electrical and Computer Engineering, Ajou University,
More informationGetting the Lead Out December, 2007
Getting the Lead Out December, 2007 Tom DeBonis Assembly & Test Technology Development Technology and Manufacturing Group Summary Intel has removed the lead (Pb) from its manufacturing process across its
More informationTABLE OF CONTENTS. 8.3 Online Sensor Specifications. Series TS (Models TS1, TSP, TSH, TSL,TSF, TSB1, TSB2) According to Electromatic factory procedure
8.3 Online Sensor Specifications Series TS (Models TS1, TSP, TSH, TSL,TSF, TSB1, TSB2) Calibration: Accuracy: Remainder of range and other calibration material Overload protection Measuring principle Measuring
More informationCooling System Simulation for Indian Utility Vehicle using COOL3D
Indian GT SUITE Conference 2013 Cooling System Simulation for Indian Utility Vehicle using COOL3D Paper presented by Rishabh Pandey, M&M Gopakishore Gummadi, M&M Copyright 2012 Mahindra & Mahindra Ltd.
More informationTECHNICAL BRIEF Americas
TECHNICAL BRIEF Americas Design Considerations When AC Coupling IQ Micros to Battery-Based Systems Overview AC coupling allows use of Enphase Microinverters with battery-based inverter systems. These applications
More informationChina Intelligent Connected Vehicle Technology Roadmap 1
China Intelligent Connected Vehicle Technology Roadmap 1 Source: 1. China Automotive Engineering Institute, , Oct. 2016 1 Technology Roadmap 1 General
More informationEE Architecture for Highly Electrified Powertrain
EE Architecture for Highly Electrified Powertrain 2020-2030 M. Gleich, Senior Manager Marketing and Business Development Powertrain - restricted - Context Resources, Pollution, Climate Urbanization Moore
More informationMulti Core Processing in VisionLab
Multi Core Processing in Multi Core CPU Processing in 25 August 2014 Copyright 2001 2014 by Van de Loosdrecht Machine Vision BV All rights reserved jaap@vdlmv.nl Overview Introduction Demonstration Automatic
More informationABB June 19, Slide 1
Dr Simon Round, Head of Technology Management, MATLAB Conference 2015, Bern Switzerland, 9 June 2015 A Decade of Efficiency Gains Leveraging modern development methods and the rising computational performance-price
More informationFlo-Way. Measure Granular or Powdered Material. Stand Alone or Connect to Other Devices for Blending. Used by Successful Producers World-Wide
Flo-Way R Solids Impact Flow Meter Measure Granular or Powdered Material Stand Alone or Connect to Other Devices for Blending Used by Successful Producers World-Wide Visit our website: www.beltwayscales.com
More informationFluke 438-II Power Quality and Motor Analyzer
Test Equipment Depot - 800.517.8431-99 Washington Street Melrose, MA 02176 - TestEquipmentDepot.com TECHNICAL DATA Fluke 438-II Power Quality and Motor Analyzer Quickly and easily discover electrical and
More informationStorage and Memory Hierarchy CS165
Storage and Memory Hierarchy CS165 What is the memory hierarchy? L1
More informationARM. series SIGNUM SYSTEMS CORPORATION. ARM Board Setup. User Guide
ARM series SIGNUM SYSTEMS CORPORATION ARM Board Setup User Guide C O P Y R I G H T N O T I C E Copyright (c) 2016 by Signum Systems Corporation, an IAR Systems company. All rights are reserved worldwide.
More informationValveLink SNAP-ON Application
AMS Device Manager Product Data Sheet ValveLink SNAP-ON Application Communicate with both HART and Foundation Fieldbus FIELDVUE digital valve controllers in the same application Online, in-service performance
More informationCIS 662: Sample midterm w solutions
CIS 662: Sample midterm w solutions 1. (40 points) A processor has the following stages in its pipeline: IF ID ALU1 MEM1 MEM2 ALU2 WB. ALU1 stage is used for effective address calculation for loads, stores
More informationThe MathWorks Crossover to Model-Based Design
The MathWorks Crossover to Model-Based Design The Ohio State University Kerem Koprubasi, Ph.D. Candidate Mechanical Engineering The 2008 Challenge X Competition Benefits of MathWorks Tools Model-based
More informationTID MEMORY MAPS FOR MONZA SELF-SERIALIZATION
Application Note IMPINJ MONZA TID MEMORY MAPS FOR MONZA SELF-SERIALIZATION Version 3.0 2017, Impinj, Inc. www.impinj.com TABLE OF CONTENTS 1 Introduction... 1 2 Monza Tag Chip Models... 2 2.1 Monza Self-Serialization
More informationStorage-less and converter-less maximum power tracking of photovoltaic cells for a nonvolatile microprocessor
Seoul National University Storage-less and converter-less maximum power tracking of photovoltaic cells for a nonvolatile microprocessor Cong Wang, Naehyuck Chang, Y. Kim, S. Park, Yongpan Liu, Hyung Gyu
More informationOracle Utilities Customer Care and Billing Release Utility Reference Model Process Customer Request for Net Metering
Oracle Utilities Customer Care and Billing Release 2.3.1 Utility Reference Model 3.4.3.2 Process Customer Request for Net Metering July 2012 Oracle Utilities Customer Care and Billing Utility Reference
More informationEnergy Efficient Content-Addressable Memory
Energy Efficient Content-Addressable Memory Advanced Seminar Computer Engineering Institute of Computer Engineering Heidelberg University Fabian Finkeldey 26.01.2016 Fabian Finkeldey, Energy Efficient
More informationPower Consumption Reduction: Hot Spare
Power Consumption Reduction: Hot Spare A Dell technical white paper Mark Muccini Wayne Cook Contents Executive summary... 3 Introduction... 3 Traditional power solutions... 3 Hot spare... 5 Hot spare solution...
More informationComputer Architecture: Out-of-Order Execution. Prof. Onur Mutlu (editted by Seth) Carnegie Mellon University
Computer Architecture: Out-of-Order Execution Prof. Onur Mutlu (editted by Seth) Carnegie Mellon University Reading for Today Smith and Sohi, The Microarchitecture of Superscalar Processors, Proceedings
More informationBLAST CAPACITY ASSESSMENT AND TESTING A-60 OFFSHORE FIRE DOOR
BLAST CAPACITY ASSESSMENT AND TESTING Final Report December 11, 2008 A-60 OFFSHORE FIRE DOOR Prepared for: JRJ Alum Fab, Inc. Prepared by: Travis J. Holland Michael J. Lowak John R. Montoya BakerRisk Project
More informationAluminum Electrolytic Capacitors Radial Very Low Impedance
Aluminum Electrolytic Capacitors Radial Very Low Impedance FEATURES Very low impedance and low ESR Very long useful life: 4000 h to 0 000 h at 05 C, very high reliability Excellent ripple current capability
More informationPipelining A B C D. Readings: Example: Doing the laundry. Ann, Brian, Cathy, & Dave. each have one load of clothes to wash, dry, and fold
Pipelining Readings: 4.5-4.8 Example: Doing the laundry Ann, Brian, Cathy, & Dave A B C D each have one load of clothes to wash, dry, and fold Washer takes 30 minutes Dryer takes 40 minutes Folder takes
More informationElectro-Proportional Terms and Definitions
Electro-Proportional Terms and Definitions Valve Deadband The span of operation where there is no flow or pressure output for some specified range of command Hydraulic Valve Gain The characteristic relating
More informationOvercoming Motor Driver Heat-Transfer Challenges in Robotic Applications
Overcoming Motor Driver Heat-Transfer Challenges in Robotic Applications Jason D. Carr, M.S. and David W. Sundin, Ph.D. Page 1 This paper is protected by copyright law. Individuals may reproduce and distribute
More informationDataCore Virtualization Appliance
DataCore Virtualization Appliance Fujitsu-DataCore SVA November 2018 The Fujitsu Storage Virtualization Appliance (SVA) is a complete hardware solution for DataCore s SANsymphony SAN virtualization product.
More informationXC95288 In-System Programmable CPLD
R 0 XC95288 In-System Programmable CPLD 0 5 Product Specification Features 15 ns pin-to-pin logic delays on all pins f CNT to 95 MHz 288 macrocells with 6,400 usable gates Up to 166 user pins 5V in-system
More informationFor questions or technical support, 1. Wiring Reference:
Warning: Before proceeding you are obligated to read and agree to the terms and conditions attached to this manual. Misuse of this product may cause injury or death. Incorrect installation may cause damage
More informationTraffic Logix SafePace 100 Radar Speed Sign Product Specifications Version 2.7
Traffic Logix SafePace 100 Radar Speed Sign Product Specifications Version 2.7 1 Table of Contents Dimensions... 3 Technical Specifications... 4 Communication... 5 Programming... 6 Data Collection and
More informationA Predictive Delay Fault Avoidance Scheme for Coarse Grained Reconfigurable Architecture
A Predictive Fault Avoidance Scheme for Coarse Grained Reconfigurable Architecture Toshihiro Kameda 1 Hiroaki Konoura 1 Dawood Alnajjar 1 Yukio Mitsuyama 2 Masanori Hashimoto 1 Takao Onoye 1 hasimoto@ist.osaka
More informationTachometer. English. Users Manual
931 Tachometer Users Manual English PN 3409877 January 2009 2009 Fluke Corporation, All rights reserved. Printed in Germany. Specifications are subject to change without notice. All product names are trademarks
More informationAdvantage Memory Corporation reserves the right to change products and specifications without notice
SD872-8X8-72VS4 SDRAM DIMM 8MX72 SDRAM DIMM with ECC based on 8MX8, 4B, 4K Refresh, 3.3V DRAMs with SPD GENERAL DESCRIPTION The Advantage SD872-8X8-72VS4 is a 8MX72 Synchronous Dynamic RAM high-density
More informationL (LK9, LQ8) when use in: Saab 9-3
NOTE: Printing this file may require 8.5" x 14" (legal size) paper, depending on your printer setup. Catalytic Converter Monitoring P0420 Time for Rear O2 sensor signal Time for rear O2 to go low. Value
More informationWeb Volume Control Model WV220
WEB CONTROL PRODUCTS User Manual Web Volume Control Model WV220 1 In accordance with Nexen s established policy of constant product improvement, the specifications contained in this manual are subject
More informationFleet Micro Series Advanced Battery Charging System Dual and Quad Instructional Manual
Fleet Micro Series Advanced Battery Charging System Dual and Quad Instructional Manual Introduction: Thank you for your choice in purchasing a Core SWX Fleet Micro Charger. The charger is designed to provide
More informationDepartment of Civil Engineering The University of British Columbia. Nicolas Saunier
Department of Civil Engineering The University of British Columbia TRUCK SIGNAL PRIORITY Nicolas Saunier Wook Kang Why Truck Priority? Reduce Rd the Cost of Goods Transportation Reduce Red Light Running
More informationInstallation Guide High-end dynamic balancing instrumentation Page 1. Installation Guide for. EasyBalance 2.2
High-end dynamic balancing instrumentation Page 1 for EasyBalance 2.2 High-end dynamic balancing instrumentation High-end dynamic balancing instrumentation Page 2 1 Mechanical Modifications There are two
More informationTracer VMA with AutoReg
Tracer with AutoReg Flowmeter with Automatic Flow Regulation Operating Instructions RoHS Compliant General The Tracer Flowmeter with AutoReg provides: Analog Flow Output (Selectable 0 to 3.5V, 0 to 5V
More informationValvoline Fourth-Quarter Fiscal 2016 Earnings Conference Call. November 9, 2016
Valvoline Fourth-Quarter Fiscal 2016 Earnings Conference Call November 9, 2016 Forward-Looking Statements This presentation contains forward-looking statements within the meaning of Section 27A of the
More informationNear-Optimal Precharging in High-Performance Nanoscale CMOS Caches
Near-Optimal Precharging in High-Performance Nanoscale CMOS Caches Se-Hyun Yang and Babak Falsafi Computer Architecture Laboratory (CALCM) Carnegie Mellon University {sehyun, babak}@cmu.edu http://www.ece.cmu.edu/~powertap
More informationCatalog Issued 3-03 (PDF Rev )
W6/W9 series Magnetic Hydraulic P& Circuit reakers Features Designed for the international market. UL Recognized (UL77 and UL5), CS ccepted and VDE approved. Ratings to 5 amps. Heavy duty #-32 stud connections.
More informationUBI-2590 HC SMBus (Part No. UBBL13) Battery Specification
UBI-2590 HC SMBus (Part No. UBBL13) Battery Specification Copyright Copyright 2012, Ultralife Corporation. All Rights Reserved. Trademark Information Ultralife, the Ultralife logo design, SmartCircuit
More informationUsing SystemVerilog Assertions in Gate-Level Verification Environments
Using SystemVerilog Assertions in Gate-Level Verification Environments Mark Litterick (Verification Consultant) mark.litterick@verilab.com 2 Introduction Gate-level simulations why bother? methodology
More informationFrom Developing Credit Risk Models Using SAS Enterprise Miner and SAS/STAT. Full book available for purchase here.
From Developing Credit Risk Models Using SAS Enterprise Miner and SAS/STAT. Full book available for purchase here. About this Book... ix About the Author... xiii Acknowledgments...xv Chapter 1 Introduction...
More informationService Bulletin Buses
Prevost Saint-Nicolas, Quebec, Canada Service Bulletin Buses Date Group No. Release Page 6.2010 364 00 1(12) Vehicle Electronic Control Unit (VECU) MID 144, Diagnostic Trouble Code (DTC), Guide Vehicle
More informationFood Service Technology Center
Food Service Technology Center Convotherm C4eT 6.20EB Electric Combination Oven Test Report Application of ASTM Standard Test Method F2861-10 May 2015 Prepared by: Denis Livchak Fisher-Nickel, Inc. Contributors:
More informationLocal Memory Bus (LMB) V10 (v1.00a)
Local Memory Bus (LMB) V10 (v1.00a) DS445 December 2, 2009 Introduction The LMB V10 module is used as the LMB interconnect for Xilinx FPGA-based embedded processor systems. The LMB is a fast, local bus
More informationTMD90A Dual Input Digital Thermometer
TMD90A Dual Input Digital Thermometer User Manual For detailed specifications and ordering info go to www.testequipmentdepot.com TMD90A Dual Input Thermometer Contents Safety Information 2 Symbols Used
More informationCombiner Manual #OMBINER #OMBINER < 2* )NSTALLATION 7(&+12/ i 62/$5
Combiner Manual i ii Combiner Manual Combiner Manual Copyright 2004 SMA America, Inc. All rights reserved. All rights reserved. No part of this document may be reproduced, stored in a retrieval system,
More informationDual-Rail Domino Logic Circuits with PVT Variations in VDSM Technology
Dual-Rail Domino Logic Circuits with PVT Variations in VDSM Technology C. H. Balaji 1, E. V. Kishore 2, A. Ramakrishna 3 1 Student, Electronics and Communication Engineering, K L University, Vijayawada,
More informationLinking the Indiana ISTEP+ Assessments to the NWEA MAP Growth Tests. February 2017 Updated November 2017
Linking the Indiana ISTEP+ Assessments to the NWEA MAP Growth Tests February 2017 Updated November 2017 2017 NWEA. All rights reserved. No part of this document may be modified or further distributed without
More informationModel-Based Design and Hardware-in-the-Loop Simulation for Clean Vehicles Bo Chen, Ph.D.
Model-Based Design and Hardware-in-the-Loop Simulation for Clean Vehicles Bo Chen, Ph.D. Dave House Associate Professor of Mechanical Engineering and Electrical Engineering Department of Mechanical Engineering
More information128Mb Synchronous DRAM. Features High Performance: Description. REV 1.0 May, 2001 NT5SV32M4CT NT5SV16M8CT NT5SV8M16CT
Features High Performance: f Clock Frequency -7K 3 CL=2-75B, CL=3-8B, CL=2 Single Pulsed RAS Interface Fully Synchronous to Positive Clock Edge Four Banks controlled by BS0/BS1 (Bank Select) Units 133
More informationABOT CHASSIS ASSEMBLY GUIDE
ABOT CHASSIS ASSEMBLY GUIDE ABOT-0004 01/12 Features Abot assembly Distance sensor and servomotor data Description The guide for assembling the Abot robot chassis Including the basic data Table of Contents
More informationINSTALLATION, MAINTENANCE & SERVICE MANUAL
INSTALLATI, MAINTENANCE & SERVICE MANUAL M151217L Product Specifications 20" (506mm) 3.3 (85mm) 9.8 (250mm) 38" (974mm) 4.2 (106mm) 9.8 (250mm) 48" (1208mm) 4.2 (106mm) 56" (1442mm) 4.2 (106mm) Wiring
More informationAttitude Instructions Ford 6.0 Powerstroke
Attitude Instructions Ford 6.0 Powerstroke Attitude 2003, Edge Products Incorporated All rights reserved. Edge Products Incorporated 5875 South Adams Avenue Suite 100 Ogden, UT 84405 (801) 476-3343 www.edgeproductsinc.com
More informationWWW.MORETRACTION.COM TMS-5500-SL ELECTRONIC TRACTION CONTROL US PATENT 6,577,944 Other Patents Pending COPYRIGHT NOTICE Copyright 1999-2013 Davis Technologies, LLC. All rights reserved. Information in
More informationCormorant Engineering
Gas Well Deliquification Workshop Sheraton Hotel, Denver, Colorado February 27 March 2, 20 Selection and Validation of Hydraulic Dewatering Pumps using Predictive Performance Model and Downhole Simulator
More informationReplacement Part has been judged by Mini Circuits Engineering as a suitable replacement to Original Part a. Case Style DQ849 (No Change) Marking MN5A
REPLACEMENT PART REFERENCE GUIDE, MNA 5+ AN 60 088 ORIGINAL PART: REPLACEMENT PART: MNA 5+ MNA 5A+ Replacement Part has been judged by Mini Circuits Engineering as a suitable replacement to Original Part
More informationCMMI Opens the Gate: The Practical Relationship between CMMI and Stage- Gate Decision Models
NDIA CMMI Technology Conference & User Group Denver, CO CMMI Opens the Gate: The Practical Relationship between CMMI and Stage- Gate Decision Models ABB USCRC - 1 Agenda ABB Overview Stage-Gate Process
More informationAdvantage Memory Corporation reserves the right to change products and specifications without notice
SDRAM SODIMM 4MX64 SDRAM SO DIMM based on 4MX16, 4Banks, 4K Refresh, 3.3V DRAMs with SPD GENERAL DESCRIPTION The Advantage is a 4MX64 Synchronous Dynamic RAM high density memory module. The Advantage consists
More informationTraffic Logix SafePace 100 Radar Speed Sign Product Specifications Version 2.7
Traffic Logix SafePace 100 Radar Speed Sign Product Specifications Version 2.7 1 Table of Contents Dimensions... 3 Dimensions of SafePace 100 Radar Sign with Full YOUR SPEED Sign Face... 4 Dimensions of
More informationComputer Architecture 计算机体系结构. Lecture 3. Instruction-Level Parallelism I 第三讲 指令级并行 I. Chao Li, PhD. 李超博士
Computer Architecture 计算机体系结构 Lecture 3. Instruction-Level Parallelism I 第三讲 指令级并行 I Chao Li, PhD. 李超博士 SJTU-SE346, Spring 2018 Review ISA, micro-architecture, physical design Evolution of ISA CISC vs
More informationINDUSTRIAL CRANE MODERNIZATION THE SMART WAY TO EXTEND OVERHEAD CRANE SERVICE LIFE
INDUSTRIAL NUCLEAR PORT HEAVY-DUTY LIFT TRUCKS SERVICE MODERNIZATION SERVICES INDUSTRIAL CRANE MODERNIZATION THE SMART WAY TO EXTEND OVERHEAD CRANE SERVICE LIFE 2 Konecranes Crane Modernizations Crane
More informationFour-Position Gang Charger for SAFEMTX Li-ION Battery Packs
Four-Position Gang Charger for SAFEMTX Li-ION Battery Packs (P/N 10067285) Instruction Manual " WARNING THIS MANUAL MUST BE CAREFULLY READ BY ALL INDIVIDUALS WHO HAVE OR WILL HAVE THE RESPONSIBILITY FOR
More informationUNLEASH THE POWER. See More at: bullydog.com WITH BULLY DOG PERFORMANCE PRODUCTS. Doc.# BD41001 v1.0.3
Bully Dog Technologies, LLC is a team built on integrity that is dedicated to leading the vehicle performance industry with an uncompromising code of ethics demonstrated in the soundness of its employees,
More informationUsing on-line power modeling for server power capping
IBM Research and University of Texas at Austin Using on-line power modeling for server power capping Madhu Saravana Sibi Govindan, Charles Lefurgy, and Ajay Dholakia Workshop on Energy-Efficient Design
More information