PPEP: ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK

Size: px
Start display at page:

Download "PPEP: ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK"

Transcription

1 PPEP: ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK BO SU JUNLI GU LI SHEN WEI HUANG JOSEPH L. GREATHOUSE ZHIYING WANG NUDT AMD RESEARCH DECEMBER 17, 2014

2 BACKGROUND Dynamic Voltage and Frequency Scaling (DVFS) Widely employed Boost performance, lower power, and improve energy efficiency 2 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

3 BACKGROUND Dynamic Voltage and Frequency Scaling (DVFS) Widely employed Boost performance, lower power, and improve energy efficiency Good DVFS decisions require Accurate performance & power predictions across Voltage Frequency (VF) states 3 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

4 BACKGROUND Dynamic Voltage and Frequency Scaling (DVFS) Widely employed Boost performance, lower power, and improve energy efficiency Good DVFS decisions require Accurate performance & power predictions across Voltage Frequency (VF) states Challenges Brought by Modern Processors Cores and north bridge Different clock domains and power planes Off-chip Memory Own clock domain 4 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

5 TWO QUESTIONS Q1: How does the application s performance change with the VF state? Q2: How does the application s power change with the VF state? 5 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

6 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS Processor: AMD FX nd Generation Family 15h Piledriver FX-8320 CPU 6 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

7 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS CU CU Processor: AMD FX nd Generation Family 15h Piledriver 4 CUs (Compute Units) 2 Cores in a CU 5 VF (Voltage-Frequency) states FX-8320 CPU CU CU 7 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

8 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS Processor: AMD FX nd Generation Family 15h Piledriver 4 CUs (Compute Units) 2 Cores in a CU 5 VF (Voltage-Frequency) states The NB (North Bridge) Shared by all CUs L3$ and memory controller CU North Bridge CU FX-8320 CPU CU CU 8 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

9 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS Processor: AMD FX nd Generation Family 15h Piledriver 4 CUs (Compute Units) 2 Cores in a CU 5 VF (Voltage-Frequency) states The NB (North Bridge) Shared by all CUs L3$ and memory controller Power Measurement Pololu ACS711: Current sensor Power Supply CU North Bridge CU FX-8320 CPU ACS711 CU CU ASUS M5A97 9 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

10 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS CU CU Processor: AMD FX nd Generation Family 15h Piledriver 4 CUs (Compute Units) 2 Cores in a CU 5 VF (Voltage-Frequency) states The NB (North Bridge) Shared by all CUs L3$ and memory controller Power Measurement Pololu ACS711: Current sensor Arduino: Power reading Power Supply North Bridge CU FX-8320 CPU ACS711 CU ASUS M5A97 Arduino 10 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

11 EXPERIMENTAL SETUP PLATFORM & BENCHMARKS CU CU Processor: AMD FX nd Generation Family 15h Piledriver 4 CUs (Compute Units) 2 Cores in a CU 5 VF (Voltage-Frequency) states The NB (North Bridge) Shared by all CUs L3$ and memory controller Power Measurement Pololu ACS711: Current sensor Arduino: Power reading Benchmark Suites Power Supply North Bridge FX-8320 CPU ACS711 SPEC CPU 2006, PARSEC, NAS Parallel Arduino Benchmarks CU CU ASUS M5A97 11 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

12 OUTLINE Background Experimental Setup Performance Prediction Across DVFS States Power Prediction Across DVFS States PPEP Framework Conclusion Q1: How does the application s performance change with the VF state? 12 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

13 Normalized CPI HOW CPI CHANGES WITH FREQUENCY (LOWER IS BETTER) Normalized Frequency 13 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

14 Normalized CPI HOW CPI CHANGES WITH FREQUENCY (LOWER IS BETTER) Estimate #1 CPI does not change when frequency increases Normalized Frequency bzip2 14 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

15 Normalized CPI HOW CPI CHANGES WITH FREQUENCY (LOWER IS BETTER) Estimate #1 Estimate #2 CPI does not change when frequency increases. CPI also increases when frequency increases mcf Normalized Frequency bzip2 15 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

16 Normalized CPI HOW CPI CHANGES WITH FREQUENCY (LOWER IS BETTER) Estimate #1 Estimate #2 CPI does not change when frequency increases. CPI also increases when frequency increases mcf Normalized Frequency GAP bzip2 16 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

17 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM Core clock cycles 17 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

18 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. 18 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

19 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. 19 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

20 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. Cycles stalled on Memory Accessing. Scale with Core Frequency. 20 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

21 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. Core Frequency: f -> f Cycles stalled on Memory Accessing. Scale with Core Frequency. 21 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

22 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. Core Frequency: f -> f Does not scale. Does not scale. Cycles stalled on Memory Accessing. Scale with Core Frequency. 22 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

23 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. Core Frequency: f -> f GAP Does not scale. Does not scale. Scales by f /f. Cycles stalled on Memory Accessing. Scale with Core Frequency. 23 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

24 WHAT MAKES THE CPI DIFFERENCE? bzip2 Core L3+DRAM mcf Core L3+DRAM Core clock cycles Cycles working in the CPU Core. Core Frequency doesn t matter. Core Frequency: f -> f GAP Does not scale. Does not scale. Scales by f /f. Cycles stalled on Memory Accessing. Scale with Core Frequency. Key: Memory Stall Cycles estimation 24 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

25 Core clock cycles Core L3+DRAM 25 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

26 Core clock cycles Core L3+DRAM 26 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

27 Core clock cycles Core L3+DRAM 27 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

28 Core clock cycles Core L3+DRAM 28 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

29 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Core clock cycles Core L3+DRAM 29 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

30 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Core clock cycles Core L3+DRAM Leading Load 30 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

31 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Core clock cycles Core L3+DRAM Leading Load Not Leading 31 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

32 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Core clock cycles Core L3+DRAM Leading Load Leading Load Not Leading 32 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

33 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Memory Stall Cycle Core clock cycles Memory Stall Cycles Core L3+DRAM Leading Load Leading Load Not Leading Leading Loads Model: Memory stall cycles = cycles that leading loads are active 33 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

34 LEADING LOADS MEMORY TIME ESTIMATION [CF 10 Keramidas+], [TOC 10 Eyerman+], and [IGCC 11 Rountree+] Memory Stall Cycle Core clock cycles Memory Stall Cycles Core L3+DRAM Leading Load Leading Load Not Leading Leading Loads Model: Memory stall cycles = cycles that leading loads are active AMD hardware has MAB0 performance counter which can estimate Leading Loads cycles. [USENIX-ATC 14 Su+] 34 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

35 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles 35 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

36 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1 36 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

37 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E3 E1 37 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

38 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 E1 38 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

39 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 E1 39 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

40 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 40 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

41 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 41 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

42 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 E1-E3 42 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

43 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 E1-E3 E3*(f /f) 43 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

44 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 E1-E3 E3*(f /f) E1+E3*(f /f - 1) 44 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

45 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 E1-E3 E3*(f /f) Instruction number = E2 E1+E3*(f /f - 1) 45 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

46 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 Instruction number = E2 CPI(f) = E1/E2 E1 E1-E3 E3*(f /f) Instruction number = E2 CPI(f ) = (E1+E3*(f /f - 1))/E2 E1+E3*(f /f - 1) 46 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

47 CPI PREDICTOR 3 HW events Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 E1 E1-E3 E3*(f /f) E1+E3*(f /f - 1) Instruction number = E2 CPI(f) = E1/E2 LL-MAB CPI Predictor Instruction number = E2 CPI(f ) = (E1+E3*(f /f - 1))/E2 47 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

48 CPI PREDICTOR 3 HW events Step=200ms Running Frequency = f Core L3 + DRAM Another Frequency = f Core L3 + DRAM # Code Event Name E1 0x76 CPU_Clocks_not_Halted E2 0xc0 Retired_Instructions E3 0x69 MAB0_Wait_Cycles MAB based Leading Loads Cycles Core clock cycles E1-E3 E3 E1 E1-E3 E3*(f /f) E1+E3*(f /f - 1) Instruction number = E2 CPI(f) = E1/E2 LL-MAB CPI Predictor Instruction number = E2 CPI(f ) = (E1+E3*(f /f - 1))/E2 48 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

49 CPI CPI PREDICTED CPI CURVE V.S. MEASURED CPI CURVE 1.7GHZ (VF2) -> 3.5GHZ (VF5) 5 Predicted CPI of 3.5GHz from Measurements Taken at 1.7 GHz (473.astar) % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 5 Measured CPI of 3.5GHz (473.astar) % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 49 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

50 CPI PREDICTION ERROR FX-8320 Processor All 52 benchmarks in SPEC, PARSEC, and NPB 1.7GHz (VF2) -> 3.5GHz (VF5) 10% 8% 1.7GHz->3.5GHz 15.5% 14.2% 6% 4% 3.0% 2% 0% 483.xalancbmk 482.sphinx3 481.wrf 473.astar 471.omnetpp 470.lbm 465.tonto 464.h264ref 462.libquantum 459.GemsFDTD 458.sjeng 456.hmmer 454.calculix 453.povray 450.soplex 447.dealII 445.gobmk 444.namd 437.leslie3d 436.cactusADM 435.gromacs 434.zeusmp 433.milc 429.mcf 416.gobmk 410.bwaves 403.gcc 401.bzip2 400.perlbench UA SP MG LU IS FT EP DC CG BT streamcluster dedup canneal x264 vips swaptions raytrace freqmine fluidanimate ferret facesim bodytrack blackscholes Average 50 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

51 OUTLINE Background Experimental Methodology Performance Prediction Across DVFS States Power Prediction Across DVFS States PPEP Framework Conclusion Q2: How does the application s power change with the VF state? 51 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

52 IDLE POWER MODEL + DYNAMIC POWER MODEL Cooling 52 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

53 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Power (Sampled Point) Temperature Cooling Step (200ms) PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

54 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Heating Power (Sampled Point) Temperature Cooling Step (200ms) PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

55 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Heating Power (Sampled Point) Temperature Cooling Step (200ms) PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

56 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Heating Power (Sampled Point) Temperature Cooling Step (200ms) PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

57 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Heating Power (Sampled Point) Temperature Relationship? Cooling Step (200ms) PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

58 Normalized Chip Power Temperature (K) IDLE POWER MODEL + DYNAMIC POWER MODEL Heating Power (Sampled Point) Temperature Relationship? Cooling Step (200ms) VF State Voltage(V) Frequency(GHz) Error of P idle VF % VF % VF % VF % VF % 58 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

59 IDLE POWER MODEL + DYNAMIC POWER MODEL 59 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

60 IDLE POWER MODEL + DYNAMIC POWER MODEL # Event Code E4 Retired_uOPs 0xc1 E5 FPU_Pipe_Assignment 0x00 E6 L1I$_Fetches 0x80 E7 L1D$_Accesses 0x40 E8 L2$_Requests 0x7d E9 Retired_Branches 0xc2 E10 Retired_MisBranches 0xc3 E11 L2$_Misses 0x7e E12 Dispatch_Stalls 0xd1 (PS= Per-Second Count Value) 60 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

61 POWER MODEL VALIDATION EXPERIMENT Benchmark Combination Totally 152 combinations Thread number: 1, 2, 3, and 4 SPEC: single-/multi- programed PARSEC: single-/multi- threaded NPB: single-/multi- threaded Number SPEC PARSEC NPB Total 1-thread thread thread thread Total PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

62 POWER MODEL VALIDATION EXPERIMENT Benchmark Combination Totally 152 combinations Thread number: 1, 2, 3, and 4 SPEC: single-/multi- programed PARSEC: single-/multi- threaded NPB: single-/multi- threaded Number SPEC PARSEC NPB Total 1-thread thread thread thread Total fold cross validation Divide 152 benchmark combination in to 4 sets Train on all combinations of 3 sets Test on the remaining set 62 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

63 Validation Error of Chip Power POWER MODEL VALIDATION (LOWER IS BETTER) Error: 4.6% Standard Deviation: 2.8% 8% 7% 6% 5% 4% 3% 2% 1% Average Standard Deviation 0% ALL NPB PAR SPE ALL NPB PAR SPE ALL NPB PAR SPE ALL NPB PAR SPE ALL NPB PAR SPE VF5 VF4 VF3 VF2 VF1 63 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

64 PREDICTING POWER ACROSS VF STATES VF5 VF4 VF3 VF2 VF1 CPI(VF 5 ) 3 Events => CPI(VF 4 ) CPI(VF 3 ) CPI(VF 2 ) CPI(VF 1 ) LL-MAB T => P idle (VF 4 ) 9 Events => P dyn (VF 4 ) 64 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

65 PREDICTING POWER ACROSS VF STATES VF5 CPI(VF 5 ) P idle (VF 5 ) VF4 3 Events => CPI(VF 4 ) T => P idle (VF 4 ) 9 Events => P dyn (VF 4 ) VF3 CPI(VF 3 ) P idle (VF 3 ) VF2 CPI(VF 2 ) P idle (VF 2 ) VF1 CPI(VF 1 ) P idle (VF 1 ) LL-MAB 65 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

66 PREDICTING POWER ACROSS VF STATES VF5 CPI(VF 5 ) P idle (VF 5 ) VF4 3 Events => CPI(VF 4 ) T => P idle (VF 4 ) 9 Events => P dyn (VF 4 ) VF3 CPI(VF 3 ) P idle (VF 3 ) VF2 CPI(VF 2 ) P idle (VF 2 ) VF1 CPI(VF 1 ) P idle (VF 1 ) LL-MAB same T 66 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

67 PREDICTING POWER ACROSS VF STATES VF5 CPI(VF 5 ) P idle (VF 5 ) P dyn (VF 5 ) VF4 3 Events => CPI(VF 4 ) T => P idle (VF 4 ) 9 Events => P dyn (VF 4 ) VF3 CPI(VF 3 ) P idle (VF 3 ) P dyn (VF 3 ) VF2 CPI(VF 2 ) P idle (VF 2 ) P dyn (VF 2 ) VF1 CPI(VF 1 ) P idle (VF 1 ) P dyn (VF 1 ) LL-MAB same T 67 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

68 PREDICTING POWER ACROSS VF STATES VF5 VF4 VF3 VF2 VF1 CPI(VF 5 ) 3 Events => CPI(VF 4 ) P idle (VF 5 ) T => P idle (VF 4 ) P dyn (VF 5 ) 9 Events => P dyn (VF 4 ) CPI(VF 3 ) CPI(VF 2 ) CPI(VF 1 ) P idle (VF 3 ) P idle (VF 2 ) P idle (VF 1 ) P dyn (VF 3 ) P dyn (VF 2 ) P dyn (VF 1 ) LL-MAB same T HW event prediction? E PS(i) (VF5) E PS(i) (VF4) E PS(i) (VF3)? i=4,,12 E PS(i) (VF2) E PS(i) (VF1) 68 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

69 PREDICTING POWER ACROSS VF STATES VF5 VF4 VF3 VF2 VF1 CPI(VF 5 ) 3 Events => CPI(VF 4 ) P idle (VF 5 ) T => P idle (VF 4 ) P dyn (VF 5 ) 9 Events => P dyn (VF 4 ) CPI(VF 3 ) CPI(VF 2 ) CPI(VF 1 ) P idle (VF 3 ) P idle (VF 2 ) P idle (VF 1 ) P dyn (VF 3 ) P dyn (VF 2 ) P dyn (VF 1 ) LL-MAB same T HW event prediction? E PS(i) (VF5) E PS(i) (VF4) E PS(i) (VF3)? i=4,,12 E PS(i) (VF2) E PS(i) (VF1) 2 observations + LL-MAB CPI Predictor 69 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

70 OBSERVATION 1 + OBSERVATION 2 At any given point in the execution of a program, core-private HW event counts per instruction are independent of VF state. => To execute the same section of instructions in a program, the activities of core-private resources are independent of VF state. # Event Name E4 E5 E6 E7 E8 E9 E10 E11 Retired_uOPs FPU_Pipe_Assignment L1I$_Fetches L1D$_Accesses L2$_Requests Retired_Branches Retired_MisBranches L2$_Misses 70 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

71 OBSERVATION 1 + OBSERVATION 2 At any given point in the execution of a program, core-private HW event counts per instruction are independent of VF state. => To execute the same section of instructions in a program, the activities of core-private resources are independent of VF state. # Event Name E4 Retired_uOPs E5 FPU_Pipe_Assignment E6 L1I$_Fetches E7 L1D$_Accesses E8 L2$_Requests E9 Retired_Branches E10 Retired_MisBranches E11 L2$_Misses Error 0.6% 0.9% 0.7% 0.7% 5.0% 0.7% 1.3% 4.0% On FX-8320, between 3.5GHz (VF5) and 1.7GHz (VF2) 71 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

72 OBSERVATION 1 + OBSERVATION 2 # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls 72 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

73 OBSERVATION 1 + OBSERVATION 2 # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Gap Error 1.7% On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) 73 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

74 OBSERVATION 1 + OBSERVATION 2 # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Cycles retiring 4 instructions Cycles retiring 2 instructions Cycles without retiring Gap Error 1.7% Cycles retiring 3 instructions Cycles retiring 1 instruction On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) 74 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

75 OBSERVATION 1 + OBSERVATION 2 Freq = f # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Cycles retiring 4 instructions Cycles retiring 2 instructions Cycles without retiring Gap Error 1.7% Cycles retiring 3 instructions Cycles retiring 1 instruction On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) Start end Freq = f 75 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

76 OBSERVATION 1 + OBSERVATION 2 Freq = f # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Cycles retiring 4 instructions Cycles retiring 2 instructions Cycles without retiring Gap Error 1.7% Cycles retiring 3 instructions Cycles retiring 1 instruction On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) Start Frequency Independent end Freq = f 76 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

77 OBSERVATION 1 + OBSERVATION 2 Freq = f # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Cycles retiring 4 instructions Cycles retiring 2 instructions Cycles without retiring Gap Error 1.7% On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) Cycles retiring 3 instructions Cycles retiring 1 instruction Overheads of mispredicted branches and exceptions Start Frequency Independent end Freq = f 77 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

78 OBSERVATION 1 + OBSERVATION 2 Freq = f # Event E1 CPU_Clock_not_Halted E12 Dispatch_Stalls Cycles retiring 4 instructions Cycles retiring 2 instructions Cycles without retiring Gap Error 1.7% On FX-8320, between3.5ghz (VF5) and 1.7GHz (VF2) Cycles retiring 3 instructions Cycles retiring 1 instruction Overheads of mispredicted branches and exceptions Dispatch Stalls Start Frequency Independent end Freq = f 78 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

79 DYNAMIC POWER PREDICTION FLOW HW Events Prediction Flow (Bridge: LL-MAB + 2 OBs) # Event E1 CPU_Clock_not_Halted E2 Retired_Instructions E3 MAB0_Wait_Cycles E4 Retired_uOPs E5 FPU_Pipe_Assignment E6 L1I$_Fetches E7 L1D$_Accesses E8 L2$_Requests E9 Retired_Branches E10 Retired_MisBranches E11 L2$_Misses E12 Dispatch_Stalls PS=per-second PI=per-instruction 79 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

80 DYNAMIC POWER PREDICTION FLOW HW Events Prediction Flow (Bridge: LL-MAB + 2 OBs) # Event E1 CPU_Clock_not_Halted From PMC E2 Retired_Instructions E3 MAB0_Wait_Cycles E4 Retired_uOPs E5 FPU_Pipe_Assignment E6 L1I$_Fetches E7 L1D$_Accesses E8 L2$_Requests E9 Retired_Branches E10 Retired_MisBranches E11 L2$_Misses E12 Dispatch_Stalls PS=per-second PI=per-instruction 80 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

81 DYNAMIC POWER PREDICTION FLOW HW Events Prediction Flow (Bridge: LL-MAB + 2 OBs) # Event E1 CPU_Clock_not_Halted From PMC E2 Retired_Instructions E3 MAB0_Wait_Cycles E4 Retired_uOPs E5 FPU_Pipe_Assignment E6 L1I$_Fetches E7 L1D$_Accesses E8 L2$_Requests E9 Retired_Branches E10 Retired_MisBranches E11 L2$_Misses E12 Dispatch_Stalls PS=per-second PI=per-instruction 81 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

82 DYNAMIC POWER PREDICTION FLOW HW Events Prediction Flow (Bridge: LL-MAB + 2 OBs) # Event E1 CPU_Clock_not_Halted From PMC E2 Retired_Instructions LL-MAB E3 MAB0_Wait_Cycles E4 Retired_uOPs E5 FPU_Pipe_Assignment E6 L1I$_Fetches E7 L1D$_Accesses E8 L2$_Requests E9 Retired_Branches E10 Retired_MisBranches E11 L2$_Misses E12 Dispatch_Stalls PS=per-second PI=per-instruction 82 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

83 DYNAMIC POWER PREDICTION FLOW HW Events # Event E1 E2 E3 E4 E5 E6 E7 E8 E9 E10 E11 E12 CPU_Clock_not_Halted Retired_Instructions MAB0_Wait_Cycles Retired_uOPs FPU_Pipe_Assignment L1I$_Fetches L1D$_Accesses L2$_Requests Retired_Branches Retired_MisBranches L2$_Misses Dispatch_Stalls PS=per-second PI=per-instruction Prediction Flow (Bridge: LL-MAB + 2 OBs) From PMC 83 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 LL-MAB Observation 1 Observation 2

84 DYNAMIC POWER PREDICTION FLOW HW Events # Event E1 E2 E3 E4 E5 E6 E7 E8 E9 E10 E11 E12 CPU_Clock_not_Halted Retired_Instructions MAB0_Wait_Cycles Retired_uOPs FPU_Pipe_Assignment L1I$_Fetches L1D$_Accesses L2$_Requests Retired_Branches Retired_MisBranches L2$_Misses Dispatch_Stalls PS=per-second PI=per-instruction Prediction Flow (Bridge: LL-MAB + 2 OBs) From PMC 84 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 LL-MAB Observation 1 Observation 2

85 DYNAMIC POWER PREDICTION FLOW HW Events # Event E1 E2 E3 E4 E5 E6 E7 E8 E9 E10 E11 E12 CPU_Clock_not_Halted Retired_Instructions MAB0_Wait_Cycles Retired_uOPs FPU_Pipe_Assignment L1I$_Fetches L1D$_Accesses L2$_Requests Retired_Branches Retired_MisBranches L2$_Misses Dispatch_Stalls PS=per-second PI=per-instruction Prediction Flow (Bridge: LL-MAB + 2 OBs) From PMC 85 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 LL-MAB Observation 1 Observation 2

86 Chip Power Prediction Error Across VF States PREDICTING ERROR OF AVERAGE CHIP POWER LOWER IS BETTER Error: 4.2% Standard Deviation: 3.6% 8% 7% 6% 5% 4% 3% 2% 1% Average Standard Deviation 0% VF1->VF1 VF1->VF2 VF1->VF3 VF1->VF4 VF1->VF5 VF2->VF1 VF2->VF2 VF2->VF3 VF2->VF4 VF2->VF5 VF3->VF1 VF3->VF2 VF3->VF3 VF3->VF4 VF3->VF5 VF4->VF1 VF4->VF2 VF4->VF3 VF4->VF4 VF4->VF5 VF5->VF1 VF5->VF2 VF5->VF3 VF5->VF4 VF5->VF5 86 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

87 OUTLINE Background Experimental Methodology Performance Prediction Across DVFS States Power Prediction Across DVFS States PPEP Framework Conclusion Q1: How does the application s performance change with the VF state? Q2: How does the application s power change with the VF state? 87 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

88 OUTLINE Background Experimental Methodology Performance Prediction Across DVFS States Power Prediction Across DVFS States PPEP Framework Conclusion 88 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

89 PPEP FRAMEWORK PMC based User Daemon Periodically Running All-in-one: PPEP PMC, VF 1 CPIs at all VFs Performance Prediction Model 2 CPI at all VFs Core 1's PPE Core n's PPE Dynamic Info: PMC, VF, Temperature 4 PMC VF Temperature, VF PMC at other VFs H/W Events Predictor 3 Dynamic Power Model Idle Power Model Dynamic Power at all VFs Energy Prediction Idle Power at all VFs 5 Core 0's PPE DVFS Exploring Space: Energy -Delay Scaling, Power Capping, etc. 6 VF Regulator DVFS Decision 89 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

90 PPEP FRAMEWORK PMC based User Daemon Periodically Running Performance Online Power Energy Prediction Commercial Processors All-in-one: PPEP PMC, VF 1 CPIs at all VFs Performance Prediction Model 2 CPI at all VFs Core 1's PPE Core n's PPE Dynamic Info: PMC, VF, Temperature 4 PMC VF Temperature, VF PMC at other VFs H/W Events Predictor 3 Dynamic Power Model Idle Power Model Dynamic Power at all VFs Energy Prediction Idle Power at all VFs 5 Core 0's PPE DVFS Exploring Space: Energy -Delay Scaling, Power Capping, etc. 6 VF Regulator DVFS Decision 90 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

91 CONCLUSION On Commercial Processor Demonstrated an across-vf CPI predictor LL-MAB According to the Leading Loads theory Demonstrated an across-vf power predictor Through the LL-MAB CPI predictor and 2 observations Combining them together: PPEP Framework Supplies performance, power, and energy information SW method w/o requiring HW or OS modification 91 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

92 THANK YOU! Questions 92 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

93 DISCLAIMER & ATTRIBUTION The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to product and roadmap changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes. AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION. AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. ATTRIBUTION 2014 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo and combinations thereof are trademarks of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. SPEC is a registered trademark of the Standard Performance Evaluation Corporation (SPEC). Other names are for informational purposes only and may be trademarks of their respective owners. 93 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

94 Backup Slides

95 PERFORMANCE & POWER PREDICTION Running VF state: VF4 Predicting performance, power of other VF states Core VF State Performance Power Higher VF 5 Running at: CPI(VF 5 ) Running at: Power(VF 5 ) VF 4 CPI(VF 4 ) Power(VF 4 ) VF 3 CPI(VF 3 ) Power(VF 3 ) VF 2 CPI(VF 2 ) Power(VF 2 ) Lower VF 1 Performance prediction? CPI(VF 1 ) CPU Power prediction? Power(VF 1 ) 95 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

96 EXPERIMENTAL METHODOLOGY SOFTWARE TOOLS Operating System Ubuntu LTS Desktop (kernel version ) Tools taskset: A2C mapping msr-tools: PMC control CPUFreq userspace governor: VF Scaling hwmon tree in sysfs: temperature measurement Benchmark Suites SPEC CPU 2006 v1.2 (29 benchmarks) PARSEC v2.1 (13 benchmarks) NAS Parallel Benchmarks v3.3.1 (10 benchmarks) 96 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

97 HOW TO ESTIMATE MEMORY STALL CYCLES? MODERN CORES MAKE THIS DIFFICULT Memory level parallelism Accesses overlap computation Variable latencies memory access counts do not estimate memory time accurately Core clock cycles Core Work L3+DRAM Variable Latencies Multiple Parallel Accesses Accesses Overlap Computation 97 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

98 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 98 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

99 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 99 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

100 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 100 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

101 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 101 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

102 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 102 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

103 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 103 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

104 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 104 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

105 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 105 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

106 LEADING LOADS ON AMD PROCESSORS L2 cache misses held in Miss Address Buffer (MAB) MAB entries have a static priority (e.g. MAB0 is highest priority) Highest priority empty MAB holds the miss until it returns from memory Core Clock Domain CPU Core & L1 Cache L2 Cache Miss Address Buffers MAB0 MAB1 MAB2 MABn NB and Memory Clock Domains L3 Cache & DRAM Performance event 0x69 allows SW to count # of cycles with filled MABs 106 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

107 MAB BASED LEADING LOADS MODEL: LL-MAB EXPERIMENT Benchmark Running on core 0. Manager Running on Core 7. Collects the counts every 200ms. CU 0 CU 1 CU 2 CU 3 Core 0 Core 1 Core 2 Core 3 Core 4 Core 5 Core 6 Core 7 Benchmark L2 Cache L2 Cache L2 Cache L2 Cache L3 Cache & Memory Controller Manager 200ms Core 0 Core 7 Measured CPI(f) CPI(f) CPI(f) CPI(f) CPI(f) CPI(f) CPI(f) CPI(f) Predicted CPI(f ) CPI(f ) CPI(f ) CPI(f ) CPI(f ) 107 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 CPI(f ) CPI(f ) CPI(f )

108 CPI CPI PREDICTED CPI V.S. MEASURED CPI 1.7GHZ -> 3.5GHZ 5 Predicted CPI of 3.5GHz from Measurements Taken at 1.7 GHz (473.astar) % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 5 Measured CPI of 3.5GHz (473.astar) % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 108 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

109 CPI CPI PREDICTED CPI V.S. MEASURED CPI 1.7GHZ -> 3.5GHZ % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) Measured CPI of 3.5GHz (473.astar) a) Divide each curve into 20 segments; Predicted CPI of 3.5GHz from Measurements Taken at 1.7 GHz (473.astar) 0 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 109 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 b) Compare cycle numbers in each segment-pair; c) Average absolute error => CPI Prediction Error.

110 CPI CPI PREDICTED CPI V.S. MEASURED CPI 1.7GHZ -> 3.5GHZ % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) Measured CPI of 3.5GHz (473.astar) a) Divide each curve into 20 segments; Predicted CPI of 3.5GHz from Measurements Taken at 1.7 GHz (473.astar) 0 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 110 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 b) Compare cycle numbers in each segment-pair; c) Average absolute error => CPI Prediction Error.

111 CPI CPI PREDICTED CPI V.S. MEASURED CPI 1.7GHZ -> 3.5GHZ % 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) Measured CPI of 3.5GHz (473.astar) a) Divide each curve into 20 segments; Predicted CPI of 3.5GHz from Measurements Taken at 1.7 GHz (473.astar) 0 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% Executing Rate (Retired Instructions/Total Instructions) 111 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014 b) Compare cycle numbers in each segment-pair; c) Average absolute error => CPI Prediction Error.

112 CPI Prediction Error CPI PREDICTION ERROR V.S. MEMORYBOUNDEDNESS 1.7GHZ -> 3.5GHZ 20% 15% 10% 5% 0% Memoryboundedness (Cycles of 3.5G/Cycles of 1.7G) 112 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

113 CHIP IDLE POWER MODEL MODEL AND ERROR 113 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

114 CHIP DYNAMIC POWER MODEL EXPERIMENT App threads mapping & Manager CU 0 CU 1 CU 2 CU 3 Core 0 Core 1 Core 2 Core 3 Core 4 Core 5 Core 6 Core 7 App thread App thread App thread App thread Manager L2 Cache L2 Cache L2 Cache L2 Cache L3 Cache & Memory Controller Time Step = 200ms 1st half-step (100ms) 2nd half-step (100ms) power-step (20ms) x10 Manager sleeping Manager working 114 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

115 Validation Error of Dynamic Power FITTING ERROR: DYNAMIC POWER MODEL (LOWER IS BETTER) Error: 10.6% Standard Deviation: 5.8% 18% 16% 14% 12% 10% 8% 6% 4% 2% 0% Average ALL NPB PAR SPE Standard Deviation ALL NPB PAR SPE ALL NPB PAR SPE ALL NPB PAR SPE ALL NPB PAR SPE VF5 VF4 VF3 VF2 VF1 115 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

116 Dynamic Power Prediction Error Across VF States (LOWER IS BETTER) Error: 8.3% Standard Deviation: 6.9% 16% 14% Average Standard Deviation 12% 10% 8% 6% 4% 2% 0% VF1->VF1 VF1->VF2 VF1->VF3 VF1->VF4 VF1->VF5 VF2->VF1 VF2->VF2 VF2->VF3 VF2->VF4 VF2->VF5 VF3->VF1 VF3->VF2 VF3->VF3 VF3->VF4 VF3->VF5 VF4->VF1 VF4->VF2 VF4->VF3 VF4->VF4 VF4->VF5 VF5->VF1 VF5->VF2 VF5->VF3 VF5->VF4 VF5->VF5 116 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

117 THE IMPACT OF POWER GATING 117 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

118 THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle 118 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

119 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 119 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

120 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle P dyn (A)+P idle (Chip) 4CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 120 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

121 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle P dyn (A)+P idle (Chip) 3P dyn (A)+P idle (Chip) 4CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 121 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

122 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle P dyn (A)+P idle (Chip) 3P dyn (A)+P idle (Chip) 2P dyn (A)+P idle (Chip) 4CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 122 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

123 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle P dyn (A)+P idle (Chip) 3P dyn (A)+P idle (Chip) 2P dyn (A)+P idle (Chip) P dyn (A)+P idle (Chip) 4CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 123 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

124 Normalized Chip Power THE IMPACT OF POWER GATING Micro Benchmark: A Dataset is fitted in L1 data cache => No NB activity (dynamic power) N instances of A times the dynamic power of A by N A2C Mapping CU0 C0, C1 CU1 C2, C3 CU2 C4, C5 CU3 C6, C7 4CUs A, - A, - A, - A, - 3CUs A, - A, - A, - Idle 2CUs A, - A, - Idle Idle 1CU A, - Idle Idle Idle Idle Idle Idle Idle Idle P dyn (A)+P idle (Chip) 3P dyn (A)+P idle (Chip) 2P dyn (A)+P idle (Chip) P dyn (A)+P idle (Chip) P idle (Chip) 4CUs 3CUs 2CUs 1CU idle Disable Power Gating Enable Power Gating 124 PPEP : ONLINE PERFORMANCE, POWER, AND ENERGY PREDICTION FRAMEWORK DECEMBER 17, 2014

PowerChop: Identifying and Managing Non-critical Units in Hybrid Processor Architectures

PowerChop: Identifying and Managing Non-critical Units in Hybrid Processor Architectures PowerChop: Identifying and Managing Non-critical Units in Hybrid Processor Architectures Michael A. Laurenzano, Yunqi Zhang, Jiang Chen, Lingjia Tang and Jason Mars Department of Electrical Engineering

More information

Parallelism I: Inside the Core

Parallelism I: Inside the Core Parallelism I: Inside the Core 1 The final Comprehensive Same general format as the Midterm. Review the homeworks, the slides, and the quizzes. 2 Key Points What is wide issue mean? How does does it affect

More information

Practical Resource Management in Power-Constrained, High Performance Computing

Practical Resource Management in Power-Constrained, High Performance Computing Practical Resource Management in Power-Constrained, High Performance Computing Tapasya Patki*, David Lowenthal, Anjana Sasidharan, Matthias Maiterth, Barry Rountree, Martin Schulz, Bronis R. de Supinski

More information

Drowsy Caches Simple Techniques for Reducing Leakage Power Krisztián Flautner Nam Sung Kim Steve Martin David Blaauw Trevor Mudge

Drowsy Caches Simple Techniques for Reducing Leakage Power Krisztián Flautner Nam Sung Kim Steve Martin David Blaauw Trevor Mudge Drowsy Caches Simple Techniques for Reducing Leakage Power Krisztián Flautner Nam Sung Kim Steve Martin David Blaauw Trevor Mudge krisztian.flautner@arm.com kimns@eecs.umich.edu stevenmm@eecs.umich.edu

More information

High Performance Cache Replacement Using Re-Reference Interval Prediction (RRIP)

High Performance Cache Replacement Using Re-Reference Interval Prediction (RRIP) High Performance Cache Replacement Using Re-Reference Interval Prediction (RRIP) 1 T H E A C M I E E E I N T E R N A T I O N A L S Y M P O S I U M O N C O M P U T E R A R C H I T E C T U R E ( I S C A

More information

How Much Power Does your Server Consume? Estimating Wall Socket Power Using RAPL Measurements

How Much Power Does your Server Consume? Estimating Wall Socket Power Using RAPL Measurements How Much Power Does your Server Consume? Estimating Wall Socket Power Using RAPL Measurements Kashif Nizam Khan Zhonghong Ou, Mikael Hirki, Jukka K. Nurminen, Tapio Niemi 1 Motivation The Large Hadron

More information

Easy Weigh ADVANCED COUNTING SCALE MODEL: AC-100 OWNER S MANUAL VER 1.00

Easy Weigh ADVANCED COUNTING SCALE MODEL: AC-100 OWNER S MANUAL VER 1.00 Easy Weigh OWNER S MANUAL MODEL: AC-100 ADVANCED COUNTING SCALE VER 1.00 TABLE OF CONTENTS SPECIFICATIONS... 1 NOMENCLATURE... 1 FUNCTION KEYS... 2 DISPLAY... 3 ACCURACY SPECIFICATIONS... 4 UNPACKING &

More information

UKSM: Swift Memory Deduplication via Hierarchical and Adaptive Memory Region Distilling

UKSM: Swift Memory Deduplication via Hierarchical and Adaptive Memory Region Distilling UKSM: Swift Memory Deduplication via Hierarchical and Adaptive Memory Region Distilling Nai Xia* Chen Tian* Yan Luo + Hang Liu + Xiaoliang Wang* *: Nanjing University +: University of Massachusetts Lowell

More information

AN RPM to TACH Counts Conversion. 1 Preface. 2 Audience. 3 Overview. 4 References

AN RPM to TACH Counts Conversion. 1 Preface. 2 Audience. 3 Overview. 4 References AN 17.4 RPM to TACH Counts Conversion 1 Preface 2 Audience 3 Overview 4 References This application note provides look up tables for the calculation of RPM to TACH Counts for use with the EMC2103, EMC2104,

More information

ATN3580 Series: Fixed Attenuator Pads

ATN3580 Series: Fixed Attenuator Pads DATA SHEET ATN3580 Series: Fixed Attenuator Pads Applications Attenuators Features Specified flat response to 40 GHz Return loss > 15 db Available at 1-10, 12, 15, 20, 30, and 40 db Power handling to 1

More information

Advanced Superscalar Architectures. Speculative and Out-of-Order Execution

Advanced Superscalar Architectures. Speculative and Out-of-Order Execution 6.823, L16--1 Advanced Superscalar Architectures Asanovic Laboratory for Computer Science M.I.T. http://www.csg.lcs.mit.edu/6.823 Speculative and Out-of-Order Execution Branch Prediction kill kill Branch

More information

A Power-Aware, Application-Based, Performance Study Of Moder. Cluster Interconnection Networks

A Power-Aware, Application-Based, Performance Study Of Moder. Cluster Interconnection Networks A Power-Aware, Application-Based, Performance Study Of Modern Commodity Cluster Interconnection Networks Open Systems Lab Indiana University Bloomington, USA CAC 09 - IPDPS 09 Rome, Italy May, 25th 2009

More information

INSTITUTO SUPERIOR TÉCNICO. Architectures for Embedded Computing

INSTITUTO SUPERIOR TÉCNICO. Architectures for Embedded Computing UNIVERSIDADE TÉCNICA DE LISBOA INSTITUTO SUPERIOR TÉCNICO Departamento de Engenharia Informática Architectures for Embedded Computing MEIC-A, MEIC-T, MERC Lecture Slides Version 3.0 - English Lecture 02

More information

ZC706 MIG Design Creation November 2014

ZC706 MIG Design Creation November 2014 ZC706 MIG Design Creation November 2014 XTP244 Revision History Date Version Description 11/24/14 10.0 Regenerated for 2014.4. 10/08/14 9.0 Regenerated for 2014.3. 06/09/14 8.0 Regenerated for 2014.2.

More information

Enhancing Energy Efficiency of Database Applications Using SSDs

Enhancing Energy Efficiency of Database Applications Using SSDs Seminar Energy-Efficient Databases 29.06.2011 Enhancing Energy Efficiency of Database Applications Using SSDs Felix Martin Schuhknecht Motivation vs. Energy-Efficiency Seminar 29.06.2011 Felix Martin Schuhknecht

More information

Fixing the Hyperdrive: Maximizing Rendering Performance on NVIDIA GPUs

Fixing the Hyperdrive: Maximizing Rendering Performance on NVIDIA GPUs Fixing the Hyperdrive: Maximizing Rendering Performance on NVIDIA GPUs Louis Bavoil, Principal Engineer Booth #223 - South Hall www.nvidia.com/gdc Full-Screen Pixel Shader SM TEX L2 DRAM CROP SM = Streaming

More information

Decoupling Loads for Nano-Instruction Set Computers

Decoupling Loads for Nano-Instruction Set Computers Decoupling Loads for Nano-Instruction Set Computers Ziqiang (Patrick) Huang, Andrew Hilton, Benjamin Lee Duke University {ziqiang.huang, andrew.hilton, benjamin.c.lee}@duke.edu ISCA-43, June 21, 2016 1

More information

TEST REPORT. Test Report for Mating Cycle Validation of Mini Circuits QBL Series Quick-Lock Test Cables.

TEST REPORT. Test Report for Mating Cycle Validation of Mini Circuits QBL Series Quick-Lock Test Cables. TITLE: Test Report for Mating Cycle Validation of Mini Circuits QBL Series Quick-Lock Test Cables. 1. Background & Introduction Mini-Circuits has introduced a QBL Series of test cables is based upon an

More information

Lecture 14: Instruction Level Parallelism

Lecture 14: Instruction Level Parallelism Lecture 14: Instruction Level Parallelism Last time Pipelining in the real world Today Control hazards Other pipelines Take QUIZ 10 over P&H 4.10-15, before 11:59pm today Homework 5 due Thursday March

More information

Fiorano ESB 2007 Oracle Enterprise Gateway Integration Guide

Fiorano ESB 2007 Oracle Enterprise Gateway Integration Guide An Oracle White Paper June 2011 Fiorano ESB 2007 Oracle Enterprise Gateway Integration Guide 1 / 25 Disclaimer The following is intended to outline our general product direction. It is intended for information

More information

APPLICATION NOTE. ATAK51003-V1 Quick Start Guide. Atmel ATAN0033. Kit Contents

APPLICATION NOTE. ATAK51003-V1 Quick Start Guide. Atmel ATAN0033. Kit Contents APPLICATION NOTE ATAK51003-V1 Quick Start Guide Atmel ATAN0033 Kit Contents The Atmel ATAK51003-V1 Kit includes the following components: 1 Atmel ATAB5279-V1.2 LF antenna driver 2 LF antenna modules 1

More information

Out-of-order Pipeline. Register Read. OOO execution (2-wide) OOO execution (2-wide) OOO execution (2-wide) OOO execution (2-wide)

Out-of-order Pipeline. Register Read. OOO execution (2-wide) OOO execution (2-wide) OOO execution (2-wide) OOO execution (2-wide) Out-of-order Pipeline Register Read When do instructions read the register file? Fetch Decode Rename Dispatch Buffer of instructions Issue Reg-read Execute Writeback Commit Option #: after select, right

More information

ZC706 MIG Design Creation November 2015

ZC706 MIG Design Creation November 2015 ZC706 MIG Design Creation November 2015 XTP244 Revision History Date Version Description 11/24/15 14.0 Regenerated for 2015.4. 10/06/15 13.0 Regenerated for 2015.3. 06/30/15 12.0 Regenerated for 2015.2.

More information

MACP TB. 23 db Coupler MHz Rev. V1. Features. Functional Schematic. Description. Pin Configuration 4. Ordering Information 1,2

MACP TB. 23 db Coupler MHz Rev. V1. Features. Functional Schematic. Description. Pin Configuration 4. Ordering Information 1,2 23 Coupler Features 23 Coupling Ratio Surface Mount Package Available on Tape and Reel Excellent Temperature Stability RoHS Compliant and Pb Free 260 C Reflow Compatible Description The MACP-0145 is a

More information

Lecture 20: Parallelism ILP to Multicores. James C. Hoe Department of ECE Carnegie Mellon University

Lecture 20: Parallelism ILP to Multicores. James C. Hoe Department of ECE Carnegie Mellon University 18 447 Lecture 20: Parallelism ILP to Multicores James C. Hoe Department of ECE Carnegie Mellon University 18 447 S18 L20 S1, James C. Hoe, CMU/ECE/CALCM, 2018 18 447 S18 L20 S2, James C. Hoe, CMU/ECE/CALCM,

More information

WHITE PAPER. Informatica PowerCenter 8 on HP Integrity Servers: Doubling Performance with Linear Scalability for 64-bit Enterprise Data Integration

WHITE PAPER. Informatica PowerCenter 8 on HP Integrity Servers: Doubling Performance with Linear Scalability for 64-bit Enterprise Data Integration WHITE PAPER Informatica PowerCenter 8 on HP Integrity Servers: Doubling Performance with Linear Scalability for 64-bit Enterprise Data Integration This document contains Confi dential, Proprietary and

More information

CS 152 Computer Architecture and Engineering

CS 152 Computer Architecture and Engineering CS 152 Computer Architecture and Engineering Lecture 23 Synchronization 2006-11-16 John Lazzaro (www.cs.berkeley.edu/~lazzaro) TAs: Udam Saini and Jue Sun www-inst.eecs.berkeley.edu/~cs152/ 1 Last Time:

More information

Code Scheduling & Limitations

Code Scheduling & Limitations This Unit: Static & Dynamic Scheduling CIS 371 Computer Organization and Design Unit 11: Static and Dynamic Scheduling App App App System software Mem CPU I/O Code scheduling To reduce pipeline stalls

More information

8-bit. Application Note. Microcontrollers. AVR601: Atmel Modular Evaluation Kits for Motor Control Applications. 1.

8-bit. Application Note. Microcontrollers. AVR601: Atmel Modular Evaluation Kits for Motor Control Applications. 1. AVR601: Atmel Modular Evaluation Kits for Motor Control Applications 1. Introduction Following the success of the MC100 and MC200 motor control demonstration kits, Atmel is expanding the support for motor

More information

Finite Element Based, FPGA-Implemented Electric Machine Model for Hardware-in-the-Loop (HIL) Simulation

Finite Element Based, FPGA-Implemented Electric Machine Model for Hardware-in-the-Loop (HIL) Simulation Finite Element Based, FPGA-Implemented Electric Machine Model for Hardware-in-the-Loop (HIL) Simulation Leveraging Simulation for Hybrid and Electric Powertrain Design in the Automotive, Presentation Agenda

More information

E-AMOM: AN ENERGY-AWARE MODELING AND OPTIMIZATION METHODOLOGY FOR SCIENTIFIC APPLICATIONS ON MULTICORE SYSTEMS

E-AMOM: AN ENERGY-AWARE MODELING AND OPTIMIZATION METHODOLOGY FOR SCIENTIFIC APPLICATIONS ON MULTICORE SYSTEMS E-AMOM: AN ENERGY-AWARE MODELING AND OPTIMIZATION METHODOLOGY FOR SCIENTIFIC APPLICATIONS ON MULTICORE SYSTEMS A Dissertation by CHARLES WESLEY LIVELY III Submitted to the Office of Graduate Studies of

More information

ARC-H: Adaptive replacement cache management for heterogeneous storage devices

ARC-H: Adaptive replacement cache management for heterogeneous storage devices Journal of Systems Architecture 58 (2012) ARC-H: Adaptive replacement cache management for heterogeneous storage devices Young-Jin Kim, Division of Electrical and Computer Engineering, Ajou University,

More information

Getting the Lead Out December, 2007

Getting the Lead Out December, 2007 Getting the Lead Out December, 2007 Tom DeBonis Assembly & Test Technology Development Technology and Manufacturing Group Summary Intel has removed the lead (Pb) from its manufacturing process across its

More information

TABLE OF CONTENTS. 8.3 Online Sensor Specifications. Series TS (Models TS1, TSP, TSH, TSL,TSF, TSB1, TSB2) According to Electromatic factory procedure

TABLE OF CONTENTS. 8.3 Online Sensor Specifications. Series TS (Models TS1, TSP, TSH, TSL,TSF, TSB1, TSB2) According to Electromatic factory procedure 8.3 Online Sensor Specifications Series TS (Models TS1, TSP, TSH, TSL,TSF, TSB1, TSB2) Calibration: Accuracy: Remainder of range and other calibration material Overload protection Measuring principle Measuring

More information

Cooling System Simulation for Indian Utility Vehicle using COOL3D

Cooling System Simulation for Indian Utility Vehicle using COOL3D Indian GT SUITE Conference 2013 Cooling System Simulation for Indian Utility Vehicle using COOL3D Paper presented by Rishabh Pandey, M&M Gopakishore Gummadi, M&M Copyright 2012 Mahindra & Mahindra Ltd.

More information

TECHNICAL BRIEF Americas

TECHNICAL BRIEF Americas TECHNICAL BRIEF Americas Design Considerations When AC Coupling IQ Micros to Battery-Based Systems Overview AC coupling allows use of Enphase Microinverters with battery-based inverter systems. These applications

More information

China Intelligent Connected Vehicle Technology Roadmap 1

China Intelligent Connected Vehicle Technology Roadmap 1 China Intelligent Connected Vehicle Technology Roadmap 1 Source: 1. China Automotive Engineering Institute, , Oct. 2016 1 Technology Roadmap 1 General

More information

EE Architecture for Highly Electrified Powertrain

EE Architecture for Highly Electrified Powertrain EE Architecture for Highly Electrified Powertrain 2020-2030 M. Gleich, Senior Manager Marketing and Business Development Powertrain - restricted - Context Resources, Pollution, Climate Urbanization Moore

More information

Multi Core Processing in VisionLab

Multi Core Processing in VisionLab Multi Core Processing in Multi Core CPU Processing in 25 August 2014 Copyright 2001 2014 by Van de Loosdrecht Machine Vision BV All rights reserved jaap@vdlmv.nl Overview Introduction Demonstration Automatic

More information

ABB June 19, Slide 1

ABB June 19, Slide 1 Dr Simon Round, Head of Technology Management, MATLAB Conference 2015, Bern Switzerland, 9 June 2015 A Decade of Efficiency Gains Leveraging modern development methods and the rising computational performance-price

More information

Flo-Way. Measure Granular or Powdered Material. Stand Alone or Connect to Other Devices for Blending. Used by Successful Producers World-Wide

Flo-Way. Measure Granular or Powdered Material. Stand Alone or Connect to Other Devices for Blending. Used by Successful Producers World-Wide Flo-Way R Solids Impact Flow Meter Measure Granular or Powdered Material Stand Alone or Connect to Other Devices for Blending Used by Successful Producers World-Wide Visit our website: www.beltwayscales.com

More information

Fluke 438-II Power Quality and Motor Analyzer

Fluke 438-II Power Quality and Motor Analyzer Test Equipment Depot - 800.517.8431-99 Washington Street Melrose, MA 02176 - TestEquipmentDepot.com TECHNICAL DATA Fluke 438-II Power Quality and Motor Analyzer Quickly and easily discover electrical and

More information

Storage and Memory Hierarchy CS165

Storage and Memory Hierarchy CS165 Storage and Memory Hierarchy CS165 What is the memory hierarchy? L1

More information

ARM. series SIGNUM SYSTEMS CORPORATION. ARM Board Setup. User Guide

ARM. series SIGNUM SYSTEMS CORPORATION. ARM Board Setup. User Guide ARM series SIGNUM SYSTEMS CORPORATION ARM Board Setup User Guide C O P Y R I G H T N O T I C E Copyright (c) 2016 by Signum Systems Corporation, an IAR Systems company. All rights are reserved worldwide.

More information

ValveLink SNAP-ON Application

ValveLink SNAP-ON Application AMS Device Manager Product Data Sheet ValveLink SNAP-ON Application Communicate with both HART and Foundation Fieldbus FIELDVUE digital valve controllers in the same application Online, in-service performance

More information

CIS 662: Sample midterm w solutions

CIS 662: Sample midterm w solutions CIS 662: Sample midterm w solutions 1. (40 points) A processor has the following stages in its pipeline: IF ID ALU1 MEM1 MEM2 ALU2 WB. ALU1 stage is used for effective address calculation for loads, stores

More information

The MathWorks Crossover to Model-Based Design

The MathWorks Crossover to Model-Based Design The MathWorks Crossover to Model-Based Design The Ohio State University Kerem Koprubasi, Ph.D. Candidate Mechanical Engineering The 2008 Challenge X Competition Benefits of MathWorks Tools Model-based

More information

TID MEMORY MAPS FOR MONZA SELF-SERIALIZATION

TID MEMORY MAPS FOR MONZA SELF-SERIALIZATION Application Note IMPINJ MONZA TID MEMORY MAPS FOR MONZA SELF-SERIALIZATION Version 3.0 2017, Impinj, Inc. www.impinj.com TABLE OF CONTENTS 1 Introduction... 1 2 Monza Tag Chip Models... 2 2.1 Monza Self-Serialization

More information

Storage-less and converter-less maximum power tracking of photovoltaic cells for a nonvolatile microprocessor

Storage-less and converter-less maximum power tracking of photovoltaic cells for a nonvolatile microprocessor Seoul National University Storage-less and converter-less maximum power tracking of photovoltaic cells for a nonvolatile microprocessor Cong Wang, Naehyuck Chang, Y. Kim, S. Park, Yongpan Liu, Hyung Gyu

More information

Oracle Utilities Customer Care and Billing Release Utility Reference Model Process Customer Request for Net Metering

Oracle Utilities Customer Care and Billing Release Utility Reference Model Process Customer Request for Net Metering Oracle Utilities Customer Care and Billing Release 2.3.1 Utility Reference Model 3.4.3.2 Process Customer Request for Net Metering July 2012 Oracle Utilities Customer Care and Billing Utility Reference

More information

Energy Efficient Content-Addressable Memory

Energy Efficient Content-Addressable Memory Energy Efficient Content-Addressable Memory Advanced Seminar Computer Engineering Institute of Computer Engineering Heidelberg University Fabian Finkeldey 26.01.2016 Fabian Finkeldey, Energy Efficient

More information

Power Consumption Reduction: Hot Spare

Power Consumption Reduction: Hot Spare Power Consumption Reduction: Hot Spare A Dell technical white paper Mark Muccini Wayne Cook Contents Executive summary... 3 Introduction... 3 Traditional power solutions... 3 Hot spare... 5 Hot spare solution...

More information

Computer Architecture: Out-of-Order Execution. Prof. Onur Mutlu (editted by Seth) Carnegie Mellon University

Computer Architecture: Out-of-Order Execution. Prof. Onur Mutlu (editted by Seth) Carnegie Mellon University Computer Architecture: Out-of-Order Execution Prof. Onur Mutlu (editted by Seth) Carnegie Mellon University Reading for Today Smith and Sohi, The Microarchitecture of Superscalar Processors, Proceedings

More information

BLAST CAPACITY ASSESSMENT AND TESTING A-60 OFFSHORE FIRE DOOR

BLAST CAPACITY ASSESSMENT AND TESTING A-60 OFFSHORE FIRE DOOR BLAST CAPACITY ASSESSMENT AND TESTING Final Report December 11, 2008 A-60 OFFSHORE FIRE DOOR Prepared for: JRJ Alum Fab, Inc. Prepared by: Travis J. Holland Michael J. Lowak John R. Montoya BakerRisk Project

More information

Aluminum Electrolytic Capacitors Radial Very Low Impedance

Aluminum Electrolytic Capacitors Radial Very Low Impedance Aluminum Electrolytic Capacitors Radial Very Low Impedance FEATURES Very low impedance and low ESR Very long useful life: 4000 h to 0 000 h at 05 C, very high reliability Excellent ripple current capability

More information

Pipelining A B C D. Readings: Example: Doing the laundry. Ann, Brian, Cathy, & Dave. each have one load of clothes to wash, dry, and fold

Pipelining A B C D. Readings: Example: Doing the laundry. Ann, Brian, Cathy, & Dave. each have one load of clothes to wash, dry, and fold Pipelining Readings: 4.5-4.8 Example: Doing the laundry Ann, Brian, Cathy, & Dave A B C D each have one load of clothes to wash, dry, and fold Washer takes 30 minutes Dryer takes 40 minutes Folder takes

More information

Electro-Proportional Terms and Definitions

Electro-Proportional Terms and Definitions Electro-Proportional Terms and Definitions Valve Deadband The span of operation where there is no flow or pressure output for some specified range of command Hydraulic Valve Gain The characteristic relating

More information

Overcoming Motor Driver Heat-Transfer Challenges in Robotic Applications

Overcoming Motor Driver Heat-Transfer Challenges in Robotic Applications Overcoming Motor Driver Heat-Transfer Challenges in Robotic Applications Jason D. Carr, M.S. and David W. Sundin, Ph.D. Page 1 This paper is protected by copyright law. Individuals may reproduce and distribute

More information

DataCore Virtualization Appliance

DataCore Virtualization Appliance DataCore Virtualization Appliance Fujitsu-DataCore SVA November 2018 The Fujitsu Storage Virtualization Appliance (SVA) is a complete hardware solution for DataCore s SANsymphony SAN virtualization product.

More information

XC95288 In-System Programmable CPLD

XC95288 In-System Programmable CPLD R 0 XC95288 In-System Programmable CPLD 0 5 Product Specification Features 15 ns pin-to-pin logic delays on all pins f CNT to 95 MHz 288 macrocells with 6,400 usable gates Up to 166 user pins 5V in-system

More information

For questions or technical support, 1. Wiring Reference:

For questions or technical support,  1. Wiring Reference: Warning: Before proceeding you are obligated to read and agree to the terms and conditions attached to this manual. Misuse of this product may cause injury or death. Incorrect installation may cause damage

More information

Traffic Logix SafePace 100 Radar Speed Sign Product Specifications Version 2.7

Traffic Logix SafePace 100 Radar Speed Sign Product Specifications Version 2.7 Traffic Logix SafePace 100 Radar Speed Sign Product Specifications Version 2.7 1 Table of Contents Dimensions... 3 Technical Specifications... 4 Communication... 5 Programming... 6 Data Collection and

More information

A Predictive Delay Fault Avoidance Scheme for Coarse Grained Reconfigurable Architecture

A Predictive Delay Fault Avoidance Scheme for Coarse Grained Reconfigurable Architecture A Predictive Fault Avoidance Scheme for Coarse Grained Reconfigurable Architecture Toshihiro Kameda 1 Hiroaki Konoura 1 Dawood Alnajjar 1 Yukio Mitsuyama 2 Masanori Hashimoto 1 Takao Onoye 1 hasimoto@ist.osaka

More information

Tachometer. English. Users Manual

Tachometer. English. Users Manual 931 Tachometer Users Manual English PN 3409877 January 2009 2009 Fluke Corporation, All rights reserved. Printed in Germany. Specifications are subject to change without notice. All product names are trademarks

More information

Advantage Memory Corporation reserves the right to change products and specifications without notice

Advantage Memory Corporation reserves the right to change products and specifications without notice SD872-8X8-72VS4 SDRAM DIMM 8MX72 SDRAM DIMM with ECC based on 8MX8, 4B, 4K Refresh, 3.3V DRAMs with SPD GENERAL DESCRIPTION The Advantage SD872-8X8-72VS4 is a 8MX72 Synchronous Dynamic RAM high-density

More information

L (LK9, LQ8) when use in: Saab 9-3

L (LK9, LQ8) when use in: Saab 9-3 NOTE: Printing this file may require 8.5" x 14" (legal size) paper, depending on your printer setup. Catalytic Converter Monitoring P0420 Time for Rear O2 sensor signal Time for rear O2 to go low. Value

More information

Web Volume Control Model WV220

Web Volume Control Model WV220 WEB CONTROL PRODUCTS User Manual Web Volume Control Model WV220 1 In accordance with Nexen s established policy of constant product improvement, the specifications contained in this manual are subject

More information

Fleet Micro Series Advanced Battery Charging System Dual and Quad Instructional Manual

Fleet Micro Series Advanced Battery Charging System Dual and Quad Instructional Manual Fleet Micro Series Advanced Battery Charging System Dual and Quad Instructional Manual Introduction: Thank you for your choice in purchasing a Core SWX Fleet Micro Charger. The charger is designed to provide

More information

Department of Civil Engineering The University of British Columbia. Nicolas Saunier

Department of Civil Engineering The University of British Columbia. Nicolas Saunier Department of Civil Engineering The University of British Columbia TRUCK SIGNAL PRIORITY Nicolas Saunier Wook Kang Why Truck Priority? Reduce Rd the Cost of Goods Transportation Reduce Red Light Running

More information

Installation Guide High-end dynamic balancing instrumentation Page 1. Installation Guide for. EasyBalance 2.2

Installation Guide High-end dynamic balancing instrumentation Page 1. Installation Guide for. EasyBalance 2.2 High-end dynamic balancing instrumentation Page 1 for EasyBalance 2.2 High-end dynamic balancing instrumentation High-end dynamic balancing instrumentation Page 2 1 Mechanical Modifications There are two

More information

Tracer VMA with AutoReg

Tracer VMA with AutoReg Tracer with AutoReg Flowmeter with Automatic Flow Regulation Operating Instructions RoHS Compliant General The Tracer Flowmeter with AutoReg provides: Analog Flow Output (Selectable 0 to 3.5V, 0 to 5V

More information

Valvoline Fourth-Quarter Fiscal 2016 Earnings Conference Call. November 9, 2016

Valvoline Fourth-Quarter Fiscal 2016 Earnings Conference Call. November 9, 2016 Valvoline Fourth-Quarter Fiscal 2016 Earnings Conference Call November 9, 2016 Forward-Looking Statements This presentation contains forward-looking statements within the meaning of Section 27A of the

More information

Near-Optimal Precharging in High-Performance Nanoscale CMOS Caches

Near-Optimal Precharging in High-Performance Nanoscale CMOS Caches Near-Optimal Precharging in High-Performance Nanoscale CMOS Caches Se-Hyun Yang and Babak Falsafi Computer Architecture Laboratory (CALCM) Carnegie Mellon University {sehyun, babak}@cmu.edu http://www.ece.cmu.edu/~powertap

More information

Catalog Issued 3-03 (PDF Rev )

Catalog Issued 3-03 (PDF Rev ) W6/W9 series Magnetic Hydraulic P& Circuit reakers Features Designed for the international market. UL Recognized (UL77 and UL5), CS ccepted and VDE approved. Ratings to 5 amps. Heavy duty #-32 stud connections.

More information

UBI-2590 HC SMBus (Part No. UBBL13) Battery Specification

UBI-2590 HC SMBus (Part No. UBBL13) Battery Specification UBI-2590 HC SMBus (Part No. UBBL13) Battery Specification Copyright Copyright 2012, Ultralife Corporation. All Rights Reserved. Trademark Information Ultralife, the Ultralife logo design, SmartCircuit

More information

Using SystemVerilog Assertions in Gate-Level Verification Environments

Using SystemVerilog Assertions in Gate-Level Verification Environments Using SystemVerilog Assertions in Gate-Level Verification Environments Mark Litterick (Verification Consultant) mark.litterick@verilab.com 2 Introduction Gate-level simulations why bother? methodology

More information

From Developing Credit Risk Models Using SAS Enterprise Miner and SAS/STAT. Full book available for purchase here.

From Developing Credit Risk Models Using SAS Enterprise Miner and SAS/STAT. Full book available for purchase here. From Developing Credit Risk Models Using SAS Enterprise Miner and SAS/STAT. Full book available for purchase here. About this Book... ix About the Author... xiii Acknowledgments...xv Chapter 1 Introduction...

More information

Service Bulletin Buses

Service Bulletin Buses Prevost Saint-Nicolas, Quebec, Canada Service Bulletin Buses Date Group No. Release Page 6.2010 364 00 1(12) Vehicle Electronic Control Unit (VECU) MID 144, Diagnostic Trouble Code (DTC), Guide Vehicle

More information

Food Service Technology Center

Food Service Technology Center Food Service Technology Center Convotherm C4eT 6.20EB Electric Combination Oven Test Report Application of ASTM Standard Test Method F2861-10 May 2015 Prepared by: Denis Livchak Fisher-Nickel, Inc. Contributors:

More information

Local Memory Bus (LMB) V10 (v1.00a)

Local Memory Bus (LMB) V10 (v1.00a) Local Memory Bus (LMB) V10 (v1.00a) DS445 December 2, 2009 Introduction The LMB V10 module is used as the LMB interconnect for Xilinx FPGA-based embedded processor systems. The LMB is a fast, local bus

More information

TMD90A Dual Input Digital Thermometer

TMD90A Dual Input Digital Thermometer TMD90A Dual Input Digital Thermometer User Manual For detailed specifications and ordering info go to www.testequipmentdepot.com TMD90A Dual Input Thermometer Contents Safety Information 2 Symbols Used

More information

Combiner Manual #OMBINER #OMBINER < 2* )NSTALLATION 7(&+12/ i 62/$5

Combiner Manual #OMBINER #OMBINER < 2* )NSTALLATION 7(&+12/ i 62/$5 Combiner Manual i ii Combiner Manual Combiner Manual Copyright 2004 SMA America, Inc. All rights reserved. All rights reserved. No part of this document may be reproduced, stored in a retrieval system,

More information

Dual-Rail Domino Logic Circuits with PVT Variations in VDSM Technology

Dual-Rail Domino Logic Circuits with PVT Variations in VDSM Technology Dual-Rail Domino Logic Circuits with PVT Variations in VDSM Technology C. H. Balaji 1, E. V. Kishore 2, A. Ramakrishna 3 1 Student, Electronics and Communication Engineering, K L University, Vijayawada,

More information

Linking the Indiana ISTEP+ Assessments to the NWEA MAP Growth Tests. February 2017 Updated November 2017

Linking the Indiana ISTEP+ Assessments to the NWEA MAP Growth Tests. February 2017 Updated November 2017 Linking the Indiana ISTEP+ Assessments to the NWEA MAP Growth Tests February 2017 Updated November 2017 2017 NWEA. All rights reserved. No part of this document may be modified or further distributed without

More information

Model-Based Design and Hardware-in-the-Loop Simulation for Clean Vehicles Bo Chen, Ph.D.

Model-Based Design and Hardware-in-the-Loop Simulation for Clean Vehicles Bo Chen, Ph.D. Model-Based Design and Hardware-in-the-Loop Simulation for Clean Vehicles Bo Chen, Ph.D. Dave House Associate Professor of Mechanical Engineering and Electrical Engineering Department of Mechanical Engineering

More information

128Mb Synchronous DRAM. Features High Performance: Description. REV 1.0 May, 2001 NT5SV32M4CT NT5SV16M8CT NT5SV8M16CT

128Mb Synchronous DRAM. Features High Performance: Description. REV 1.0 May, 2001 NT5SV32M4CT NT5SV16M8CT NT5SV8M16CT Features High Performance: f Clock Frequency -7K 3 CL=2-75B, CL=3-8B, CL=2 Single Pulsed RAS Interface Fully Synchronous to Positive Clock Edge Four Banks controlled by BS0/BS1 (Bank Select) Units 133

More information

ABOT CHASSIS ASSEMBLY GUIDE

ABOT CHASSIS ASSEMBLY GUIDE ABOT CHASSIS ASSEMBLY GUIDE ABOT-0004 01/12 Features Abot assembly Distance sensor and servomotor data Description The guide for assembling the Abot robot chassis Including the basic data Table of Contents

More information

INSTALLATION, MAINTENANCE & SERVICE MANUAL

INSTALLATION, MAINTENANCE & SERVICE MANUAL INSTALLATI, MAINTENANCE & SERVICE MANUAL M151217L Product Specifications 20" (506mm) 3.3 (85mm) 9.8 (250mm) 38" (974mm) 4.2 (106mm) 9.8 (250mm) 48" (1208mm) 4.2 (106mm) 56" (1442mm) 4.2 (106mm) Wiring

More information

Attitude Instructions Ford 6.0 Powerstroke

Attitude Instructions Ford 6.0 Powerstroke Attitude Instructions Ford 6.0 Powerstroke Attitude 2003, Edge Products Incorporated All rights reserved. Edge Products Incorporated 5875 South Adams Avenue Suite 100 Ogden, UT 84405 (801) 476-3343 www.edgeproductsinc.com

More information

WWW.MORETRACTION.COM TMS-5500-SL ELECTRONIC TRACTION CONTROL US PATENT 6,577,944 Other Patents Pending COPYRIGHT NOTICE Copyright 1999-2013 Davis Technologies, LLC. All rights reserved. Information in

More information

Cormorant Engineering

Cormorant Engineering Gas Well Deliquification Workshop Sheraton Hotel, Denver, Colorado February 27 March 2, 20 Selection and Validation of Hydraulic Dewatering Pumps using Predictive Performance Model and Downhole Simulator

More information

Replacement Part has been judged by Mini Circuits Engineering as a suitable replacement to Original Part a. Case Style DQ849 (No Change) Marking MN5A

Replacement Part has been judged by Mini Circuits Engineering as a suitable replacement to Original Part a. Case Style DQ849 (No Change) Marking MN5A REPLACEMENT PART REFERENCE GUIDE, MNA 5+ AN 60 088 ORIGINAL PART: REPLACEMENT PART: MNA 5+ MNA 5A+ Replacement Part has been judged by Mini Circuits Engineering as a suitable replacement to Original Part

More information

CMMI Opens the Gate: The Practical Relationship between CMMI and Stage- Gate Decision Models

CMMI Opens the Gate: The Practical Relationship between CMMI and Stage- Gate Decision Models NDIA CMMI Technology Conference & User Group Denver, CO CMMI Opens the Gate: The Practical Relationship between CMMI and Stage- Gate Decision Models ABB USCRC - 1 Agenda ABB Overview Stage-Gate Process

More information

Advantage Memory Corporation reserves the right to change products and specifications without notice

Advantage Memory Corporation reserves the right to change products and specifications without notice SDRAM SODIMM 4MX64 SDRAM SO DIMM based on 4MX16, 4Banks, 4K Refresh, 3.3V DRAMs with SPD GENERAL DESCRIPTION The Advantage is a 4MX64 Synchronous Dynamic RAM high density memory module. The Advantage consists

More information

Traffic Logix SafePace 100 Radar Speed Sign Product Specifications Version 2.7

Traffic Logix SafePace 100 Radar Speed Sign Product Specifications Version 2.7 Traffic Logix SafePace 100 Radar Speed Sign Product Specifications Version 2.7 1 Table of Contents Dimensions... 3 Dimensions of SafePace 100 Radar Sign with Full YOUR SPEED Sign Face... 4 Dimensions of

More information

Computer Architecture 计算机体系结构. Lecture 3. Instruction-Level Parallelism I 第三讲 指令级并行 I. Chao Li, PhD. 李超博士

Computer Architecture 计算机体系结构. Lecture 3. Instruction-Level Parallelism I 第三讲 指令级并行 I. Chao Li, PhD. 李超博士 Computer Architecture 计算机体系结构 Lecture 3. Instruction-Level Parallelism I 第三讲 指令级并行 I Chao Li, PhD. 李超博士 SJTU-SE346, Spring 2018 Review ISA, micro-architecture, physical design Evolution of ISA CISC vs

More information

INDUSTRIAL CRANE MODERNIZATION THE SMART WAY TO EXTEND OVERHEAD CRANE SERVICE LIFE

INDUSTRIAL CRANE MODERNIZATION THE SMART WAY TO EXTEND OVERHEAD CRANE SERVICE LIFE INDUSTRIAL NUCLEAR PORT HEAVY-DUTY LIFT TRUCKS SERVICE MODERNIZATION SERVICES INDUSTRIAL CRANE MODERNIZATION THE SMART WAY TO EXTEND OVERHEAD CRANE SERVICE LIFE 2 Konecranes Crane Modernizations Crane

More information

Four-Position Gang Charger for SAFEMTX Li-ION Battery Packs

Four-Position Gang Charger for SAFEMTX Li-ION Battery Packs Four-Position Gang Charger for SAFEMTX Li-ION Battery Packs (P/N 10067285) Instruction Manual " WARNING THIS MANUAL MUST BE CAREFULLY READ BY ALL INDIVIDUALS WHO HAVE OR WILL HAVE THE RESPONSIBILITY FOR

More information

UNLEASH THE POWER. See More at: bullydog.com WITH BULLY DOG PERFORMANCE PRODUCTS. Doc.# BD41001 v1.0.3

UNLEASH THE POWER. See More at: bullydog.com WITH BULLY DOG PERFORMANCE PRODUCTS. Doc.# BD41001 v1.0.3 Bully Dog Technologies, LLC is a team built on integrity that is dedicated to leading the vehicle performance industry with an uncompromising code of ethics demonstrated in the soundness of its employees,

More information

Using on-line power modeling for server power capping

Using on-line power modeling for server power capping IBM Research and University of Texas at Austin Using on-line power modeling for server power capping Madhu Saravana Sibi Govindan, Charles Lefurgy, and Ajay Dholakia Workshop on Energy-Efficient Design

More information