TUE, a new look at PUE - Lawrence Livermore National ...ย ยท PUE ๐‘ƒ = ๐ผ J Q โ€ข Introduced in...

Post on 18-Jun-2020

3 views 0 download

Transcript of TUE, a new look at PUE - Lawrence Livermore National ...ย ยท PUE ๐‘ƒ = ๐ผ J Q โ€ข Introduced in...

1

TUE, a new look at PUE

Michael K Patterson, PhD, PE, DCEP

Principal Engineer, TCG Systems Architecture and Pathfinding

November 17, 2013

2

This presentation adapted from our presentation at ISC 13โ€ฆ

TUE, a new energy-efficiency metric applied at ORNL's Jaguar

David J Martinez

William F Tschudi Henry Coles

Stephen W Poole Chung-Hsing Hsu Don Maxwell

Natalie J Bates

3

ISC 2013 Best Paper Award

4

Motivation

HPC Cluster and Data Center energy use is a challenge;

possibly constraining Industry growth

We canโ€™t manage what we donโ€™t measure

Metrics allow tracking and trending of our performance and

comparison to others

PUE has helped but has limitations, what is the next step

towards better energy efficiency?

5

Overview

PUE Definition and Development

Issues with PUE

Defining new metrics

ITUE: IT-power usage effectiveness

TUE: total-power usage effectiveness

Metrics demonstration and example

Case Study at Jaguar

BoF

6

PUE

๐‘ƒ๐‘ˆ๐ธ =๐‘‡๐‘œ๐‘ก๐‘Ž๐‘™ ๐ท๐‘Ž๐‘ก๐‘Ž ๐ถ๐‘’๐‘›๐‘ก๐‘’๐‘Ÿ ๐ด๐‘›๐‘›๐‘ข๐‘Ž๐‘™ ๐ธ๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ

๐‘‡๐‘œ๐‘ก๐‘Ž๐‘™ ๐ผ๐‘‡ ๐ด๐‘›๐‘›๐‘ข๐‘Ž๐‘™ ๐ธ๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ

โ€ข Introduced in 2006 by Malone and Belady

โ€ข Developed and agreed to by EU Code of Conduct, DOE, EPA,

Green Grid, ASHRAE, etcโ€ฆ

โ€ข Has led Energy Efficiency drive in Data Centers

โ€ข PUE Average in 2007 ~ 2.5

โ€ข Best in Class 2013:

NREL= 1.06, LRZ= 1.15, NCAR~1.2,

ORNL= 1.25, TU Dresden < 1.3

7

PUE Definition

8

but PUE isn't perfect, considerโ€ฆ..

data center

IT fan

fan

๐‘ƒ๐‘ˆ๐ธ =๐‘๐‘ค๐‘Ÿ + ๐‘“๐‘Ž๐‘›๐ท๐ถ + (๐ผ๐‘‡ + ๐‘“๐‘Ž๐‘›๐ผ๐‘‡)

(๐ผ๐‘‡ + ๐‘“๐‘Ž๐‘›๐ผ๐‘‡)

UPS & PDU

pwr

9

Three variationsโ€ฆ

a) both fans

b) IT

fans only

c)

bldg fan only

๐‘ƒ๐‘ˆ๐ธ๐‘Ž =๐‘๐‘ค๐‘Ÿ + ๐‘“๐‘Ž๐‘›๐ท๐ถ + (๐ผ๐‘‡ + ๐‘“๐‘Ž๐‘›๐ผ๐‘‡)

(๐ผ๐‘‡ + ๐‘“๐‘Ž๐‘›๐ผ๐‘‡)

๐‘ƒ๐‘ˆ๐ธ๐‘ =๐‘๐‘ค๐‘Ÿ + (๐ผ๐‘‡ + ๐‘“๐‘Ž๐‘›๐ผ๐‘‡)

(๐ผ๐‘‡ + ๐‘“๐‘Ž๐‘›๐ผ๐‘‡)

๐‘ƒ๐‘ˆ๐ธ๐‘ =๐‘๐‘ค๐‘Ÿ + ๐‘“๐‘Ž๐‘›๐ท๐ถ + ๐ผ๐‘‡

๐ผ๐‘‡

PUEb < PUEa < PUEc but is (b) best? We donโ€™t knowโ€ฆ.

10

Can we define a โ€œserver-PUEโ€? Maybe ITUE?

Data Center Server

Power dist losses UPS, line losses, PDUs PSU, VRs, board losses

Cooling losses Chiller, CRAC, Pumps, Fans Fans, Pumps

Misc losses Security, Lighting, Building Control

Indicators, Platform Control

IT Servers, Storage, Network Processor, Memory, Disk

๐‘ƒ๐‘ˆ๐ธ =๐‘‡๐‘œ๐‘ก๐‘Ž๐‘™ ๐ธ๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ

๐ผ๐‘‡ ๐ธ๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ=

๐‘ƒ๐‘ค๐‘Ÿ + ๐ถ๐‘œ๐‘œ๐‘™๐‘–๐‘›๐‘” + ๐‘€๐‘–๐‘ ๐‘ + ๐ผ๐‘‡

๐ผ๐‘‡=

๐ผ๐‘›๐‘“๐‘Ÿ๐‘Ž๐‘ ๐‘ก๐‘Ÿ๐‘ข๐‘๐‘ก๐‘ข๐‘Ÿ๐‘’ ๐ต๐‘ข๐‘Ÿ๐‘‘๐‘’๐‘› + ๐ผ๐‘‡

๐ผ๐‘‡

๐ผ๐‘‡๐‘ˆ๐ธ =๐ผ๐‘›๐‘“๐‘Ÿ๐‘Ž๐‘ ๐‘ก๐‘Ÿ๐‘ข๐‘๐‘ก๐‘ข๐‘Ÿ๐‘’ ๐ต๐‘ข๐‘Ÿ๐‘‘๐‘’๐‘› + ๐ถ๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’

๐ถ๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’=

๐‘ƒ๐‘ค๐‘Ÿ + ๐ถ๐‘œ๐‘œ๐‘™๐‘–๐‘›๐‘” + ๐‘€๐‘–๐‘ ๐‘ + ๐ถ๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’

๐ถ๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’

ITUE = ๐‘‡๐‘œ๐‘ก๐‘Ž๐‘™ ๐ธ๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ ๐‘–๐‘›๐‘ก๐‘œ ๐‘กโ„Ž๐‘’ ๐ผ๐‘‡ ๐ธ๐‘ž๐‘ข๐‘–๐‘๐‘š๐‘’๐‘›๐‘ก

๐‘‡๐‘œ๐‘ก๐‘Ž๐‘™ ๐ธ๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ ๐‘–๐‘›๐‘ก๐‘œ ๐‘กโ„Ž๐‘’ ๐ถ๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’ ๐ถ๐‘œ๐‘š๐‘๐‘œ๐‘›๐‘’๐‘›๐‘ก๐‘ 

11

ITUE

Wall

Cooling

PSU VRs

CPU/Mem/Drive

(f)

(j)

(i)(h)(g)

๐ผ๐‘‡๐‘ˆ๐ธ = ๐‘ก๐‘œ๐‘ก๐‘Ž๐‘™ ๐‘’๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ ๐‘–๐‘›๐‘ก๐‘œ ๐‘กโ„Ž๐‘’ ๐ผ๐‘‡ ๐‘’๐‘ž๐‘ข๐‘–๐‘๐‘š๐‘’๐‘›๐‘ก

๐‘ก๐‘œ๐‘ก๐‘Ž๐‘™ ๐‘’๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ ๐‘–๐‘›๐‘ก๐‘œ ๐‘กโ„Ž๐‘’ ๐‘๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’ ๐‘๐‘œ๐‘š๐‘๐‘œ๐‘›๐‘’๐‘›๐‘ก๐‘ =

๐‘”

๐‘–

12

The next stepโ€ฆ

PUE and ITUE are both:

โ€ข dimensionless ratios

โ€ข Represent the burden or โ€œtaxโ€ of infrastructure

โ€ข โ€œ1โ€ is ideal, values larger than 1 are worse

โ€ข Values less than 1 are not allowed

โ€ข So why not:

๐‘‡๐‘ˆ๐ธ = ๐‘ƒ๐‘ˆ๐ธ ๐‘ฅ ๐ผ๐‘‡๐‘ˆ๐ธ

13

TUE

๐‘ƒ๐‘ˆ๐ธ =๐‘‡๐‘œ๐‘ก๐‘Ž๐‘™ ๐ธ๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ

๐ผ๐‘‡ ๐ธ๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ=

๐‘Ž + ๐‘

๐‘‘ ๐ผ๐‘‡๐‘ˆ๐ธ =

๐‘‡๐‘œ๐‘ก๐‘Ž๐‘™ ๐ธ๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ

๐ถ๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’ ๐ธ๐‘›๐‘’๐‘Ÿ๐‘”๐‘ฆ=

๐‘”

๐‘–

๐‘‡๐‘ˆ๐ธ = ๐ผ๐‘‡๐‘ˆ๐ธ ร— ๐‘ƒ๐‘ˆ๐ธ = ๐‘Ž + ๐‘

๐‘–

14

Does it work?

a) both fans

b) IT

fans only

c)

bldg fan only

๐‘‡๐‘ˆ๐ธ๐‘Ž =๐‘๐‘ค๐‘Ÿ + ๐‘“๐‘Ž๐‘›๐ท๐ถ + ๐‘“๐‘Ž๐‘›๐ผ๐‘‡ + ๐‘๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’

๐‘๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’

The lowest TUE yields the lowest energy use. Yes, it works!

๐‘‡๐‘ˆ๐ธ๐‘ =๐‘๐‘ค๐‘Ÿ + ๐‘“๐‘Ž๐‘›๐ผ๐‘‡ + ๐‘๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’

๐‘๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’

๐‘‡๐‘ˆ๐ธ๐‘ =๐‘๐‘ค๐‘Ÿ + ๐‘“๐‘Ž๐‘›๐ท๐ถ + ๐‘๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’

๐‘๐‘œ๐‘š๐‘๐‘ข๐‘ก๐‘’

15

Demonstration of the Metrics

Two Data Centers (a) & (b), identical infrastructure, each w/ PUE = 1.6

Major refresh, Installing 10,000 new servers

DCa getting economy servers, DCb w/ high efficiency models

a) Economy (W) b) High Eff (W)

Proc, Mem, Stor 198 198

PSU 58 18

VRs 56 38

Fans 18 12

Total Platform 330 266

After the refresh, DCb will have a worse PUE

16

Example Data Center ITUE

๐ผ๐‘‡๐‘ˆ๐ธ๐‘Ž =18 + 58 + 56 + 198

198= 1.67

With โ€œeconomyโ€ hardware:

๐ผ๐‘‡๐‘ˆ๐ธ =Fan + PSU + VRs + Compute

Compute

๐ผ๐‘‡๐‘ˆ๐ธ๐‘ =12 + 18 + 38 + 198

198= 1.34

With โ€œhigh efficiencyโ€ hardware:

ITUE appropriately reflects the more efficient hardware

17

Summary of the Metrics

a) Economy b) High Eff

Total Platform 3.31 MW 2.67 MW

Infrastructure 1.99 MW 1.99 MW

Total Site Power 5.3 MW 4.66 MW

PUE 1.6 1.74

ITUE 1.67 1.34

TUE 2.67 2.33

The combination of the three

metrics paints the whole picture

Managed by UT-Battelle for the U. S. Department of Energy

Jaguar โ€“ TUE Case Study

Source: J. Rogers, CUG 2009.

Managed by UT-Battelle for the U. S. Department of Energy

Power Monitoring in Jaguar

92% 84%

Managed by UT-Battelle for the U. S. Department of Energy

67.24%

12.81%

6.96%

9.97%

3.03%

Compute

IBC+POLLoss

PSULoss

Blower

XDP

The ITUE of Jaguar for 2011-01

ITUE = 1.49 PUE = 1.25 TUE = 1.86

21

Want more?

Thank You. Questions?

22