Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and...

27
Presented by: Data Visualization Jeffrey A. Shaffer Vice President, IT and Analytics, Unifund Adjunct Professor, University of Cincinnati Visualization and Data Presentation www.wayfind.com/napoleon.html Visualization and Data Presentation On Minard’s graph: “This is one of the worse graphs ever made. He [Tufte]'s very happy because it shows five different pieces of information on three axis and if you study it for fifteen minutes it really is worth a thousand words. I don't think that's what graphs are for. I think they try to make a point in two seconds for people who are too lazy to read the forty words underneath.” - Seth Godin

Transcript of Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and...

Page 1: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Presented by:

Data Visualization

Jeffrey A. Shaffer

Vice President, IT and Analytics, Unifund

Adjunct Professor, University of Cincinnati

Visualization and Data Presentation

www.wayfind.com/napoleon.html

Visualization and Data PresentationOn Minard’s graph:

“This is one of the worse graphs ever made. He [Tufte]'s very happy because it shows five different pieces of information on three axis and if you study it for fifteen minutes it really is worth a thousand words. I don't think that's what graphs are for. I think they try to make a point in two seconds for people who are too lazy to read the forty words underneath.”

- Seth Godin

Page 2: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

“Dumbing it down for the executive”

Recreation of Jorge Camoes Redesign (www.ExcelCharts.com)

Visualization and Data Presentation

Image retrieved from the Library of Congress. http://hdl.loc.gov/loc.gmd/g3201e.ct000242

Drawing of the approximate representation of world emigration for the year 1858 by Charles Minard, published in 1862

Early 19th Century: Charles Minard Visualization and Data Presentation

Every Few Months

Never

Every Few Weeks

Every Few Days

Daily

Default in Excel

Page 3: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

42%

29%

19%

8%

Every Few Months

Never

Every Few Weeks

Every Few Days

Daily 42%

29%

19%

8%

2%

Every Few Months Never Every Few Weeks Every Few Days Daily

Frequency

42%

29%

19%

8%

2%

Every Few Months

Never

Every Few Weeks

Every Few Days

Daily

Frequency

TargetEvery Few

MonthsNever

Every FewWeeks

Every FewDays

Daily

Frequency 42% 29% 19% 8% 2%

Cumulative % 42% 71% 90% 98% 100%

71%

0%

20%

40%

60%

80%

100%

0%

10%

20%

30%

40%

50%

Cumulative %

Frequency71% of occurrences are caused by low frequency

Time Utilization Chart Visualization and Data Presentation

The KISS Principle:

Keep it Simple, Stupid!

“It seems that perfection is reached not when there is nothing to add, but when there is nothing left to take away.”

- Antoine de Saint Exupéry

Page 4: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

“When we understand that slide, we’ll have won the war.” -General Stanley McChrystal

Politics in the Middle East

Page 5: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Clear - easily seen; sharply defined

• who's the audience? what's the message?

• clarity more important than aesthetics

Clean - thorough; complete; unadulterated

• labels, axis, gridlines, formatting, right chart type, color choice, etc.

Concise - brief but comprehensive

• not minimalist but not verbose

Captivating - to attract and hold by beauty or excellence

• does it capture attention? is it interesting? does it tell the story?

The Shaffer 4 C’s of Data Visualization

Visualization and Data Presentation

Figure out the messageThen apply the tools

Figure out the messageThen apply the tools

85%

15%

Percentage of Chart which Looks like PacMan

Looks Like PacMan

Does Not Look LikePacMan

Page 6: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Looks like PacmanDoes not look like

Pacman

% 15% 85%

0%

25%

50%

75%

100%% of Chart that looks like Pacman

0%

1/3

2/3

Rectangle?

0%

25%

50%

75%

100%

Full Container 3/4 Container 1/2 Container 1/4 Container

Ingeniousness

Amount of Pringles Left

How Ingenious a Pringles Can is:

This is

Awesome!

Hmm, this

is getting

awkward.

Crap, my

hand got

stuck!

@#$% it,

I'll just

drink them.

0%

25%

50%

75%

100%

Full Container 3/4 Container 1/2 Container 1/4 Container

Ingeniousness

Amount of Pringles Left

How Ingenious a Pringles Can is:

This is

Awesome!

Hmm, this

is getting

awkward.

Crap, my

hand got

stuck!

@#$% it,

I'll just

drink them.

Source: www.HamiltonCountyAuditor.org

Page 7: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

PacMan Eating Skittles

What is the message?

$500.07

$31.55

Food expenditure for one week

Germany Ecuador

Germany Ecuador

$500.07 $31.55

$0.00

$200.00

$400.00

$600.00

Food expenditure for one week

$500.07

$31.55

Germany

Ecuador

Food expenditure for one week

Germany: The Melander family of BargteheideFood expenditure for one week: $500.07 Source: “Hungry Planet”

By Peter Menzel and Faith D’Aluisio

Page 8: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Ecuador: The Ayme family of TingoFood expenditure for one week: $31.55 Source: “Hungry Planet”

By Peter Menzel and Faith D’Aluisio

Elle and Nina

Elle Nina

Stretching our thinking about data…

…what about this?

Stretching our thinking about data…

…what about this?

Page 9: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

A Data Visualizationof your checking account

Checking Acct #123456789: $843.62

Checking Acct #123456789: -$43.17

Current Balance

Visualization and Data Presentation

Sparklines (Tufte):

Visualization and Data Presentation

Sparklines:

• Small, high-resolution graphics embedded in a context of words, numbers, images. Sparklines are data-intense, design-simple, word-sized graphics. (New in Excel 2010)

Visualization and Data Presentation

Sparklines:

Page 10: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Visualization and Data Presentation

Sparktweets

Latest Developments

Bandlines and Sparkstrips

Invented by Stephen Few (January 2013)

Visualization and Data Presentation

Trellis Charts (Small Multiples):

• "Illustrations of postage-stamp size are indexed by category or a label, sequenced over time like the frames of a movie, or ordered by a quantitative variable not used in the single image itself."

Visualization and Data Presentation

Box and Whiskers or Box Plot (Tukey):

Visualization and Data Presentation

Box and Whiskers or Box Plot (Tukey):

Page 11: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Box Plot vs. PDF

Page 12: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Visualization and Data Presentation

Common Mistakes:

• Chance is measured as %

– there is a 10% chance

• Probability is a number between 0 and 1

– The probability is .3

• Correlation is a number between -1 and 1

• Principal ($) not Principle (Think PayPal)

• Judgment and Management

• We pay $.035 for the widget not 3.5%

• Number Formatting and Rounding

(Excel example)

Color Vision Deficiency (CVD)aka Colorblind

Page 13: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Color Blind

Protanomaly (1 out of 100 males) – “red weak”

Deuteranomaly (5 out of 100 males) – “green weak”

Dichromasy (2 out of 100 males) -no perceptible difference between red, orange, yellow, and green

Protanopia (1 out of 100 males) -use light/dark

Deuteranopia (one out of 100 males) – “green blind”

= 10% of males are Color Blind

Main issue is Red and Green

Color

Page 14: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Bucket A (2.6%) Bucket B (4.2%) Bucket C (93.2%)

Paying 22.53% 19.93% 14.95%

On-time 51.27% 27.42% 71.01%

Late 18.63% 14.60% 1.63%

Stale 7.57% 38.05% 12.41%

0%

25%

50%

75%

% of Accts

VisCheckwww.vischeck.com

Deuteranope Simulation

VisCheckwww.vischeck.com

Protanope Simulation

Protanope Simulation

Bucket A (2.6%) Bucket B (4.2%) Bucket C (93.2%)

Paying 22.53% 19.93% 14.95%

On-time 51.27% 27.42% 71.01%

Late 18.63% 14.60% 1.63%

Stale 7.57% 38.05% 12.41%

0%

25%

50%

75%

% of Accts

Protanope Simulation

Page 15: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Created by:

Steve Wexler

www.datarevelations.com

Example in Practice

Page 16: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Compare and Contrast

Visualization and Data Presentation

Showing performance vs. target (ex. actual to forecast)

• Bar Chart with Target line

– Easy to build

– Good when single target for all categories

Bullet Graph – What is it?

Not this kind of chart…

Bullet Graph:

Target A

Target B

Target C

Invented by Stephen Few

Visualization and Data Presentation

Two Variations on the Bullet Graph:

• Remove performance bands

• Now it’s simply actual to target

• Easier to understand

• Dual Axis Bullet Graph

– Shows both value and %

Page 17: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Visualization and Data Presentation

Geospatial:

• Location can relate disparate data

• A Map is worth a thousand cells

• Easy to see problem areas

• By State, County, Zip, Block

• Even down to Address

Page 18: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Foreclosures (July 2008)

Source: RealtyTrac (www.RealtyTrac.com)

Page 19: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Source: Fox News (http://foxnewsinsider.com/2012/09/14/breaking-

university-of-texas-in-austin-orders-evacuation-due-to-threats-on-campus/)

Breaking News: Missouri invades Arkansas,Mississippi and Alabama Land Swap

Pharmacy Growth?

What the data actually shows:

2006 2007 2008 2009 2010 2011

77.8% 79.0% 79.5% 79.7% 80.0% 80.0%

Pharmacies 1,920 1,964 1,973 1,967 1,966 1,948

Stores 2,468 2,486 2,481 2,468 2,458 2,435

The Real Story

Page 20: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

The Real Story What is the real message?

1. From 2007 to 2010 - closed stores that don’t have pharmacies thus increasing the % of stores with a pharmacy.

2. Increased Pharmacy Sales from $6.5B to $7.3B with fewer stores and fewer pharmacies

They missed a very compelling Story12% increase in Sales with Lower Costs!

Putting it all together

Our Tableau Server

Operations Dashboard

Page 21: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context
Page 22: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context
Page 23: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

2008 Feltron Reporthttp://feltron.com

2007 Feltron Reporthttp://feltron.com

Page 24: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Compare and Contrast

Healthcare Examples

Page 25: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Compare and Contrast

Page 26: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

Compare and Contrast

Page 27: Data Visualization · Visualization and Data Presentation Sparklines (Tufte): Visualization and Data Presentation Sparklines: •Small, high-resolution graphics embedded in a context

CitationsFrom Rome Reborn: The Vatican Library & Renaissance Culture, Library of Congress website. Accessed on 2/18/2012. http://www.loc.gov/exhibits/vatican/math.html

Few, Stephen (2007), Data Visualization: Past, Present, and Future, p.3

Wikipedia entry René Descartes, used under the Creative Commons-Share Alike 3.0 Unported license. Accessed on 2/18/2012. http://en.wikipedia.org/wiki/Rene_descartes

Wikipedia entry Cartesian Coordinate System, used under the Creative Commons-Share Alike 3.0 Unported license. Accessed on 2/18/2012 http://en.wikipedia.org/wiki/Cartesian_coordinate_system

Wikipedia entry William Playfair, used under the Creative Commons-Share Alike 3.0 Unported license. Accessed on 2/18/2012 http://en.wikipedia.org/wiki/William_Playfair

Tufte, Edward R. (2001), The Visual Display of Quantitative Information, Second Edition, p. 9

The Oxford Dictionary of National Biography, ©2004-12 Oxford University Press. Retrieved on 2/18/2012. http://www.oxforddnb.com/view/printable/22370

Friendly, Michael (2001), Gallery of Data Visualization, Electronic document, http://www.datavis.ca/gallery/, Accessed: 02/19/2012 22:20:39

Tufte, Edward R. (2001), The Visual Display of Quantitative Information, Second Edition, p. 40

Bertin, Jacques (1983), Semiology of Graphics, English translation by William J. Berg, pp. 42, 65

Wikipedia entry Exploratory Data Analysis, used under the Creative Commons-Share Alike 3.0 Unported license. Accessed on 2/19/2012.http://http://en.wikipedia.org/wiki/Exploratory_data_analysis

Wikipedia entry Edward Tufte, used under the Creative Commons-Share Alike 3.0 Unported license. Accessed on 2/19/2012. http://en.wikipedia.org/wiki/Edward_Tufte

Make sure to connect online and stay in touch:

Follow on Twitter:@HighVizAbility

Connect with us on LinkedIn:www.linkedin.com/in/jeffreyshaffer/

Websites:www.dataplusscience.com/workshop.htmlwww.dataplusscience.com/http://makingdatameaningful.com/

Facebook:https://www.facebook.com/DataPlusScience

Email:[email protected]