CS-5630 / CS-6630 Visualization for Data Science The ...

49
CS-5630 / CS-6630 Visualization for Data Science The Visualization Alphabet: Marks and Channels Alexander Lex [email protected] [xkcd]

Transcript of CS-5630 / CS-6630 Visualization for Data Science The ...

Page 1: CS-5630 / CS-6630 Visualization for Data Science The ...

CS-5630 / CS-6630 Visualization for Data Science

The Visualization Alphabet: Marks and Channels

Alexander [email protected]

[xkcd]

Page 2: CS-5630 / CS-6630 Visualization for Data Science The ...

How can I visually represent two numbers, e.g., 4 and 8

Page 3: CS-5630 / CS-6630 Visualization for Data Science The ...

Marks & Channels

Marks: represent items or linksChannels: change appearance based on attributeChannel = Visual Variable

Page 4: CS-5630 / CS-6630 Visualization for Data Science The ...

Marks for ItemsBasic geometric elements

3D mark: Volume, but rarely used

0D 2D1D

Page 5: CS-5630 / CS-6630 Visualization for Data Science The ...

Marks for Links

Marks as Items/Nodes

Marks as Links

Points Lines Areas

Containment Connection

Page 6: CS-5630 / CS-6630 Visualization for Data Science The ...

Containment can be nested

[Riche & Dwyer, 2010]

Page 7: CS-5630 / CS-6630 Visualization for Data Science The ...

Channels (aka Visual Variables)

Control appearanceproportional to orbased on attributes

Page 8: CS-5630 / CS-6630 Visualization for Data Science The ...

Jacques Bertin

French cartographer [1918-2010]Semiology of Graphics [1967]Theoretical principles for visual encodings

Page 9: CS-5630 / CS-6630 Visualization for Data Science The ...

Bertin’s Visual Variables

Semiology of Graphics [J. Bertin, 67]

Points Lines AreasMarks:

PositionSize(Grey)Value

TextureColorOrientationShape

Page 10: CS-5630 / CS-6630 Visualization for Data Science The ...

Using Marks and Channels

Mark: Line Channel: Length, Position1 quantitative attribute

Adding Hue+1 categorical attr.

Adding Size+1 quantitative attr.

Mark: PointChannel: Position2 quantitative attr.

Page 11: CS-5630 / CS-6630 Visualization for Data Science The ...

Redundant encoding

Length, Position and Value

Page 12: CS-5630 / CS-6630 Visualization for Data Science The ...

Good bar chart?

Rule: Use channel proportional to data!https://twitter.com/ChaseThomason/status/1118478036507164672?s=19

Page 13: CS-5630 / CS-6630 Visualization for Data Science The ...

Types of Channels

Identity ChannelsWhat?ShapeColor (hue)Spatial region …

Magnitude ChannelsHow much? Which Rank?PositionLengthSaturation …

Categorical DataOrdinal & Quantitative Data

Page 14: CS-5630 / CS-6630 Visualization for Data Science The ...

Magnitude Channels: Ordered Attributes Identity Channels: Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tilt/angle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

Channels: Expressiveness Types and E!ectiveness Ranks

Page 16: CS-5630 / CS-6630 Visualization for Data Science The ...

Characteristics of ChannelsSelective

Is a mark distinct from other marks? Can we make out the difference between two marks?

AssociativeDoes it support grouping?

Quantitative (Magnitude vs Identity Channels)Can we quantify the difference between two marks?

Page 17: CS-5630 / CS-6630 Visualization for Data Science The ...

Characteristics of Channels

Order (Magnitude vs Identity)Can we see a change in order?

LengthHow many unique marks can we make?

Page 18: CS-5630 / CS-6630 Visualization for Data Science The ...

Position

Strongest visual variableSuitable for all data typesProblems:

Sometimes not available (spatial data)Cluttering

Selective: yesAssociative: yesQuantitative: yesOrder: yesLength: fairly big

Page 20: CS-5630 / CS-6630 Visualization for Data Science The ...

Position in 3D?

[Spotfire]

Page 21: CS-5630 / CS-6630 Visualization for Data Science The ...

Length & Size

Good for 1D, OK for 2D, Bad for 3DEasy to see whether one is biggerAligned bars use position redundantlyFor 1D length:Selective: yesAssociative: yesQuantitative: yesOrder: yesLength: high

Page 23: CS-5630 / CS-6630 Visualization for Data Science The ...

Value/Luminance/Saturation

OK for quantitative data when length & size are used.Not very many shades recognizable

Selective: yesAssociative: yesQuantitative: somewhat (with problems)Order: yesLength: limited

Page 25: CS-5630 / CS-6630 Visualization for Data Science The ...

Color

Good for qualitative data (identity channel)Limited number of classes/length (~7-10!)Does not work for quantitative data!Lots of pitfalls! Be careful!My rule:

minimize color use for encoding datause for brushing

Selective: yesAssociative: yesQuantitative: noOrder: noLength: limited

< <?????

Page 26: CS-5630 / CS-6630 Visualization for Data Science The ...

Cliff Mass

Color: Bad Example

Page 28: CS-5630 / CS-6630 Visualization for Data Science The ...

Shape

Great to recognize many classes.No grouping, ordering.

Selective: yesAssociative: limitedQuantitative: noOrder: noLength: vast

< <?????

Page 30: CS-5630 / CS-6630 Visualization for Data Science The ...

Chernoff Faces

Idea: use facial parameters to map quantitative data

Critique: https://eagereyes.org/criticism/chernoff-faces

Does it work?Not really!

Page 31: CS-5630 / CS-6630 Visualization for Data Science The ...

More Channels

Page 32: CS-5630 / CS-6630 Visualization for Data Science The ...

Why are quantitative channels different?

S = sensationI = intensity

Page 33: CS-5630 / CS-6630 Visualization for Data Science The ...

Steven’s Power Law, 1961

From Wilkinson 99, based on Stevens 61

Electric

Page 34: CS-5630 / CS-6630 Visualization for Data Science The ...

How much longer?

A

B

2x

Page 35: CS-5630 / CS-6630 Visualization for Data Science The ...

How much longer?

A

B

4x

Page 36: CS-5630 / CS-6630 Visualization for Data Science The ...

How much steeper?

A B

~4x

Page 37: CS-5630 / CS-6630 Visualization for Data Science The ...

How much larger?

A B

5x

Page 38: CS-5630 / CS-6630 Visualization for Data Science The ...

How much larger?

A B

2xdiameter4x areaarea is proportional to diameter squared

Page 39: CS-5630 / CS-6630 Visualization for Data Science The ...

How much larger (area)?

A B

3x

Page 40: CS-5630 / CS-6630 Visualization for Data Science The ...

How much darker?

A B

2x

Page 41: CS-5630 / CS-6630 Visualization for Data Science The ...

How much darker?

A B

3x

Page 42: CS-5630 / CS-6630 Visualization for Data Science The ...

Position, Length & Angle

Page 43: CS-5630 / CS-6630 Visualization for Data Science The ...

Other Factors Affecting Accuracy

AlignmentDistractorsDistanceCommon scale…

A B

Unframed Aligned

Framed Unaligned

AB

AB

Unframed Unaligned

VS VS VS

Page 44: CS-5630 / CS-6630 Visualization for Data Science The ...

Cleveland / McGill, 1984

William S. Cleveland; Robert McGill , “Graphical Perception: Theory, Experimentation, and Application to the Development of Graphical Methods.” 1984

Page 46: CS-5630 / CS-6630 Visualization for Data Science The ...

Positions

Rectangular areas

(aligned or in a treemap)

Angles

Circular areas

Cleveland & McGill’s Results

Crowdsourced Results

1.0 3.01.5 2.52.0Log Error

1.0 3.01.5 2.52.0Log Error

Log Error = log2(judged percent - true percent + 1/8)

Page 47: CS-5630 / CS-6630 Visualization for Data Science The ...

[Mackinlay, Automating the Design of Graphical Presentations of Relational Information, 1986]

Jock Mackinlay, 1986D

ecre

asin

g

Page 48: CS-5630 / CS-6630 Visualization for Data Science The ...

Magnitude Channels: Ordered Attributes Identity Channels: Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tilt/angle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

Channels: Expressiveness Types and E!ectiveness Ranks

Page 49: CS-5630 / CS-6630 Visualization for Data Science The ...

Separability of Attributes

Can we combine multiple visual variables?

T. Munzner, Visualization Analysis and Design, 2014