data science: past, present, and future

111
data science: past/present/future 1962-2062 [email protected] @chrishwiggins references: http://bit.ly/datascience-links 1 hackNYDS.key - Thursday:June.18

Transcript of data science: past, present, and future

Page 1: data science: past, present, and future

data science: past/present/future

1962-2062

[email protected] @chrishwiggins

references: http://bit.ly/datascience-links

1 hackNYDS.key - Thursday:June.18

Page 2: data science: past, present, and future

2 hackNYDS.key - Thursday:June.18

Page 3: data science: past, present, and future

3 hackNYDS.key - Thursday:June.18

Page 4: data science: past, present, and future

4 hackNYDS.key - Thursday:June.18

Page 5: data science: past, present, and future

5 hackNYDS.key - Thursday:June.18

Page 6: data science: past, present, and future

“data science” jobs, jobs, jobs

6 hackNYDS.key - Thursday:June.18

Page 7: data science: past, present, and future

“data science” jobs, jobs, jobs

7 hackNYDS.key - Thursday:June.18

Page 8: data science: past, present, and future

“data science” jobs, jobs, jobs

8 hackNYDS.key - Thursday:June.18

Page 9: data science: past, present, and future

“data science” d. conway, 2010

9 hackNYDS.key - Thursday:June.18

Page 10: data science: past, present, and future

“data science” blogs, blogs, blogs

10 hackNYDS.key - Thursday:June.18

Page 11: data science: past, present, and future

“data science” blogs, blogs, blogs

11 hackNYDS.key - Thursday:June.18

Page 12: data science: past, present, and future

modern history:2009

12 hackNYDS.key - Thursday:June.18

Page 13: data science: past, present, and future

“data science” blogs, blogs, blogs

13 hackNYDS.key - Thursday:June.18

Page 14: data science: past, present, and future

“data science” blogs, blogs, blogs

The first time I heard "data science" was in 2007 while reading a proposal that my adviser had passed along, outlining an academic program similar to what we think of as data science.

The first time I heard "data science" was in 2007 while reading a proposal that my adviser had passed along, outlining an academic program similar to what we think of as data science.

14 hackNYDS.key - Thursday:June.18

Page 15: data science: past, present, and future

“data science” blogs, blogs, blogs

15 hackNYDS.key - Thursday:June.18

Page 16: data science: past, present, and future

“data science” ancient history: 2001

16 hackNYDS.key - Thursday:June.18

Page 17: data science: past, present, and future

“data science” ancient history: 2001

17 hackNYDS.key - Thursday:June.18

Page 18: data science: past, present, and future

data science context

18 hackNYDS.key - Thursday:June.18

Page 19: data science: past, present, and future

data science context

19 hackNYDS.key - Thursday:June.18

Page 20: data science: past, present, and future

home schooled

20 hackNYDS.key - Thursday:June.18

Page 21: data science: past, present, and future

PhD in topology

21 hackNYDS.key - Thursday:June.18

Page 22: data science: past, present, and future

“By the end of late 1945, I was a statistician rather than a topologist”

22 hackNYDS.key - Thursday:June.18

Page 23: data science: past, present, and future

invented: “bit”

23 hackNYDS.key - Thursday:June.18

Page 24: data science: past, present, and future

invented: “software”

24 hackNYDS.key - Thursday:June.18

Page 25: data science: past, present, and future

invented: “FFT”

25 hackNYDS.key - Thursday:June.18

Page 26: data science: past, present, and future

“the progenitor of data science.” - @mshron

26 hackNYDS.key - Thursday:June.18

Page 27: data science: past, present, and future

“The Future of Data Analysis,” 1962John W. Tukey

27 hackNYDS.key - Thursday:June.18

Page 28: data science: past, present, and future

introduces: “Exploratory data anlaysis”

28 hackNYDS.key - Thursday:June.18

Page 29: data science: past, present, and future

Tukey 1965, via John Chambers

29 hackNYDS.key - Thursday:June.18

Page 30: data science: past, present, and future

TUKEY BEGAT S WHICH BEGAT R

30 hackNYDS.key - Thursday:June.18

Page 31: data science: past, present, and future

Tukey 1972

31 hackNYDS.key - Thursday:June.18

Page 32: data science: past, present, and future

? 1972

32 hackNYDS.key - Thursday:June.18

Page 33: data science: past, present, and future

Jerome H. Friedman

33 hackNYDS.key - Thursday:June.18

Page 34: data science: past, present, and future

TUKEY BEGAT ESL

34 hackNYDS.key - Thursday:June.18

Page 35: data science: past, present, and future

Tukey 1975

In 1975, while at Princeton, Tufte was asked to teach a statistics course to a group of journalists who were visiting the school to study economics. He developed a set of readings and lectures on statistical graphics, which he further developed in joint seminars he subsequently taught with renowned statistician John Tukey (a pioneer in the field of information design). These course materials became the foundation for his first book on information design, The Visual Display of Quantitative Information

35 hackNYDS.key - Thursday:June.18

Page 36: data science: past, present, and future

TUKEY BEGAT VDQI

36 hackNYDS.key - Thursday:June.18

Page 37: data science: past, present, and future

Tukey 1977

37 hackNYDS.key - Thursday:June.18

Page 38: data science: past, present, and future

TUKEY BEGAT EDA

38 hackNYDS.key - Thursday:June.18

Page 39: data science: past, present, and future

fast forward -> 2001

39 hackNYDS.key - Thursday:June.18

Page 40: data science: past, present, and future

“The primary agents for change should be university departments themselves.”

40 hackNYDS.key - Thursday:June.18

Page 41: data science: past, present, and future

data science @ The New York Timesand how a 164-year old content company became data-driven

41 hackNYDS.key - Thursday:June.18

Page 42: data science: past, present, and future

biology: 1892 vs. 1995

biology changed for good.

42 hackNYDS.key - Thursday:June.18

Page 43: data science: past, present, and future

data science: mindset & toolset

drew conway, 2010

43 hackNYDS.key - Thursday:June.18

Page 44: data science: past, present, and future

1851

44 hackNYDS.key - Thursday:June.18

Page 45: data science: past, present, and future

news: 20th century

church state

45 hackNYDS.key - Thursday:June.18

Page 46: data science: past, present, and future

church

46 hackNYDS.key - Thursday:June.18

Page 47: data science: past, present, and future

church

47 hackNYDS.key - Thursday:June.18

Page 48: data science: past, present, and future

news: 20th century

church state

48 hackNYDS.key - Thursday:June.18

Page 49: data science: past, present, and future

news: 21st century

church state

engineering

49 hackNYDS.key - Thursday:June.18

Page 50: data science: past, present, and future

1851 1996

newspapering: 1851 vs. 1996

50 hackNYDS.key - Thursday:June.18

Page 51: data science: past, present, and future

example:

millions of views per hour2015

51 hackNYDS.key - Thursday:June.18

Page 52: data science: past, present, and future

52 hackNYDS.key - Thursday:June.18

Page 53: data science: past, present, and future

data science: the web

53 hackNYDS.key - Thursday:June.18

Page 54: data science: past, present, and future

data science: the web

is your “online presence”

54 hackNYDS.key - Thursday:June.18

Page 55: data science: past, present, and future

data science: the web

is a microscope

55 hackNYDS.key - Thursday:June.18

Page 56: data science: past, present, and future

data science: the web

is an experimental tool

56 hackNYDS.key - Thursday:June.18

Page 57: data science: past, present, and future

data science: the web

is an optimization tool

57 hackNYDS.key - Thursday:June.18

Page 58: data science: past, present, and future

1851 1996

newspapering: 1851 vs. 1996 vs. 2008

2008

58 hackNYDS.key - Thursday:June.18

Page 59: data science: past, present, and future

“a startup is a temporary organization in search of a repeatable and scalable business model” —Steve Blank

59 hackNYDS.key - Thursday:June.18

Page 60: data science: past, present, and future

every publisher is now a startup

60 hackNYDS.key - Thursday:June.18

Page 61: data science: past, present, and future

61 hackNYDS.key - Thursday:June.18

Page 62: data science: past, present, and future

every publisher is now a startup

62 hackNYDS.key - Thursday:June.18

Page 63: data science: past, present, and future

news: 21st century

church state

engineering

63 hackNYDS.key - Thursday:June.18

Page 64: data science: past, present, and future

news: 21st century

church state

engineering

64 hackNYDS.key - Thursday:June.18

Page 65: data science: past, present, and future

learnings

65 hackNYDS.key - Thursday:June.18

Page 66: data science: past, present, and future

learnings

- supervised learning- unsupervised learning- reinforcement learning

66 hackNYDS.key - Thursday:June.18

Page 67: data science: past, present, and future

learnings

- supervised learning- unsupervised learning- reinforcement learning

cf. modelingsocialdata.org

67 hackNYDS.key - Thursday:June.18

Page 68: data science: past, present, and future

supervised learning, e.g.,

cf. modelingsocialdata.org

68 hackNYDS.key - Thursday:June.18

Page 69: data science: past, present, and future

supervised learning, e.g.,

“the funnel”

cf. modelingsocialdata.org

69 hackNYDS.key - Thursday:June.18

Page 70: data science: past, present, and future

interpretable supervised learning

supe

r co

ol s

tuff

cf. modelingsocialdata.org

70 hackNYDS.key - Thursday:June.18

Page 71: data science: past, present, and future

unsupervised learning, e.g,

“segments”

cf. modelingsocialdata.org

71 hackNYDS.key - Thursday:June.18

Page 72: data science: past, present, and future

unsupervised learning, e.g,

“segments”

cf. modelingsocialdata.org

72 hackNYDS.key - Thursday:June.18

Page 73: data science: past, present, and future

unsupervised learning, e.g,

“segments”

argmax_z p(z|x)=14

cf. modelingsocialdata.org

73 hackNYDS.key - Thursday:June.18

Page 74: data science: past, present, and future

unsupervised learning, e.g,

“segments”

“baby boomer”

cf. modelingsocialdata.org

74 hackNYDS.key - Thursday:June.18

Page 75: data science: past, present, and future

unsupervised learning, e.g,

cf. modelingsocialdata.org

75 hackNYDS.key - Thursday:June.18

Page 76: data science: past, present, and future

reinforcement learning

cf. modelingsocialdata.org

76 hackNYDS.key - Thursday:June.18

Page 77: data science: past, present, and future

reinforcement learning

aka “A/B testing”;RCT

cf. modelingsocialdata.org

77 hackNYDS.key - Thursday:June.18

Page 78: data science: past, present, and future

Reporting

Learning

Testaka “A/B testing”;

business as usual

(esp. supervised)

Some of the most recognizable personalization in our service is the collection of “genre” rows. …Members connect with these rows so

well that we measure an increase in member retention by placing the most tailored rows higher on the page instead of lower.

cf. modelingsocialdata.org

78 hackNYDS.key - Thursday:June.18

Page 79: data science: past, present, and future

real-time A/B -> “bandits”

GOOG blog:

cf. modelingsocialdata.org

79 hackNYDS.key - Thursday:June.18

Page 80: data science: past, present, and future

Reporting

Learning

Test

Optimizing

Exploreunsupervised:

supervised:

reinforcement:

80 hackNYDS.key - Thursday:June.18

Page 81: data science: past, present, and future

Reporting

Learning

Test

Optimizing

Exploreunsupervised:

supervised:

reinforcement:

81 hackNYDS.key - Thursday:June.18

Page 82: data science: past, present, and future

common requirements in data science:

82 hackNYDS.key - Thursday:June.18

Page 83: data science: past, present, and future

common requirements in data science:

1.people2.ideas3.things

cf. USAF

83 hackNYDS.key - Thursday:June.18

Page 84: data science: past, present, and future

things:what does DS team deliver?

84 hackNYDS.key - Thursday:June.18

Page 85: data science: past, present, and future

things:what does DS team deliver?

- build data prototypes- build APIs- impact roadmaps

85 hackNYDS.key - Thursday:June.18

Page 86: data science: past, present, and future

- build data prototypes

86 hackNYDS.key - Thursday:June.18

Page 87: data science: past, present, and future

- build data prototypes

cf. daeilkim.com

87 hackNYDS.key - Thursday:June.18

Page 88: data science: past, present, and future

- build APIs

88 hackNYDS.key - Thursday:June.18

Page 89: data science: past, present, and future

- build APIs

89 hackNYDS.key - Thursday:June.18

Page 90: data science: past, present, and future

- impact roadmaps

flickr/McJex

90 hackNYDS.key - Thursday:June.18

Page 91: data science: past, present, and future

data science: ideas

91 hackNYDS.key - Thursday:June.18

Page 92: data science: past, present, and future

data skills

- data engineering- data science- data visualization- data product- data multiliteracies- data embeds

cf. “data scientists at work”, ch 1

92 hackNYDS.key - Thursday:June.18

Page 93: data science: past, present, and future

data science: people

- new mindset > new toolset

93 hackNYDS.key - Thursday:June.18

Page 94: data science: past, present, and future

summary:pay attention to:

1.people2.ideas3.things

cf. USAF

94 hackNYDS.key - Thursday:June.18

Page 95: data science: past, present, and future

wait i want to learn more stuff

95 hackNYDS.key - Thursday:June.18

Page 96: data science: past, present, and future

wait i want to learn more stuff

githubs ESL

play w/data

96 hackNYDS.key - Thursday:June.18

Page 97: data science: past, present, and future

githubs

97 hackNYDS.key - Thursday:June.18

Page 98: data science: past, present, and future

githubs

98 hackNYDS.key - Thursday:June.18

Page 99: data science: past, present, and future

githubs

99 hackNYDS.key - Thursday:June.18

Page 100: data science: past, present, and future

play w/data

100 hackNYDS.key - Thursday:June.18

Page 101: data science: past, present, and future

play w/data

101 hackNYDS.key - Thursday:June.18

Page 102: data science: past, present, and future

play w/data

102 hackNYDS.key - Thursday:June.18

Page 103: data science: past, present, and future

ESL

103 hackNYDS.key - Thursday:June.18

Page 104: data science: past, present, and future

ESL

104 hackNYDS.key - Thursday:June.18

Page 105: data science: past, present, and future

a “book”

105 hackNYDS.key - Thursday:June.18

Page 106: data science: past, present, and future

wait i want to learn more stuff

githubs ESL

play w/data

106 hackNYDS.key - Thursday:June.18

Page 107: data science: past, present, and future

data science: past/present/future

1962-2062

[email protected] @chrishwiggins

references: http://bit.ly/datascience-links

107 hackNYDS.key - Thursday:June.18

Page 108: data science: past, present, and future

108 hackNYDS.key - Thursday:June.18

Page 109: data science: past, present, and future

“popular” jobs, jobs, jobs

109 hackNYDS.key - Thursday:June.18

Page 110: data science: past, present, and future

“popular” jobs, jobs, jobs

110 hackNYDS.key - Thursday:June.18

Page 111: data science: past, present, and future

“popular” jobs, jobs, jobs

111 hackNYDS.key - Thursday:June.18