14.2 Computer software Ross Ihaka is one the originators of R, a software package for statistical...

  • date post

    20-Jan-2016

14.2 Computer software

Ross Ihaka is one of the originators of R, a software package for statistical computing that has had phenomenal uptake internationally. It can be downloaded free of charge and easily customised for a very wide variety of applications. The package and the paper introducing it have been cited over 1700 times, by far the highest count for publications in the mathematical sciences over the last ten years, worldwide. It is now disseminated from over 75 internet sites in 30 countries.

R

CRAN

The package is used both for teaching and research by hundreds of universities around the world, including Stanford, Oxford, Cambridge and Berkeley. There are over 40 books written about, or featuring, the use of R.

http://www.nytimes.com/2009/01/07/technology/business-computing/07program.html?scp=2&sq=ihaka&st=cse

Time series data.

t: time (but could be space)

t between 0 and T (continuous)

t = 0,...,T-1 (discrete)

t = 1,...,N

(unique) value Y(t)

Y takes values in: R, {0,1}, R^p, N, category, [0, 2)
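As a minimal sketch of the discrete-time case, the base R function `ts()` can hold a series indexed by t = 0,...,T-1, and `cbind()` gives a vector-valued (R^p) series. The numeric values below are made up purely for illustration.

```r
# Hypothetical values, chosen only to illustrate the indexing.
y <- ts(c(5, 7, 6, 9, 8, 10), start = 0)   # Y(t) for t = 0, ..., T-1 with T = 6
tt <- as.numeric(time(y))                  # the time index t itself

# A vector-valued series (values in R^p with p = 2): one column per component.
Y2 <- cbind(a = y, b = 2 * y)

# A binary {0,1} series derived from y: 1 when the value exceeds the median.
z <- as.integer(y > median(y))
```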

Useful when there is a special period P

t = nP+p, p = 0,...,P-1

Y_n(p) = Y(nP+p)
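The folding Y_n(p) = Y(nP+p) can be sketched in R by reshaping the series into a matrix with one row per cycle n and one column per phase p. The choice of the built-in `AirPassengers` data (monthly, so P = 12) is an assumption made for illustration; R vectors are 1-based, so Y(t) is stored as `y[t + 1]`.

```r
P  <- 12                                  # period: months per year
y  <- as.numeric(AirPassengers)           # Y(t) stored as y[t + 1], t = 0, ..., 143
Yn <- matrix(y, ncol = P, byrow = TRUE)   # row n + 1 is cycle n, column p + 1 is phase p

# Check the identity Y_n(p) = Y(nP + p) for one (n, p) pair.
n <- 2; p <- 4
Yn[n + 1, p + 1] == y[n * P + p + 1]      # TRUE

# Overlay the cycles: each curve is one year, plotted against phase p.
matplot(0:(P - 1), t(Yn), type = "l", lty = 1, xlab = "p", ylab = "Y_n(p)")
```

Overlaying the cycles this way makes the within-period pattern easy to compare from one cycle to the next.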

Vector ts case: matplot(type="l", ...)

use several line types

colors

use to force comparison
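A minimal sketch of the vector case with `matplot`, using distinct line types and colors so the two components can be compared on common axes. The built-in `mdeaths` and `fdeaths` series are an assumption chosen for illustration, not the data on the slide.

```r
# Two monthly series on the same time grid (built-in datasets, 72 months each).
tt <- as.numeric(time(mdeaths))
Y  <- cbind(male = as.numeric(mdeaths), female = as.numeric(fdeaths))

# One curve per column; line type and color distinguish the components.
matplot(tt, Y, type = "l", lty = 1:2, col = c("black", "red"),
        xlab = "year", ylab = "monthly deaths")
legend("topright", legend = colnames(Y), lty = 1:2, col = c("black", "red"))
```

Plotting both columns in one frame forces a shared vertical scale, which is what makes the comparison direct.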

Tufte (1983). 10th or 11th century movement of planets and sun

Connected. Good for smooth series

individual data points not unambiguously displayed

irregular sampling can be unclear

plot(type="l", ...)

Symbol graph. Good for long term behavior

cannot appreciate middle and high frequency behavior

irregular sampling can be unclear

plot(type="p",...) seals T = 940

Both. Points and line segments

plot(type="b", ...)

Both. Points on top of connected

plot(type="o", ...)
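The four display types discussed so far ("l", "p", "b", "o") can be compared side by side. This is a sketch on a synthetic noisy sine series, which is an assumption for illustration and not one of the data sets shown on the slides.

```r
# Synthetic series: noisy sine wave, 60 equally spaced points.
set.seed(1)
tt <- 0:59
y  <- sin(2 * pi * tt / 12) + rnorm(length(tt), sd = 0.3)

op <- par(mfrow = c(2, 2))               # 2 x 2 grid of panels
for (ty in c("l", "p", "b", "o")) {
  plot(tt, y, type = ty, main = paste0('type = "', ty, '"'),
       xlab = "t", ylab = "Y(t)")
}
par(op)                                  # restore previous graphics settings
```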

plot(type="n", ...)

points(pch="*", ...)

plot(type="n", ...)

lines(lty=3, ...)
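The two-step pattern on these slides, an empty frame from `plot(type="n")` followed by `points()` or `lines()`, can be sketched as below. The data are a synthetic random walk, and `lty = 3` (a dotted line) is an assumption about what the truncated `lines(` call on the slide intended.

```r
# Synthetic random walk, for illustration only.
set.seed(2)
tt <- 1:40
y  <- cumsum(rnorm(40))

plot(tt, y, type = "n", xlab = "t", ylab = "Y(t)")  # set up axes, draw no data
points(tt, y, pch = "*")                            # add the data as "*" symbols
lines(tt, y, lty = 3)                               # add a dotted connecting line
```

Building the plot in layers gives control over the symbol and line style of each component separately.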

plot(type="h", ...)

Good when need to see individual values

when series long

about central value (next)

Not good when strong trend

plot(type="h",...) Amazon T = 32875

Good about central value
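A sketch of the "about central value" idea: `type="h"` draws a vertical line from zero to each value, so subtracting a central value first makes the lines show deviations above and below it. The series is synthetic, and the use of the median as the central value is an assumption.

```r
# Synthetic series fluctuating around a level of 100.
set.seed(3)
y  <- 100 + rnorm(200, sd = 5)
yc <- y - median(y)                      # deviations from the central value

plot(yc, type = "h", xlab = "t", ylab = "Y(t) - median")
abline(h = 0)                            # reference line at the central value
```

Without centering, every vertical line would start at zero and run up to about 100, which would hide the variation.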

Box-Jenkins model-building strategy.

1. model specification

2. model fitting

3. model diagnostics

Scientific method
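The three steps can be sketched in R with functions from the base `stats` package. The built-in `lh` series and the AR(1) order are assumptions chosen for illustration; in practice the order would be suggested by step 1 and revised if step 3 shows problems.

```r
y <- lh                                   # built-in series, 48 observations

# 1. Model specification: inspect sample ACF and PACF to suggest an order.
acf(y)
pacf(y)

# 2. Model fitting: here an AR(1) with mean, as a tentative choice.
fit <- arima(y, order = c(1, 0, 0))

# 3. Model diagnostics: residuals should look like white noise.
res <- residuals(fit)
acf(res)
bt <- Box.test(res, lag = 10, type = "Ljung-Box", fitdf = 1)
```

If the residual ACF or the Ljung-Box test indicates remaining structure, the cycle returns to step 1 with a revised specification.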

EDA

I. "...three of the main strategies of data analysis are: 1. graphical presentation. 2. provision of flexibility in viewpoint and in facilities, 3. intensive search for parsimony and simplicity ..."

II. "In exploratory data analysis there can be no substitute for flexibility; for adapting what is calculated - and what we hope plotted - both to the needs of the situation and the clues that the data have already provided."

III. "I would like to convince you that the histogram is old-fashioned ..."

IV. "Exploratory data analysis ... does not need probability, significance or confidence."

V. "... I hope that I have shown that exploratory data analysis is actively incisive rather than passively descriptive, with real emphasis on the discovery of the unexpected ..."

VI. "'exploratory data analysis' is an attitude, a state of flexibility, a willingness to look for those things that we believe are not there, as well as those we believe to be there."

VII. "Exploratory data analysis isolates patterns and features of the data and reveals these forcefully to the analyst."

VIII. "If we need a short suggestion of what exploratory data analysis is, I would suggest that: 1. it is an attitude, AND 2. a flexibility, AND 3. some graph paper (or transparencies, or both)."