Eye movements and visual expertise in chess and...

28
CHAPTER 29 Eye movements and visual expertise in chess and medicine Eyal M. Reingold and Heather Sheridan Abst ra ct The chapter highlights th e theoretical a nd applied contributions of eye movement research to the study of human expertise. Using examples drawn from th e domains of chess and medicine, the chapter demonstrat es that eye movements arc particularly we ll -suited for studying two hallmarks of expert performance: the superior perceptual encoding of domain related patterns, and experts' tacit (or implicit) domain related knowledge. Spec ifi cally, eye mo vement findings indicate that expertise is associated with a greater abili ty to process domain related visual information in terms of larger patterns of features rather than isolated features. Furthermore, in support of th e role of tacit knowl- edge in exper tise, there is evidence that th e eye movements of experts may conta in informarion that is not consciously accessible. Introduction To assess the usefulness of eye movement measurement for the study of exper ti se one need look no further th an the pivotal role played by eye movement pa radigms in the study of read ing s kill (see Rayner, 1998 for a review; Part 5 of this volume) . However, there is a rapidly growing literature employing eye movement monitoring to study expert ise in domai ns other than reading. Our smvey of this literature found over 1000 relevant publications, in over a dozen skill domains. In addition to chess and medicine, which are the focus of the present chapter, these domains include art (e.g. Kozbelt and Seeley, 2007; Loeber, 2006; Nodine et al., 1993 ), aviation (e.g. Ah lstrom and Friedman- Berg, 2006; Sarter et al., 2007), d1·iving (e.g. Hills, 1980; Shinar, 2008; Underwood, 2007), forensics and security (e.g. Bond, 2008; Dyer et aJ., 2006; McCarley et a l. , 2004), mus ic read ing (e.g. Goolsby, 1989; Madell and Hebert, 2008; Rayner and Pollatsek, 1997), sports (e.g. Land and McLeod, 2000; Mann et a l. , 2007; Vickers, 1992), scientific knowledge (e.g. )arodzka et al., 2010; Tai et al., 2006; Van Gog et a l. , 2005), teach in g (e.g. Bchets, 1996; Petrakis, 1993), and typing (e.g. Butsch, 1932; In hoff and Gordon, 1997). A review of these studies is beyond the scope of the present chapter. Instead we illustrate the unique contributions of eye movement studi es to the study of skilled performance by reviewing findings from two domains of expertise: chess and medicine. The domai n of chess was se lected due to its cruci al hi storical significance fo r expertise research, while the domain of medicine was chosen because of the extensive and productive use of eye movement paradigms in this domain during the

Transcript of Eye movements and visual expertise in chess and...

CHAPTER 29

Eye movements and visual expertise in chess and medicine Eyal M. Reingold and Heather Sheridan

Abstract The chapter highlights the theoretical and applied contributions of eye movement research to the study of human expertise. Using examples drawn from the domains of chess and medicine, the chapter demonstrates that eye movements arc particularly well-suited for studying two hallmarks of expert performance: the superior perceptual encoding of domain related patterns, and experts' tacit (or implicit) domain related knowledge. Specifically, eye movement findings indicate that expertise is associated with a greater ability to process domain related visual information in terms of larger patterns of features rather than isolated features. Furthermore, in support of the role of tacit knowl­edge in expertise, there is evidence that the eye movements of experts may contain informarion that is not consciously accessible.

Introduction

To assess the usefulness of eye movement measurement for the study of expertise one need look no further than the pivotal role played by eye movement pa radigms in the study of read ing skill (see Rayner, 1998 for a review; Part 5 of this volume) . However, there is a rapidly growing literature employing eye movement monitoring to study expertise in domains other than reading. Our smvey of this literature found over 1000 relevant publications, in over a dozen skill domains. In addition to chess and medicine, wh ich are the focus of the present chapter, these domains include ar t (e.g. Kozbelt and Seeley, 2007; Loeber, 2006; Nodine et al., 1993), aviation (e.g. Ahlstrom and Friedman­Berg, 2006; Sarter et al., 2007), d1·iving (e.g. Hills, 1980; Shinar, 2008; Underwood, 2007), forensics and security (e.g. Bond, 2008; Dyer et aJ., 2006; McCarley et al. , 2004), music read ing (e.g. Goolsby, 1989; Madell and Hebert, 2008; Rayner and Pollatsek, 1997), sports (e.g. Land and McLeod, 2000; Mann et al. , 2007; Vickers, 1992), scientific knowledge (e.g. )arodzka et al., 2010; Tai et al., 2006; Van Gog et al., 2005), teach ing (e.g. Bchets, 1996; Petrak is, 1993), and typing (e.g. Butsch, 1932; In hoff and Gordon, 1997).

A review of these studies is beyond the scope of the present chapter. Instead we illustrate the unique contribu tions of eye movement studies to the study of skilled performance by reviewing findings from two domains of expertise: chess and medicine. The domain of chess was selected due to its crucial historical significance fo r expertise research, while the domain of medicine was chosen because of the extensive and productive use of eye movement paradigms in this domain during the

Owner
TextBox
Reingold, E. M., & Sheridan, H. (2011). Eye movements and visual expertise in chess and medicine. In S.P. Liversedge, I.D. Gilchrist, & S. Everling, (Eds.) Oxford Handbook on Eye Movements (pp. 528-550). Oxford: Oxford University Press.

524 · Eyal M. Reingold and Heather Sheridan

past four decades. In addition, the present chapter is focused on two fundamental·aspects of skilled performance for which eye movement techniques are uniquely suited; 1) the superior perceptual encoding of domain related patterns by experts and 2) the exploration of tacit, or implicit knowledge, which constitutes a hallmark of human expertise. In the remainder of this chapter, we introduce these topics and review related studies followed by a brief concluding section.

Expertise and superior perceptual encoding of domain related patterns Introductory textbooks to cognitive science typically discuss perception in the early book sections and problem solving and expertise in the fina l book sections to reflect the progression from 'low­level perception' to 'high- level cognition'. However, one of the most fascinating and impressive aspects of skilled performance is the ability of the experienced eye to encode at a glance the essence of briefly presented stimulus material that is related to the domain of expertise (henceforth, domain related patterns). The most innuential investigation of the perceptual aspects of skilled perfoimance odginated from pioneering work on expertise in chess by de Groot ( 1946/1965) and Chase and Simon ( 1973a, 1973b). Indeed, this research is often regarded as the origin of modern expertise research. Accordjngly, we begi.n this section by briefly reviewing this work and its impact. We then review eye movement findings concerning both the superior perceptual eJJcoding of chess related patterns by experts, and the unconscious processing of chess related patterns by experts. Later on in this chapter, the generality of these findings is assessed by reviewing related findings concerning eye movements and visual expertise in medicine.

Perception in chess

Chess research dates back to the beginning of modem experimental psychology during the late 1800s and early 1900s (e.g. Binet, 1894; Cleveland, 1907; Djakow et al. ; 1927). Simon and Chase (1973) argued that simjJar to the use of Drosophila (the frui t fly) as a model organism for the study of genetics, chess offers cognitive scientists an ideal task environment for the study of cognitive processes in general, and ski lled performance in particular. Similarly, Newell and Simon (1972) selected chess as one of the three model tasks that they used in developing their highly influential information process­ing theory of human problem solving. ConsisteJH with these suggestions, during the last century chess research has proven to be very instrumental in enhancing our understanding of human expertise and in contributing to the study of AT (for reviews see Charness, 1992; Ericsson and Charness, 1994 ).

Arguably, the most important contribution of chess research is in producing a major theoretical shift in tbe conceptualization of expertise in cognitive science away from viewing skilled perform­ance as the product of superior general intelligence and innate talent, and toward the recognition that expertise largely reflects domau1 specific knowledge acquired through extensive and deliberate practice (for a review see Ericsson and Charness, 1994). This dramatic change in perspective origi­nated from pioneering work on chess by de Groot (1946/1965) and Chase and Simon (l973a, l973b). De Groot (1946/1965) presented chess positions briefly (2- 15 s) and then removed them from view. Even after such a brief exposure the best chess players were able to reproduce the locations of the chess pieces almost perfectly (ab()ut 93% correct for positions containing abo\lt 25 pieces), and substantially better than less skilled players. In a classic s tudy, Chase and Simon (1973a, 1973b) replicated and extended de Groot's findings by demonstrating that after viewing chess positions for on ly a few seconds, chess masters were able to reproduce these positions much more accurately than Jess skilled players. Chase and Simon also presented chess positions with randomly rearranged chess pieces. There was little difference as a function of expertise when random board configurations were used , which indicates that the superior immediate memory performance of the skilled players wus not attributable to the general superiority or

Eye movements and visual expertise in chess and medicine · 525

unique structure of their memory systems or processes. More recently, a very small but reliable advantage in recall for random configurations has been shown for expert players, though this is probablyattributable to occasional presence of famil iar configurations in random positions (Go bet and Simon, 1996a).

Taken together, the findings reported by de Groot and Chase and Simon suggest that chess grand­masters use efficient perceptual encoding of chess configurations to generate the most promising candidate moves <Uld to restrict their reliance on the effortful and slow serial search through the space of possible moves. Consistent with this suggestion, both de Groot and Chase and Simon highlighted the importance of perceptual encoding of chess configurations as a key determinant of chess skill. For example, in a seminal paper entitled 'Perception in Chess', Chase and Simon ( l973a) introduced their chunking theory of skilled performance in chess. Echoing an earlier conclusion by de Groot ( 1946/1965) that the efficiency of perceptual encoding processes was a more important differentiator of chess expertise than was lhe ability to think ahead in the search for good moves, Chase and Simon ( 1973a) argued 'that the most important processes underlying chess mastery are these immediate visual-perceptual processes rather than the subsequent logical-deductive thinking processes' (p. 215). Chase and Simon (1973a, l973b) proposed that through extensive study and practice, expert players build up associations between perceptually recognizable chunks (i.e. groups of chess pieces related by type, colour, or role) and long-term memory structures that trigger the generation of plausible moves. Search for the best move is thereby constrained to the more promis­ing branches in the space of possible moves from a given chess position. The size of an expert's vocabulary of chess related COJJfigurations was initially estimated to be 50,000-100,000 chunks (Simon and Gilmartin, 1973) . However, a more recent estimate puts the m1mber of chunks at approximately 300,000 (Gobel and Simon, 2000). In addition, small perceptual chunks are most likely supplemented by larger structures termed templates (Go bet and Simon, 1996b, 1998).

Eye movements in chess: predictions and studies An important goal of the present review is to illustrate the potential role of eye movement measure­ment in supplementing traditional measures of performance such as reaction time ( RT), accuracy, and verba.! reports as a means for inves tigating the perceptual aspects of skilled performance in general, and chess skill in particular. One facilitaliog factor for using eye movement measurement in chess is that just like words and sentences, the chess board is easily, visually segmentable. In addition, if as suggested by Chase and Simon and de Groot, chess masters perceptually encode chess positions more efficiently by relying upon larger patterns of related pieces (i.e. chunks), then several predic­tions concerning the differences in eye movement patterns between exp~rt and intermediate players can be made. Specifically, chess experts' encoding of chunks rather than individual pieces should result in fewer fixations, and fixations between rather than on individual pieces. This may also imply that in any given fixation that is produced while examining structured but not random chess config­urations, experts process information about a larger segment of the chessboard tha11 less skil led players con~tituting an increa$e in the visual span as a function of expertise (the term visual span is also referred to in the literature as the perceptual span or the span of effective vision, see facobs, 1986; Rayner, 1998). Sud1 a visual span advantage should also mean that experts make greater use of peripheral and para foveal processing to extract information from a larger portion of a chessboard during an eye fixation. Ln addition, experts may make greater use of automatic and parallel extraction of chess relations relative to intermediate players.

Several early studies employing eye movement measurement provided weak support for the idea that perception of chess related configurations improves with skill. Tikhomirov and Poznyanskaya ( 1966) and Winikoff ( 1967) both found evidence that when chess players fixate on a chess piece, they also extract information about other pieces near the point of gaze and often move to fLXate on a related piece. Based on this general process, Simon and Barenteld ( I 969) devised a computer model to simulate the initial scanning patterns chess players might use when encoding a chess position. Their simulation, PERCEIVER, produced eye movement patterns that resembled those of chess players.

526 · Eyal M. Reingold and Heather Sheridan

Reynold~ ( 1982) and Holdlng (1 985) re-examined the eye movement data collected by Tikhomirov and Poznyanskaya ( 1966 ), and noted that many fixations did not fall on pieces, but on empty squares. There was no report of systematic variation in the proportion of fixations on empty squares as a function of skiLl.

Re-analysing the work of Jongman ( 1968), de Groot and Go bet (1996) reported no significant difference in the proportion of fixations on empty squares as a function of skill. These authors cautioned however that the negative results do not necessarily refute the chunking hypothesis. They pointed out that the crude frame-by-frame analysis of film records of eye movements and the trans­formation of gaze positions from a three-dimensional chessboard viewed by the players to a two­dimensional coordinate system may have .resulted in the introduction of noise making it d ifficult to estimate the accuracy of the computed gaze position. Furthermore, de Groot and Go bet (I 996) demonstrated that skilled players made more fixations along the edges of squares (28.7% of fixa­tions) as compared with novices ( 13.7%), providing some indication that the skilled players may be able to e.rtcode two or more pieces in a single fLxation. In addition, de Groot and Gobet (1996) concluded based on their analysis of retrospective verbal reports that the best players tended to perceive groups of pieces, ra ther than individual pieces.

More recently, a research programme by Reingold, Charness, and their colleagues (Charness et al., 2001 ; Reingold and Charncss, 2005; Reingold et al., 200la, 200lb) that employed more modern eye movement paradigms provided strong support for enhanced perceptual encoding as a function of chess expertise. Across studies Reingold, Chamess, and their colleagues employed three different tasks: 1) a check detection paradigm, 2) a 'change blindness' flicker paradigm, and 3) a move-choice task. These tasks and the fmdings obtained will be discussed below.

The check detection task

Saari luoma ( 1985) has shown that master players can rapidly and accurately decide whether a chess piece is attacked, and do so more quickly than their less skilled counterparts. The rather simple chess relation of check detection (attack of a King) is highly salient and presents a good model for the extraction of chess-relevant relations among pieces. As ~hown in Pig. 29.1 A, the check detection task employed by Reingold et al. (2001 a) was performed using a minimized 3x3 chessboard containing a Black King and one or two potentially checking pieces. At the beginning of each trial, participants fixated the centre square of the board, a square that was always empty. A large visual span in this task may result in few if any saccades during a trial and in fixations between, rather than on individual pieces. To demonstrate that the encoding advantage of experts is related at least in part to their chess experience, rather than to a general perceptual superiority, Reingold el al. (200 I a) manipulated the familiarity of the notation (symbol vs. letter) used to represent the chess pieces (see Fig. 29.2A). The symbol and letter notations were used to represent identical chess problems. However, the symbol representation is much more familiar than the letter representation. Consequently, if encoding effi­ciency is related to chess experience, any skill advantage should be more pronounced in the symbol than in the letter trials (i.e. a skill by notation interaction).

In order to compare the spatial distributions of gaze positions in the check detection task across the novice, intermediate and expert groups, Fig. 29.26 shows scattergrams with each dot represent­ing an individual gaze position. An inspection of the scattergrams collapsing across all trial types (i.e. the spatial layout of chess pieces, check status, and notatioD), reveals a greater concentration of black pixels in the centre of the scattergram for the experts as compared to the intermediates and novices. This centre of gravity effect reflects a large disparity between skill groups in the proportion of trials without an eye movement ( i.e. No-saccade trials). In such trials the gaze position remained in the centre square of the chessboard throughout the duration of the triaL For each skill group by notation type, Panel C of Fig. 29.2 displays the proportion of No-saccade trials. As can be dearly seen in this figure, only the expert group demonstrated a substantial proportion of No-saccade trials and the proportion of such trials was greater for the symbol notation than the letter notation in this group of players.

Eye movements a nd visual expertise in chess and medicine · 527

A Expert B Radioloogist

+ 712

Resident

Intermediate

• 3rd year medical student

2nd year medical student

Novice

1 51 year medical student

Fig. 29.1 Scan path efficiency as a function of expertise for chess players (A) and radiologists (B).

As shown in Fig. 29.2D, on trials in which eye movements occurred, experts made fewer fixations than intermediates and novices. In addition, compared to their less skilled counterparts, experts placed a smaller proportion of these fixations on pieces (Fig. 29.2E). More importantly, all skill related differences were more pronounced in the symbol than the letter notation. Thus, consistent with Chase and Simon's chunking hypothesis, in the check detection task chess experts made fewer fixations and placed a greater proportion of fixations between individual pieces, rather than on pieces. The magnitude of these effects was stronger for the more familiar symbol notation than for the letter notation, demonstrating that the experts' encoding advantage is related at least in part to their chess experience, rather than to a general perceptual superiority.

528 · Eyal M. Reingold and Heather Sheridan

A Symbol notation C 0.3

"' D Symbol notation (ij

Letter notation

K

B Expert

Intermediate

·.s (!.J Letter notation <I> -o

~ 0.2 1/)

6 c:

§ 0.1 :e 0 c. e 0..

D 5

(/) c: .2 4 (ti

~ 0 (i; ..0 E 3 ::l z

E (/) <I> u <I> ·a. c 0 (/) c 0

iii ~ 0 c: 0 ·-e 0 c. e 0..

2

0.7

0.6

0.5

0.4

0.3

0.2

Novice

Novice

Novice

Intermediate Expert

0 Symbol notation 12:.] Letter notation

Intermediate Expert

0 Symbol notation 12:.] Letter notation

Intermediate Expert

Fig. 29.2 The check detection task from Reingold et al. (2001a). The notation manipulation (A). Scattergrams of gaze positions in the check detection task (B) the capital letter 'A' represents the position of an attacker piece and 'K' represents the posit ion of the King). Proportion of No-saccade trials (C), number of fixations (D), and proportion of fixations on pieces (Panel E). Eyal M. Reingold, Neil Charness, Marc Pomplun, and Dave M. Stampe, Psychological Science, 12(1), copyright © 2001 by Sage Publications. Reprinted by Permission of SAGE Publications.

Eye movements and visual expertise in chess and medicine · 529

The change blindness flicker paradigm

The flicker paradigm was introduced by Rensink eta!. (1997). Reingold et al. (200la} used two types of configurations: chess configurations (with 20 chess pieces in each) selected from a large database of chess games, and random configurations, which were created by repeatedly and randomly exchanging pieces in the chess configurations. Thus, random positions maintained the same spatial configuration but destroyed the chess relation information. Each random or chess configuration was modified by changing the identity but not the colour of a single piece to create a modified display (see Fig. 29.3A). In each trial, in1ages of the original and modified board configurations were displayed sequentially and alternated repeatedly with a blank interval between each pair of configurations. Each variant of the configuration (i.e. original, modified) was presented for 1000 ms, with the display blanking for 100 ms between each alternation. As soon as participants detected the changing piece (the target), they ended the trial by pressing a button and naming the alternating pieces. Previous research indicated that participants are surprisingly poor at change detection in the flicker paradigm, a phenomenon termed 'change blindness' (Rensink et al., 1997; for a review see Simons and

A

• • •• t l! ~ t

it ••• • B 30

• • • •

~ Chess configurations 25 D Random configurations

eli) <ll Q)

g. (ij 20 _ :> <l)t:r :J (/)

·~ 0 15

0~ ~ § 10 <CS

5

Novice Intermediate Expert

Fig. 29.3 Il lustration of the gaze-contingent flicker paradigm from Reingold et al. (2001a). A) Shows an original (bottom left) and a modified (top right) chess configuration (with a box indicating the changed piece) and with chess pieces outside the window being replaced by blobs masking their identity and colour. B) Area of visual span (number of squares). Eyal M. Reingold, Neil Charness, Marc Pomplun, and Dave M. Stampe, Psychological Science, 12(1), copyright © 2001 by Sage Publications. Reprinted by Permission of SAGE Publications.

530 · Eyal M. Reingold and Heather Sheridan

Levin, 1997). It was predicted that when processing chess configurations, but not random configura­tions, chess experts would demonstrate larger visual spans than less skilled players.

ln this task the visual span as a function of chess skill (novice vs. intermediate vs. eKpert) and configuration type (chess configuration vs. random configuration) was measured using a gaze­contingent window technique (e.g. McConkie and Rayner, 1975; see Rayner, 1998, for a review). As shown in Fig. 29.3A, a gaze-contingent window requires obscuring the identity of all chess pieces except those within a certain 'window' that is continually ce11tred on the participant's gaze position. The pieces outside a circular, gaze-centred window were replaced with grey blobs masking the actual colours and shapes. The participant-'s visual span was measured by varying the size of the window over successive trials and determining the smallest possible window that did not significantly differ from the participant's normative RT criteria. These criteria were established separately for chess configuration and random configuration by using baseline trials in which the entire display was visible (i.e. No-window trials). Note that this change detection task required no chess knowledge and consequently Reingold et al. (200 I a) were able to explore visual span across a broad range of chess skill stretching from novice to master. For each skill group by configuration type, Panel B of Fig. 29.3 displays the visual span results. Experts' span area for chess configurations was dramatically larger than all other skill group by configuration type cells, which in turn did not differ from each other. Thus, consistent with Chase and Simon's hypothesis, the increase in visual span area which charac­terizes expert performance on trials with chess, but not random configurations, clearly indicates an encoding advantage attributable to chess experience, rather than to a general perceptual or memory superiority.

The move-choice task

Given the strong support for enhanced perceptual encoding as a function of chess expertise obtained in t he tasks reviewed above, Charness et al. (2001) aJ1d Reingold and Charness (2005), attempted to extend these findings to the more ecologically valid task of choosing the best move with full chess­boru·d displays (henceforth, the move-choice task). Focusing on the perceptual encoding phase during move-choice trials. Charness et al. (2001) restricted their analysis to the first five fixations in each trial (approximately the first 1-2 s). Consistent with the check detection findings, experts produced a greater proportion of fixations on empty squares than intem1ediates (experts: M = 0.52; intermediates: M = 0.41 ). In addition, consistent with de Groot and Gobet (1996), among fiXations on pieces, experts produced a greater proportion of fixations on salient pieces (i.e. active pieces that were relevant to generating the best move) than intermediates (experts: M = 0.80; intermediates: M = 0.64). Piece saJjency was determined by asking two international masters to classify pieces in each position used in the experiment as salient· or non-salient. Thus, as uidicated by the spatial distri­bution of early fixations, experts processed larger patterns or chunks and such processing of global position information might have enabled them to be remarkably efficient in rapidly identifying task relevant pieces and configurations.

Whereas in Charness et al. (200 1) the focus was on the first five eye fixations during the perform­ance of the move-choice task, in the follow-up experiment Reingold and Chamess (2005) recorded fixations during the first I 0 sin each trial They hypothesized that an examination of changes in the number and duration of fixations, which might occltr as the trial progresses, would be potentially useful in distinguishing between perceptual encoding and problem solving or solution retrieval and evaluation. Specifically, perceptual encoding was expected to involve shorter fixations and conse­quently a greater number of fixations in a given time interval than problem solving. Reingold and Charoess (2005) were also interested in the proportion of fixations with durations greater than 500 ms Sucb fiXations have been identified previously as reflecting visual problem solving and evaluation (e.g. Nodine et al., 1978).

Pig. 29.4A shows scattergrams aggregating all fixations in the first 10 s for both intermediates (left panel) and experts (right panel) for one of the positions used in the experiment. Ead1 circle repre­sents an individual fixation , and the diruneter of the circle increases as a function of an iucrease in

Eye movements and visual expertise in c hess and medicine · 531

Intermediates Experts

a b c d e 9 h a b c d e 9 h

8 22 0.2 0 First 5-second interval 0 First 5-second interval ~Second 5-second interval u

ll ~ Second 5-second interval 20 g

(/) 0 0 c tO

0 1\ ~ 18 X '!! ;,: g 0.1 0 "' Q) 16 ~ .0 0 E :l i5 z

14 t: ::> :l. ::> ct

12 0 Intermediate Expert Intermediate Expert

Fig. 29.4 Scattergrams aggregating all fixations in the first 10 s across intermediates (left) and experts (right) for one of the positions used in the move choice task in Reingold and Charness (2005) (A). Each c ircle represents an individual fixation, and the diameter of the circle increases as a function of an increase in fixation duration. Best move for this position = White Queen takes Pawn at h5 check, Black Rook takes White Queen, White Bishop moves to g6 mate. Number of fixations (left), and the proportion of fixations longer than 500 ms (right) in the move choice task by skill and interval (first 5-s interval vs. second 5·s interval) (B). Reproduced from G. Underwood, Cognitive Processes in Eye Guidance, 2005, with permission from Oxford University Press.

fixation duration. As can be clearly seen by comparing the scauergrams, consistent with the chunk­ing hypothesis and the findings of Charness et aJ. {2001 ), experts produced a greater proportion of fixations on empty squares than intermediates (experts: M = 0.55; intermediates: M = 0.43). rn addi­tion, as indicated by a comparison of the relative size of the circles across scattergrams, experts clearly produced a higher proportion oflonger fixations than intermediates. Reingold and Charness (2005) divided the 10-s period of eye movement recording in the beginning of each trial into two 5-s inter­vals. For each interval, the mean number of fixations and the proportion of fixations with durations greater than 500 ms were computed. Fig. 29.48 d isplays these two dependent variables by skill group and interval. As can be seen in this figure, the pattern of performance is qualitatively different across experts and intermediates. Specifically, for intermediates there was no difference across intervals in

532 · Eyal M. Reingold and Heather Sheridan

the number of ftxations and in the propoTtion of long ftxations (i.e. >500 ms). In marked contrast, experts produced substantially fewer fixations and a much greater proportion oflong ftxations as the trial progressed. This indicates that during tbe second 5-s interval in a trial experts started engaging in problem solving, whereas intermediates were still perceptually encoding the chess configura6ons. This p rovides further support for the hypothesis of enhanced efficiency of pattern recognition processes in encoding chess configurations as a function of chess expertise.

Unconscious processing of chess related patterns by experts

The results reviewed above suggest that it is likely that the main perceptual advantage for experts is not in the identification of single chess pieces and board locations, but rather in the extraction of relational infonnation between pieces. This was powerfully demonstrated by tbe strong skill effects on the area of the visual span obtained with actual chess configurations (i.e. wltere relational infor­mation is intact), coupled with the absence of skill effects on span size obtained with random config­Ltrations (i.e. where relational information is broken down). Based on this dramatic demonstration oflarger visual spao in chess experts, in SLtbsequent studies Reingold eta\. (200lb) and Reingold and Charness (2005) prt>poscd that one possible mechanism that might a llow chess masters to process chess configurations more efficiently is automatic and parallel extraction of several chess relations that together constitute a meaningful chunk. These studies contrasted the standard check detection trials with two attackers with trials in which one of two attackers was cued (coloured). In the latter condition, the task was to determine if the cued attacker was checking the King while ignoring the other attacker. Three different trials in wbicb the correct response was No were contrasted: 1) a standard No-Cue condition (i.e. no cueing), with 2 non-checking attackers, 2) a cued non­checking attacker that appeared togctherwit.h another non-checking attacker (i.e. a congruent condi­tion), and 3) a cued non-checking attacker that appeared together with a checking attacker (i.e. incongruent condition). Note that all of these trials are No trials even though the Incongruent condi­tion contains a checking attacker. That Is, in the Incongruent condition the semantics of the cued chess relation ( i.e. no check) is inconsistent with the semantics of the configuration as a whole (i.e. check). Serial processing of chess relations will manifest as faster processing (e.g. faster RTs, fewer fixations) in the Congruent condition, than in t.he No-Cue condition, as the cueing constrains the search space. In contrast, parallel processing of chess relations should result in no benefit from cueing in the Congruent condition. In addition, if parallel processing of chess relations occurs, cueing should produce slower processing in the Incongruent, than in the Congruent condition, demonstrating Stroop-like interference. While novices and intermediates displayed serial processing of chess tela­tioJlS, experts showed a parallel processing pattern includjng a Stroop like interference effect. Such an .interference effect constitutes the 'gold standard' for demonstrating automaticity and an absence of conscious control. ln other words, the experts, but not their less skilled counterparts were unable to avoid processing the global pattern even when such processing was detrimental to task perform<mce.

Eye movements have also revealed evidence of unconscious processing in a chess study concerning the Einstellu11g (set) effect (Bilalic et a.l., 2008). This effect refers to the finding that i11 many problem­solving situations, initially generating a less optimal solution can prevent a better solution from coming to mind. In the caseofcl1ess, Bilalic' et aJ. (2008) observed that when expert players (Candidate Masters) are asked to iind the fastest way to win a game, they might miss the best solution (checkmate in three moves) when a familiar, Jess optimal solution is also present (checkmate in five moves). To study the mechanisms underlying this effect, Bilalic et al. (2008) compared the percentage of time that players spent looking at the squares that were important for the optimal verst.lS the less optimal solution. tn retrospective reports, the players claimed that they had continued to search for a faster solution after discovering the five-move solution. However, their eye movement record showed that throughout the problem-solving period they spen t a greater proportion oftime looking at the squares associated with the less optimal than the optimal solution. This finding suggests that the Einstellung (set) effect may work by directing the problem-solvers' attention towards evidence which is consist­ent with their initial solution and preventing them from considering evidence that is consistent

Eye movements and visual expertise in chess and medicine · 533

with new solutions. Furthermore, the inconsistencies between the retrospective reports given by the players a11d their pattern of eye movements suggest that their eye movements reflected information that was not accessible to awareness.

Superior perceptual encoding and visual expertise in medicine

In medical domains such as radiology, pathology, dermatology, and minim<Jlly invasive surgery, there is a growing reliance on imaging equipment and the development of specialized visual expertise. Like experts in other domains, medical experts exhibit superior performance as indicated by their faster decision times and greater accuracy on domain related tasks (for a review, see Nodine and Mello-Thoms, 2000). Importantly for U1e present context, the application of eye tracking to the study of visual expertise in medicine (primarily in the field of radiology) has revealed a number of differences between experts and novices. As indicated in the review below, although eye movement studies in chess and medicine constitute independent research efforts with almost no overlap or cross citation, there is substantial theoretical and empirical convergence across these two domains. Table 29.1 contains brief summaries (in chJ·onological order) of studies that directly compared the eye movements of experts and novices in order to study medical expertise. Taken together, many of these studies suggest visual expertise in medicine involves superior encoding of domain related configurations (often referred to as a global processing advantage) by experts (see Fig. 29.1B for an illustration of increased efficiency of scan paths as a function of skill).

To explain the superior encoding of domain related patterns by medical experts, models, such as the global-focal search model (Nodine and Kundel, 1987) and the two-stage detection model (Swensson, 1980), have incorporated a global processing component. Both of these models postulate that experts have more efficient scan paths because they can simultaneously process visual informa­tion from across a wide field of vision (i.e. a larger visual span). According to the global-focal search model, upon viewing an image, experts quickly obtain a global impression, which consists of a comparison between the contents of the image and the expert's schema. The term schema refers to the expert's knowledge about the overall visual patterns that are associated with abnormal and normal radiographs. During the global impression, the e>..-pert makes 11ote of both perturbations, which are deviations from the expert's schema, and of potential abnormalities. These perturbations and the potential abnormal ities are then scanned with the fovea. Thus, the expert radiologist's knowl­edge about the visual patterns typically associated with normal and abnormal anatomic structures allows them to quickly pinpoint suspicious regions. As pointed out by Gunderman et al. (2001), this explanation of how prior experience shapes search patterns in radiology is similar to the concept of 'chunki.ng' (Miller, 1956). As such the global -focal search model is also reminiscent of Chase and Simon's chunki.ng theory of skilled performance in chess. Both models argue that given that experts have a vocabulary of domain specific visual patterns built up from their prior experiences in the domain, they are able to evaluate larger consteUalions of features, instead of only perceiving indi­vidual features.

Similarly, the two-stage detection model (Swensson, 1980) incorporates the idea that radiologists can quickly evaluate large regions of an image. This model assumes that expert radiologists have developed perceptual mechanisms, which act as an initial filter that automatically identifies features that require fmther examination. With experience, these visual mechanisms have been trained to filter out nom1al structures, in order to direct the radiologist's attention to structures that are likely to be abnormalities. Thus, the notion that experts engage in global processing in order to flag poten­tial abnormalities is captured hy both the global-focal model (Nodine and Kundel, 1987) and by the two-stage detection model tSwensson, 1980). As outlined below, a variety of evidence is consistent with these two models.

Early evidence that radiologists engage in global processing was provided by the brief exposure studies, which are sometimes referred to as 'flash studies'. When an in1age is presented for a brief period of time, radiologists can sti ll identify a large number of abnormalities (Cam1ody et aL, 1980a; 1981; Gale et al., 1990; Kundel and Nodine, 1975; Mugglestone el al., 1995; Oestmanri eta\., 1988).

534 · Eyal M. Reingold and Heather Sheridan

Table 29.1 Summary table of studies of eye movements and medical expertise

Reference

Kundel and La Follette (1972)

Kundel (197 4, 1983)

Carmody et al. (1984)

Krupinski (1996a,

1996b, 2000)

Nodine et al., {1996a, 1996b)

Range of expertise Task Key findings

Laypersons, Lung nodules The radiologists showed more efficient scan medical students, in chest paths and required fewer fixations to reach the residents, radiographs abnormalities radiologists

Laypersons, radiologists

Radiology residents, radiology instructors

Radiology residents, Radiologists

Laypersons, residents, technologists, mammographers

Lung nodules The radiologists spent more time fixating the in chest abnormalities in abnormal films, and more time radiographs

Lung nodules in chest radiographs

fixating areas that were more likely contain abnormalities in the normal fi lms

This study reports a discrepancy between verbal reports and eye movements: radiology experts reported that they frequently used bilateral comparison scans (alternating between the same sections on the two lungs). In contrast. radiologists' eye movements showed that th is type of scanning occurred only 4% of the time

Mammograms Radiologists had shorter overall viewing times, faster times to first fixation on abnormality, more efficient scan patterns, covered less of the image, and had longer fa lse-negative durations

Mammograms Mammographers had faster times to first fixation on the abnormality, and longer fixation times on the lesion

Nodine and Laymen, Lung nodules Radiologists had faster times to first fixation on Kundel (1997); radiologists Nodine and Krupinski (1998)

Wooding et al. (1999)

Mean experience in months: 0 (laymen), 2.3 (novices), 16.5 (trainees), 90

in chest the abnormalities in the chest images, but not in radiographs control visual search tasks

Lung nodules The fixation patterns of trainees show the least in chest within-group consistency and the least amount of

radiographs similarity to radiologists, suggesting that tra inees go through a developmental phase characterized by more idiosyncratic patterns of attention

(radiologists) allocation and eye movements -·~-.. ··--···- ··- ··- ···-····-····-·· .. ·- ··- ·- ···- ·- ·-··--·-·-·-··-····--··- ···- ·"-····-······-"···- ···- ··· .. -····-"··-·····--... -...... _ .. ,_,,,_,,,_,,_,,, __

Krupinski (2000)

Nodine et al. (2002)

Residents, mammographers

Trainees, mammographers

Mammograms Observers· eye movements were monitored before and after computer-aided detection (CAD) prompts, which identified locations containing suspicious features that were likely to be a lesion. Prior to the prompt, the mammographers had longer false-negative durations than the residents. After the prompt, the residents had longer durations for all four response outcomes

Mammograms Mammographers were more likely to fixate on the lesion for 1000ms or greater. Both groups showed longer fixations for false-negative than for true-negative responses and this difference was numerically larger for the mammographers

Eye movements and visual expertise in chess and medicine · 535

Table 29.1 (continued) Summary table of studies of eye movements and medical expertise

Reference Range of expertise Task Key findings

Mello-Thoms Residents, Mammograms Mammographers were more consistent than et al. (2002); mammographers residents in terms of their visual search and Mello-Thoms decision-making strategies. For residents, but (2003) not mammographers, there were differences in

local spatial frequency information between missed lesions that attracted visual attention, and those that did not. indicating that the residents' search strategy was more influenced by the local sal iency of the lesions whereas the experts were better able to make use of global patterns

········- ······- ·····- ······- ···- ·····-·····- ·····- ····- ···-·····- ····- ··- ····- ·····- ·····- ·····- ····- ····- ·· ···- ·····- ·····- ·····- ·····- ·····- ·····- ···- ·····-·····- ····- ····- ······- ···- ····-··-"·--'"' Manning et al. (2003, 2006a)

Law et al. (2004)

Kundel and Nodine (2004)

Novices. radiographers, radiologists

Novices, surgeons

Residents, mammography

fellows, mammographers

Lung nodules Increased experience resulted in longer in chest saccades amplitudes, fewer fixations, less radiographs coverage of the image, and faster decision times

Laparoscopic While performing an aiming task that involved surgery touching a small target with the tip of a tool, the simulation experts spent more time fixating on the target,

less time fixating on the tool, and had fewer saccades between the tool and the target

Mammograms Mammographers had longer false-negative durations, were more likely to fixate on lesions, and had faster times to first fixation on abnormalities ----·- --··-· .. -·.-· ... -.... -

Manning et al. (2004, 2006b)

Donovan et al. (2005)

Kocak et al. (2005)

Krupinski (2005)

Novices, radiographers, radiologists

Lung nodules in chest radiographs

All the groups showed longer fixation times for false-negative than true-negative responses, and this effect tended to be larger for the inexperienced groups. The experienced groups covered less of the image

Laymen, first-year Wrist fractures Perceptual feedback about eye movements students, th ird-year resulted in decreased performance by students, experts

Novice, intermediate, expert

Residents, mammographers,

experienced observers and showed a trend

towards improving the performance of less experienced observers

laparoscopic As expertise increased, there was a decrease in surgery the saccadic rate and an associated increase in

simulation mean fixation times, and there was a trend towards larger saccadic amplitudes

Mammograms The mammographers had faster times to first fixation on abnormality, and this effect was greater in magnitude for the more subtle lesions. For both groups, lesions associated With true­positive outcomes were fixated for longer than lesions associated with fa lse-negative outcomes, and subtle lesions were fixated for longer than more obvious lesions

(continued)

536 · Eyal M. Reingold and Heather Sheridan

Table 29.1 (continued) Summary table of studies of eye movements and medical expertise

Reference

Krupinski et al. (2006)

Burgert et al. (2007}

Kundel etal. (2007}

Range of expertise

Medical students, residents, pathologists

13 surgeons with varying levels of expertise

Observers varied in experience, and

expertise was defined based on detection accuracy

Expertise varied

Task

Telepathology

virtual slides

Neck dissection planning

Key findings

The observers· task was to view a low magnification image, and to decide on the top three locations that they would zoom into if they were to continue to examine the slide. The pathologists showed a more efficient scan path with shorter overall viewing times, fewer fixations, fewer saccades and longer saccades. They did not fixate at all , significantly more frequently than the trainees, on some of their preferred zoom locations, indicating greater use of parafoveal and peripheral vision to extract global information

Experienced surgeons showed fewer saccades and longer fixations on areas of interest, indicating more efficient integration of 3-dimensional global information

Mammograms The more accurate performers had faster times

to first fixation on the abnormality, and were more likely to exhibit a long saccade towards the abnormality immediately upon viewing the image, indicating greater peripheral and parafoveal processing of global information

Fracture Leong et al. (2007} widely from officers detection

to surgeons and (shoulder,

For true-positive trials, the more experienced observers spent a smaller proportion of t ime fixating the fracture site but there was no difference between the groups on false-negative t rials. The more experienced observers showed greater consistency in their search patterns. The experts, but not the novices. displayed distinct stages in eye movement search patterns

radiologists

Donovan et al. Laymen, level 1 (2008) students, level 2

students, experts

Litchfield et al. (2008)

Novice radiographers, expert radiographers

hand, and knee)

Lung nodules in chest radiographs

Lung nodules in chest radiographs

Perceptual feedback about eye movements produced improvement across all levels of expertise, with the greatest improvement shown by the level1 group

Both novice and expert radiographers show improved performance after viewing the eye tracking record of another observer who had examined the same image, but the improvements were more pronounced for the novices than the experts

Por example, after viewing chest films for 200 m s, expert radiologists were able to detect 70o/o of the

abnormalities, compared with 97% true-positives with unlimited viewing conditions (Kundel and Nodine, 1975). Similarly, expert mammographers were above chance at identifying abnorm alities

under flash viewing conditions, although they tended to miss the more subtle abnormalities (Mugglestone et al., 1995). To determine the limits of performance under brief exposure conditions,

Carm ody et a!. ( 1980a), systematically varied the distance between the observers' point of fiXation and the location of a chest nodule. Although accuracy decreased as the distance increased, it was

found that a radiologist could detect some nodules that were 15° away, and a Jess experienced film

Eye movements and visual expertise In chess and medicine · 537

reader could detect some noduJes that were 10° away. GiveJl that btief e~posure conditions preclude eye movements toward the abnormalities, it appears that radiologists can identify abnormalities without foveal vision.

The evidence from brief exposure studies was later supplemented by a variety of eye movement f111dings. One prominent finding is that expert radiologists exhibit more efficient scan paths than novices (e.g. Krupinski 1996:.'1, 1996b, 2000; Krupinski et al., 2006; Kundel and La Follette 1972), suggesting that they can use information that is outside of the fovea to guide their search. Figure 29.1 B (taken from Kundel and La Follette, 1972; see also Kundel, 2007) shows examples oft he range of scru1 paths exhibited by medical students, residents, and expert radiologists. As shown in this figure, expert radjo]ogists often exhibit a circumferential scan pattern (Kundel and Wright, 1 969) that involves making a wide sweep around the image with long saccades and a small number of widely spaced fixations. An efficient scan path entails quickly discovering abnormalities while at the same time covering less area with the fovea. Relative to novices, experts exhibit a number of the characteristics associated with efficient scan paths, including fewer f001tions (Krupinski et al., 2006; Manning et al., 2003, 2006a), fewer $aCcades (Burgert et al., 2007; Kocak et al., 2005; Kntpinski et al., 2006; Law et al., 2004 ), longer saccades (Kocak et al., 2005; Krupinski et al., 2006; Manning et al., 2003; Mru1ning et al., 2006a), less coverage of the image (Krupinski; 1996a, 1996b, 2000; Manning, et al., 2003, 2004, 2006a) arriving at the abnormality faster (Kmpinski; 1996a, 1996b, 2000, 2005; Kundel and Nodine, 2004; Kundel et al. , 2007; Nodine et al., 1996a, J996b), and spending a greater proportion of time fixating on abnormalities (Kundel, 1974; Kundel and La Follette, 1972, Kundel and Nodine, 1983, 2004; Nodine et al.. 1996a, 1996b, 2002; but see Leong et al., 2007 ).

In further support of the models, whe11 the number of abnormalities reported is plotted against time, ex11erts display a rapid reporting phase, followed by a more gradual reporting phase (Christensen et al., 1981; Nodine et a!., 2002). This is because the expert can quickly identify many abnormalities with peripheral and para foveal vision, before engaging in a slower search for the more subtle abnor­malities that require foveal vision. In contrast, less experienced observers show a more constant rate of reporting instead of the rapid and slow reporting phases, because the global mode of processing is less developed for them. Given that novices are unable to identify as many abnormalities with their peripheral and parafoveal vision, they must instead examine a large proportion of the image with their fovea.

The rapid reporting phase shown by experts is consistent with eye movement findings that experts fixate many abnormalities within I second of viewing the image (e.g. Kundel et al., 2008). As I sis not enough time for the entire image to be scanned with fovea l vision, the short times to the t1rst fixation on the abnormality provide strong evidence experts use para foveal and peripheral vision to Locate abnormalities. Furthermore, re lative to novices, experts have faster times to their first fixation on an abnormality (Krupinski 1996a, 1996b, 2000, 1005; Kundel and Nodine, 2004; Kundel et al., 2007; Nodine et al., 1996a, l996b). For example, Kundel et al. (2007) showed that the time to fixate the abnormality decreased as the observer's accttracy increased, and the more accurate observers were mme Likely to exhibit a long saccade towards the abnormality inunediately upon viewing the mammogram.

Gaze-contingent window experiments have strongly supported the role of global processing. As in chess research, the gaze-contingent window paradigm has been used to selectively vary the extent to

whlch information is accessible by foveal , parafoveal and peripheral vision. Chest nodules were fixated faster when they were visible by para foveal and peripheral vision, compared to a condition in which the nodule was only visible when fixated directly (within 3.5 or 5.25°) (Kundel et al., 1991 ). Similarly, when nodules are only visible within a central gaze-contingent window,the time it takes to first fiXate the nodule decreases as the window size increases (Kundel et al., 1984). Thus, removing parafoveal and peripheral information decreases search efficiency, as indexed by the amount of time required to fixate the abnormality. Building on these findings, future research could employ the gaze-contingent window paradigm to djrectly compare the size of the perceptual span for medical experts and novices for both domain-relevant and non-relevrull visual information.

The gaze-contingent window findings seem to be consistent with an earlier s tudy that compared a full -image viewing condition to a segmented viewing condition, in which a chest image was divided

538 · Eyal M. Reingold and Heather Sheridan

into six segments that were viewed one at a time (Carmody et al., 1980b). The segmented condition resulted in lower accuracy, suggesting that experts benefited from global viewing conditions. Similarly, Carmody (1984) reported decreased accuracy in a constrained viewing condition that prevented participants from making visual comparisons between different regions of the image. In line with these findings, instructing radiologists to focus on particular regions or features dimin­ishes accuracy in comparison to a free search condition (Swensson et al. , 1982; 1985). Overall, these studies provide evidence that radiology experts perform best under free search conditions that allow them to use a wide field of view to guide their St'arch. As discussed previously, a wide field of view may improve efficiency by directing observers to abnormalities in their peripheral and patafoveal vision. However, global p rocessing may also improve accuracy by facilitating comparisons between different regions of the same image. It has been previously argued that comparison scans (alternating between different regions with foveal vision) play an important role in visual search (Carmody, I 984; Carmody et aJ., 1980b). but it is possible that some visual comparisons rely on global processing. In free search conditions, radiologists may use their para foveal and peripheral vision to detect asym­metries between different parts of the radiograph, and segmented viewing conditions could interfere with this type of processing.

Similar to the above findings from the field of radiology, there is evidence that medical experts in other fields are able to use parafoveal and peripheral information to improve the efficiency of their performance. Por example, a recent study examined the eye movements of medical students, pathol­ogy residents, and practicing pathologists while they viewed telepathology virtual slides (Krupinski et al., 2006). The observers' task was to select three locations that they would zoom into further if they were to continue to exe1mine the sUde. While selecting the three locations, the pathologists showed a more efficjent scan path with fewer fixations and longer saccades. Furthermore, the experts did not fLXate at aU, significantly more frequently than the trainees, on some of their preferred zoom locations, suggesting that they used para foveal and peripheral vision to extract global information.

These observations of pathology experts are consistent with reports that surgeons show fewer saccadcs than novices during laparoscopic surgery s imulation tasks (Kocak et aJ., 2005; Law et al., 2004). Por example, while performing a task that involved touching a small target with the tip of a tool, the experts spent more time fixating on the target, less time fiXating on the tool. and had fewer saccades between the tool and the target (L;1w etal., 2004). This indicates that the e.xpert surgeons were able to direct the tool using peripheral and para foveal vision alone. Thus, the global processing advantage shown by medical experts seems to span a number of different fields, including radiology, pathology, and minimally invasive surgery.

Three main parallels can be drawn between the global processing advantage shown by medical experts, and the larger visual span shown by chess experts. First, in both cases the perceptual encoding advantage appears to be domain specific. Radiologists do not perform better than novices when tested with control visual search tasks that involved searching for the character WALDO and searching for the word NINA (Nodine and Krupinski, 199B; Nodine and Kundel, 1997), and a comparative visual search task that more closely mimics radiology taskssbowed a similar pattern of results between radiologists and laymen (Moise et al., 2005). Second, chess expertise is analogous to expertise in medical diagnosis because both forms of expertise involve extensive, domain specific knowledge of visual configurations (Wood, 1999). This knowledge allows experts to 'chunk' together domain specific information such that they can recognize patterns instead of only seeing individual features (Gunderman et al. , 200 I). Furthermore, it is likely that it is necessary to build up this vocabulary of domain related visual knowledge in order to facilitate the global mode of processing. Third, for both chess players and medical experts, it is possible that not all of this knowledge is accessible to conscious awareness (Heiberg Engel, 2008; Norman et al., 1992). The remainder of this chapter will focus on eye movement findings that suggest that there is an unconscious component to expertise.

Expertise and tacit or implicit knowledge One of the prominent modern philosophers to extensively theorise about implicit or tacit knowledge was Michael Polanyi. In describing the gradual acquisition of the skill with which a blind person uses

Eye movements and visual expertise in chess and medicine · 539

a stick to navigate, Polanyi provides a very eloquent portrayal of non-analytic implicit learning generating implicit knowledge:

Someone using a stick for the fLrst time to feel his way in the dark, will at first feel its impacL against his palm

and fingers when the stick hits an object. But as he learns to use the stick effectively, a tnmsformation of

these jerks will take place into;~ feeling of the point of the stick touching an object; the user of the stick is no

longer artending then to the meaningless Jerks in his hand but attends ji-om them to their meaning at the far end of the stick.

(italics in original; Polanyi, 1969, p.145.)

With practice there is a gradual shift from attending to the manipulation and placement of the stick, to focusing on the objects in the environment which are being probed by it. Just as we are only marginally aware of the movements of our limbs during a leisurely walk (our attention is focused on the sights and sounds around us), a blind person skilled at navigating with a stick might have only a Sltbsidiary awareness of its placement and manipulation. In a sense, the stick is being assimilated as an extension of the skilled user's body.

As the above examples illustrate, implicit knowledge may be considered an .integral part of exper­tise. This can be powerfully demonstrated by considering Expert systems, a subarea of artificial intel­ligence. Practitioners in this area attempt to create computer programs that mimic the performance of human experts. A prerequisite of such an enterprise is an explicit representation of the rules and strategies used by experts to achieve skilled performance (i.e. domain specific knowledge). However, extracting the knowledge of experts turned out to be much more difficult than first assumed. Basing the programs on rules experts claimed they were using resulted in inferior performance. lt is not that experts were uncooperative; rather, they knew much more than they could verbalize. For the purpose of creating expert systems it is important to explicitly represent not only experts ' declarative knowl­edge (i.e. knowing that) , but also their procedural knowledge (i.e. knowing how). Unfortunately, such procedural knowledge is largely implicit (i.e. not easily verbalized or consciously accessed). To overcome this problem, designers of expert systems sometimes referred to as knowledge engineers interrogate expert.s as they undertake a variety of carefully selected skill related examples. Critics of expert systems point out that this effort is only partially successful at making the implicit knowledge of experts explicit. Accordingly, they argue that the performance of expert systems cannot rival that of the best human experts, and that there are res trictions on the nature and scope of the domains for which expert systems can be developed.

Eye movements have the potential to contribute to the study of experlise by revealing information about ongoing cognitive processing that is not consciously accessible to the expert. Previously, in the chess section of this chapter, we briefly discussed evidence for automatic and parallel extraction of chess related patterns in the absence of conscious control (Reingold and Charness, 2005; Reingold et al., 200lb) and evidence that chess players' eye movements provide information that is not contained in their retrospective reports (Bilali et al., 2008). We now focus on the case of medical ex.pertise, in order to highlight the consistent finding that even when radiologists fail to report abnormalities, their eye movements still differentiate between the missed abnormalities and abnor­mality-free areas (for a reviews, see Krupinski et al., 1998; Krupinski and Borah, 2006). We will argue that both chess and medicine findings suggest that eye tracking is uniquely suited for studying the unconscious component of expertise.

Eye movements, implicit knowledge and radiology

In the field of radiology, it has been reported that inter-rater variability is high and abnorrnaJities are missed as frequently as 30% of the time (e.g. Austin et a!., I 992; Bird et a!., J 992; Birkelo, Chamberlain et al., 1947; Guiss and Kuenstler, 1960). Eye tracking has been used to investigate why many abnormalities are missed even though they are visible in ret rospect. Misses (referred to as false­negative responses) are frequently classified as either scanning errors, recognition errors, or decision­making errors (Kundel et al., 1978). Scanning errors occur when the radiologist does not frxate near

540 · Eyal M. Reingold and Heather Sheridan

the abnormality. Recognition errors occur when the region containing the abnormality is fixated for a short period of time, indicating a failure to recognize the presence of a potential abnormality. Decision-making errors occur when the abnormality was fiXated for a long period of time, indicating that the observer might have recognized the presence of a potential abnormality, but they incorrectly decided that the region was normal. Typically, false-negative responses are categorized as recognition errors if the cumulative cluster duration on the abnom1ality is less thail a threshold value and as decision -making errors if the cumulative duster duration is above the threshold value. For example, Kundel et al., ( 1978) used ail 800-ms tllreshold value and found that 30o/o of false-negative responses were considered scanning errors, 25% were recognition errors, aild 45o/o were decis ion-making errors. Subsequent research has confirmed that a substantial proportion of false-negative responses can be categorized on this basis as a failure of decision-making (e.g. Berbaum et aL, 1996, 200 I; Krupinski and Nishikawa, 1997; Kundel et al., 1989; Manning et al. , 2004; Manning et al., 2006b). Furthermore, an additional source of false-negative errors that has been investigated extensively is the satisfaction of search (SOS) effect (Berbaum eta I., 2000; Berbaum et al., 1990, 2010), which is the finding that radiologists are less accurate at detecting subtle abnormalities if additional abnormalities are present on the same racliograph.

However, regardless of how false-negative errors are classified, it is clear that missed abnormalities are fixated for a prolonged period of time relative 10 abnormality-free regions. There are two reasons why this eye movement finding might constitute an example of implicit knowledge. First, radiologists adopt a lenient response criterion (Scheft, 1963) for detecting abnormalities because the costs associ­ate.d with missed abnormalities (the critical importailce of early detection in improving outcomes) far exceed the costs associated with false alarms (patient arlXiety and the costs of additional testing). Due to this bias towards avoiding a miss, radiologists are taught to report any evidence that indicates the presence of a possible abnormality, even if this evidence is weak. Given that the knowledge reflected by the eye movement record on missed abnormalities was not reflected in the overt decision, it was likely iliat it was not consciously accessible at the time of the decision. Second, providing racli­ology experts with feedback about their eye movements has been shown to improve accuracy, suggest­ing that the eye movement record reflected information that was previously nor consciously accessible to the expert (Kundel et al., 1990). As outlined below, although further work is required, this line of research has the pOtential to contribute to our understandb1g of implicit knowledge and e).1Jertise.

Given that radiology images do not have predefined interest areas, a set of procedures was devel­oped for categorizing and grouping fiXations. As detailed by Nodine et al. (1992), fiXation clusters are formed by summing together nearby consecutive fixations. This is done by setting a spatial threshold (typically 2.5°) and then comparing tl1e location of each new ftxation to tlle meail location of a1l of the previous fixations in the current cluster. If the distance of the new fiXation from the cluster mean is below a threshold of 2.5°, it is grouped with the previous fixations in that duster, and if it exceeds the threshold, the new fiXation marks the start of a new cluster. Given that regions are sometimes refixated, to compute the total time associ-ated with a region throughout the trial (referred to as the cumulative cluster duration) dusters are summed whenever the centre of two or more clusters in a trial is closer than the threshold of 2.SO.

Cumulative dusters can be associated with four different response outcomes, depending on whether or not there was ail abnormality in the fixated region, and whether or not an abnormality was reported. When the centre of a cumulative cluster is within 2.5° degrees of the centre of an abnormality, it is associated with a true-positive (TP) response (i.e. a hit) if the abnormality was later reported, and with a false-uegative (FN) response (i.e. a miss) if the abnormality was not reported. For abnormality-free regions, cumulative clusters are associated with a false-positive (PP) response (i.e. a false alarm) if the observer had incorrectly reported an abnormal.ity in a location that was less than 2.5° from ilie centre of tl1e cumulative cluster. Cumulative clusters falling in unreported, abnor­mality-free regions are associated with tme-negative (TN) responses (i.e. a correct rejection).

Subsequent research consistently documented that ilie longest cumulative duster durations are asso­ciated with true-positive aild false-positive responses, with somewhat lower durations for false-negative responses, and the lowest durations for true-negative responses. As can be seen fwm Fig. 29.5A,

A

0 Q) (/)

.s c 0

·~ :l '0

Cii (ii :l 0

~ ·~ :; E :l

(.)

B

0 Q) (/)

2500

2000

1500

1000

500

0

2500

.s 2000 c 0

~ 1500 :l '0

* :l 1000 0 Q) > ~ :; 500 E :l (.)

0

Eye movements and visual expertise in chess and medicine · 541

• True-positive D False-negative

[I] True-negative

Chest Mammography Bone trauma Bone fractures

• True-positive D False-negative

Mass Calc Mass Calc Experienced Inexperienced

Fig. 29.5 Cumulative cluster duration as a function of response outcome and image type (Chest data from Kundel et al., 1990, Mammography data from Krupinski, 1996a, bone trauma data from Krupinski and Lund, 1997, bone fractures data from Hu et al. 1994) (A), and cumulative cluster duration as a function of response outcome and level of expertise in mass and microcalcification cluster (Calc) mammogram search tasks (from Krupinski, 1996a) (B). Reprinted from Academic Radiology, 3(2), Elizabeth A. Krupinski, Visual scanning patterns of radiologists searching mammograms, pp. 137-44, Copyright (1996), with permission from Elsevier.

the finding that cumulative cluster durations are longer for false-negative responses than true-nega­tive responses is very general and was demonstrated for a variety of image types, including chest x-rays (e.g. Kundel eta!., 1978, 1989, 1990; Krupinski and Roehrig, 2002; Manning eta!., 2004, 2006b ), mammography (e.g. Krupinski, 1996a, 1996b; Krupinski and Roehrig, 2000; Krupinski et al., 1999; Nodine eta!., 2002; Nodine eta!., 2001), bone trauma and fractures (e.g. Hu et al., 1994; Krupinski and Lw1d, 1997). As shown in Fig. 29.5B, this effect was also demonstrated across different levels of expertise (e.g. Krupinski 1996a, 1996b; Manning et at., 2006b ).

The finding that eye movements consistently distinguish between false-negative and true-negative responses has been exploited by researchers in order to develop a visual feedback method for improving diagnostic accuracy (Kundel eta!., 1990). The visual feedback method involves monitoring

542 · Eyal M. Reingold and Heather Sheridan

A

B

.2:­·:; ; ·u; c:: Q) (/)

c:: 0

n Q)

Q) u .£ Q) Ol c:: <0

.I:: (.)

0.35

0.3

0.25

0.2

0.15

0.1

o.os

0

-O.OS

- 0.1

-0.1S A1 A2 A3 A4 AS

Feedback

A6 A1 A2 A3 A4 AS A6

No feedback

Fig. 29.6 An illustration of the feedback paradigm used by Kundel et al. (1990). A) Feedback was provided in the form of circles (5° in diameter) that were overlaid on top of an image to indicate the locations that were fixated for greater than a threshold duration that is typically set at 1000 ms. The white arrow (not shown in the study) indicates the location of an abnormality. B) Change in detection sensitivity (as measured by the area under the AFROC curve) for six observers by condition (feedback vs. no feedback).

radiologists' eye movements while they make an initial decision about the p resence or the absence of an abnormality, and then providing them with feedback about the locations and durations of their eye movements. On the basis of this feedback, the radiologist may then either confirm or revise their init ial decision. Typically, the feedback is provided in the form of circles (5° in diameter) that are overlaid on top of the image to i11dicate the locations that were fixated for greater than a threshold duration that is typically set at 1000 ms (see Fig. 2.9.6A).

Al though earl ier work had discussed the possibility of using perceptual feedback about eye movements to improve performance (Nodine and Kundel, 1987), the idea was first tested by Kundel et al. (1990) . Six radiology residents examined chest images for lung nodules for ISs while their eye movements were monitored. After making an initial decision about the locations containing nodules, they either received feedback about the regions that they looked a t for 1000 ms or greater, or they were given another look without any feedback. In both conditions, they were given an unlimited

Eye movements and visual expert ise in chess and medicine · 543

amount of time during the second viewing, and they could either confirm or revise their initial deci­sion. The same observers repeated the experiment 2 months later in order to counterbalance these two conditions. Importantly, in the feedback condition, performance improved by an average of 16% after the second view, relative to the injtial view, while the no feedback condition led to an average decrease of3% (see Fig. 29.68). The performance benefit in the feedback condition reflected both an increase in the number of true-positives, and a decrease in the number of false-positives, and performance was measured by calculating the area under the AFROC curve (Chakraborty and Winter, 1990), wh.ich is a variant of the ROC analysis technique that is adapted for cases in which multiple responses are associated with a single image.

Building on the finding that visual feedback improves lung nodule detection, Banett and Trainer (1997) suggested that local image enhancement (such as contrast adjustments) could be made in regions tbat received prolonged dwell times. Furthermore, it has been reported that both novice and expert radiographers show improved performance after viewing the eye tracking record of another observer who examined the same image, although the improvements we.re more pronounced for the novices than the experts (Litchfield et al. , 2008). In addition to these eye tracking variations on the original method, researchers have proposed non-eye tracking feedback methods. These methods include the monitoring of interpretation times for mammograms (Saunders and Samei, 2006), and the pattern and duration of the zooming choices made by observers while usjng an interface that allows them to zoom in or out of particular regions (Mello-Thoms et al., 2000).

Whil.e the finding reported by Kundel eta I. ( 1990) was replicated using another lung nodule detec­tion task (Donovan et al. , 2008), a 12% improvement ln the case of mammography (Nodine et al., 2001) was not significant. In addition, for bone fractures feedhack did not improve performance (Donovan et al., 2005). There are a number of factors that might influence the effectiveness of feedback, including tbe physical appearance of the cue itself (Krupinski ct al. , 1993a, 1993b), the experience of the observers and the complexity of the task (Donovan et at., 2005; Nodine, et al., 2001 ), the amount· of time spent viewing the image ( Kmpinski, et al. 1998), and the threshold selected (Krupinski, et al. 1998). As shown in Figs. 29.7 and 29.8, survival curves can be used to make predic­tions about the efficacy of different feedback thresholds (Krupinski et al. , 1998). Survival curves are cre9ted by plotting the proportion of curuuJative cluster durations that were at least as long as (i.e. survived) certain duration cut-off points. The lour response types (TP, FP, FN, TN) are plotted separately in order to compare their different time-courses. For exa mple, as can be seen in Fig. 29.7, for all image types, a greater proportion of FN responses than TN responses survived the I 000-ms cut-off point. For the purpose of feedback, a cut-off point should be selected such that it maximizes the number of circles containing abnormalities, and minimizes the number of circles without abnor­malities. An inspection of the survival curves in Fig. 29.7 reveals that for the case of chest images the 1000-ms threshold is the most effective. In contrast, for mammograms, it appears that a higher threshold might be more effective than lOOO ms at differentiating between FN and TN responses. Finally, for bone trauma and bone fractures, although there is still a difference between FN and TN responses, the difference is very small, suggesting that feedback would be less effective for these image types, which is consistent with empirical findings (Donovan et aL, 2005). In considering the effec­tiveness of tbe feedback, in addition to considering the different image domains and detection tasks, it migbt be important to consider the characteristics of the observer. The few studies that looked at the effects of expertise revealed that the FN and TN difference is quite consistent across Levels of expertise. Krupinski ( l996a) reported no signitlcant differences between groups, although the difference between FN and TN cumulative cluster durations was numericaUy larger for radiologists than residents (see Fig. 29.58). In addition, as shown iu Fig. 29.8, Manning et al. (2006b) reported that FN responses were significantly higher than TN responses for all levels of experience. PP responses were longer thaJl TP responses for the three more cX'perienced groups, while the reverse was true for novices. Importantly, at least in the case of mammography and chest nodules, it appears that feedback could be effective for a wide range of levels of expertise.

To sum up, eye movements consistently distinguish between missed abnormalities and abnormality­free areas and it is possible that radiologists are not always aware of their prolonged fixations on

544 · Eyal M. Reingold and Heather Sheridan

100

80

"0 ~ 60 (ij > -~

40 :J (J)

20

0

100

80

~ 60 (ij >

-~ 40 :J

(J)

20

0

Chest -o- TP -<:~- FP _..._ FN -TN

1 000 2000 3000 4000 5000 Cumulative cluster

duration (msec)

Bone-trauma -o- TP -<:!- FP -..t.- FN - TN

1 000 2000 3000 4000 5000

Cumulative cluster duration (msec)

100

80

~ 60 (ij > -~ 40 (J)

20

0

100

80

* 60 (ij > -~ 40 :J

(J)

20

0

-o- TP Mammography -<:~- FP

_..._ FN - TN

1 000 2000 3000 4000 5000 Cumulative cluster

duration (msec)

-o-TP -<:!- FP _.,.__ FN - TN

1 000 2000 3000 4000 5000 Cumulative cluster

duration (msec)

Fig. 29.7 Survival curves (created by plotting the percentage of cumulative clusters that are longer than certain threshold values) as a function of image type (Chest data from Kundel et al. , 1990, Mammography data from Krupinski, 1996a, bone trauma data from Krupinski and Lund, 1997, bone f ractures data from Hu et al. 1994).

missed abnormalities. There is evidence that radiologists cannot accurately report the pattern of eye movements that they use to examine an image (Carmody et al., 1984) and due to the lenient response criterion adopted by radiologists (Scheff, 1963), the fact that they do not make use of all of the infor­mation reflected in the eye movement record suggests that at least part of this information was not available to them at the time of the overt decision. Consistent with this notion, providing radiologists with feedback about their eye movements has been shown to improve accuracy in the case of lung nodule detection (Kundel et al., 1990). However, although the feedback paradigm seems promising, further research is needed to determine the factors that maximize its effectiveness and to determine the mechanisms through which feedback improves performance. It has been suggested that feedback works by focusing attention on specific locations containing abnormalities while at the same time reducing interference from the surrounding regions (Krupinski et al., 1993a, 1993b). However, as discussed above, feedback might also be effective because it provides an additional source of infor­mation that was previously not consciously accessible to the radiologists.

Conclusion

The present review illustrates that eye movement paradigms may prove invaluable in supplement­ing traditional measures of performance such as RT, accuracy, and verbal reports as a means for

Eye movements and visual expertise in chess and medicine · 545

100

80

~ 60

«i > 'i! ~ 40

20

100

80

~

~ 60

«i > 'i! ::> 40 (f)

20

Radiologists

1000 2000 3000 4000 5000 6000

~TP

-o- FP -FN --TN

7000 8000 9000

Cumulative cluster duration (msec)

~TP Radiographers -o-- FP

Pre-training - FN --TN

1 000 2000 3000 4000 5000 6000 7000 8000 9000

Cumulative cluster duration (msec)

100

80

20

100

80

~ !?.... 60

(ij > 'i! ::> (f) 40

20

Radiographers Post-training

~TP

--o-FP - FN --TN

1000 2000 3000 4000 5000 6000 7000 8000 9000

Cumulative cluster duration (msec)

Novices ~TP

--o- FP -FN --TN

1000 2000 3000 4000 5000 6000 7000 8000 9000

Cumulative cluster duration (msec)

Fig. 29.8 Survival curves (created by plotting the percentage of cumulative clusters that are longer than certa in threshold values) as a function of expertise. Based on data from Manning et al. (2006b).

understanding human expertise in general, and visual expertise in chess and medicine in particular. Specifically, by employing eye-movement methodology, the research reviewed here provided power­ful and direct evidence for the suggestion of de Groot ( 1946/1965) and Chase and Simon ( 1973a, 1973b) that a perceptual advantage is a fundamental component of chess skill and that in line with the global-focal search model (Nodine and Kundel, 1987) a global processing advantage is a crucial aspect of visual expertise in medicine. In addition, evidence derived from eye movement studies suggests the occurrence of unconscious or implicit processing of domain related patterns by experts, and such findings might begin to illuminate this important but as of yet relatively unexplored topic. In addition to its theoretical significance, the study of expertise and eye movements could lead to practical applications, such as more effective training programmes, expert systems and tools for computer aided diagnosis.

References Ahlstrom, U. and Friedman-Berg, F.J. (2006). Using eye movement activity as a correlate of cognitive workload.

lnternatiqnal Journal of Industrial Ergonomics, 36(7), 6234536. Austin, J.H.M., Romney, B.M., and Goldsmith, L.S. ( 1992). Missed bronchogenic carcinoma: radiographic findings in

27 patients with a potentially respectable lesion evident in retrospect. Radiology, 182, 115-22. Barrett, J.R. and Trainor, F. ( 1997). Eye movements can control localized image enhancement and analysis. Radiographies,

17, 525-530. Behets, D. ( 1996). Comparison of visual information processing between preservice students and experienced physical

education teachers. journal ofTeaching in Physical Education, 16( I), 79-87.

546 · Eyal M. Reingold and Heather Sheridan

Berbaum K.S., Franken, E. A. Jr, Dorfman D.O., Rooholamini, S.A .. Kathol, M.H., BariOOII, T.j., eta/, ( 1990). Satisfaction of search in diagnostic radiology. fnvestigative Radiology, 25, 133-140.

Berbaum, K.S., Franken, E.A., Dorfman, D.O., Miller, E.M., Krupinski, E.A., Kreinbring, K., eta/. (1996). Cause of satisfaction of s~arch effects in contrast studies of the abdomen. Academic Radiology, 3, 81 S-826.

Berbaum, K.S., Dorfman, D. D., Franken, E.A., Jr, and CaldweU, R.T. (2000). Proper ROC analysis and joint ROC analysis of the satisfaction of search effect in chest radiography. Academic Radiology, 7, 94S-958.

Berbaum, K.S., Brandser, E.A., Franken, E.A., Dorfman, D.O., Caldwell, R.T., and Krupinski. E.A. (2001). Gaze dweU times on acute 1 rauma injuries m issed because of satisfaction of search. Amdemic Radiology, 8, 304-314.

Berbaum, K.S., Franken, E., Caldwell, R., and Schartt, K. (2010). Satisfaction of search in traditional rad iographic imaging. In E. Samei and E. Krupinski (eds .) Tlte haudbook of medic<1l image perceptior~ and rechr~iques (pp. 107-138). Cambridge: Cambridge Un iversity Press.

BilaliC, M., Mcleod, P., and Gobet, F. (2008). Why good thoughts block better ones: The mechanism of the pernicious Einstdlung (set) effect. Cognition, 108, 652-661.

Binet, A. (1894). Psyd1ologie des Grmtds Calculntettrs et }oue~rs d'F.checs. Paris: Hachette. Bird, R.E., Wallace, T .W., and Yankaskas. B.C. (1992). Analysis of cancers missed at screening mammography, Radiology,

184(3),613-6 17. Birkelo, C. C., Chamberlain, W.E., Phelps. P.S., Schools, P.F. .. Zacks. D .. and Yerushalmy, J, ( 1947). Tuberculosis case

finding. A comparison of t he effectiveness of various roentgenographic and photofluorographic methods. foumal of the Amerimn Medical AssodatiOil, 133,359-366.

Bond, G.D. (2008). Deception detection expertise. Lawnndlwmmt behavior, 32(4), 339-5 1. Burgert, 0., Orn, V., Velichkovsky. B.M., Gessat, M., Joos, M .. Strouss. G., et11l. (2007, March 8). Evaluation of perception

performance in neck dissection planning using eye tracking and atrentionlandscapes. Paper presented at the Medical lmaging 2007: Image Perception, Observer Perfonnance,and Technology Assessment, San Diego, CA, USA.

Butsc.b, ILL. ( 1932). Eye movements and the eye-hand span in typewriting. }oumal of Educatimllll Psychology, 23(2), 104-121.

Carmody, D.P. (1984). Lung tumour identification: Decision-making and comparison scann ing-Advances in Psychology. In A.G. Gale, and F.Johnson (eds.) Theoretical artd Applied Aspects of Eye Movement Resectrch, Selected/Edited Proceedings of Tire Second European Conference on Eye Moveuumts (VoL 22, pp. 305-312). North-Holland,

Chakraborty, D.P. and Winter, U LL. ( 1990), Free-respoos{ methodology: Alternative a.11alysis and a new ob~erver performance experiment. RadicJ/ogy, 174(3 ), 873-88 I.

Carmody, D.P., Nodine, C.P., and Kundel, H.L. ( 1980a). Ar. analysis of perceptual and cognitive factors in radiographic interpretation. Perception, 9, 339-344.

Carmody, D.P., Nodine, C. F., and Kundd, I 1.1~ ( 1980b). Global and segmented search for lung nodule~ of different edge grad ients. Investigative Radiology, 15, 224-233.

Carmody, D.P., Nodine, C. F., and Kundel, H.L ( 1981). l' ind ing lung nodules with and without comparative visual scanning. Percept lou and Psyclroplrysics, 29,594-598.

Carmody, D.P., Kundel, H.L, and Toto, I .. C. ( 1984), Comparison scans while reading chest images: taught, but not practiced. lnvestigtllive Radiology, 19(5), 462-466.

Charness., N. (1992). The impact of chess research on cognitive $dence. Psychological Research, 54, 4-9. Charness, N., Reingold, E.M .. Pomplun, M., and Stampe. D.M. (200 1 ). The perceptual aspect of skilled performance in

chess: Evidence from eye movements. Memory n11d Cognirion, 29, I 146-1 I 52. Chase, W.C. and Simon, H.A. ( 1973a). Perception in chess. Cognitive Psyclwlogy, 4, 55-8 I. Chase, W.G. and Simon, H.A. ( 1973b). The mind's eye in chess. In W.G. Chase (ed. ) Visual infomwtion processing

(pp. 21S-281). New Yofk: Academic Press. Christensen, E.£ .. Murry, R.C., Holland, K., R<'ynolds, )" I.anday, M.)., and Moore, ).G. (1981). The effect of search time

on perception. Radiology, 138(2), 361-365.

Cleveland, A.A. ( 1907). The psychology of chess and oflearn ing to play it..Americm1/ounwl of Psychology, 18, 269-308. de Groot, A.D. ( 1946). Het deukeu va11 deu sclwker. Amsterdam: Noord HoUandsche. de Groot, A.D. ( 1965). T/Jougln and choice ill d1ess. The Hague: Mouton. de Groot, A.D. and Gobel, F. (1996). Perceprion and memory in chess. Assen (The Netherlands): Van Gorcum. Djakow, LN., Petrowski, N.W., und Rud ik, P.A. (1927) . Psychologie des Sdwdupiels. Berlin: de Gruyterc Donovan, T., Manning, 0.)., and Crawford. T. (2008, February 20). Performance changes in lung nodule detection

following perceptual feedback of eye movements. Paper presented at the Medical Imaging 2008: Image Perception, Observer Performance, and Technology Assessment, San Diego, CA, USA.

Donovan, T., Mannfng, D.J .• Phill ips, P.W., Higham, S .. anci Crawford, T, (2005. February 15). The effect offeedback on performance in a fracture detection task. Paper presented at the Medical Imaging 2005: Image Perception, Observer Performance, and Technology Assessment, San Diego, CA. USA.

Dyer, A. G., Found, B .. and Rogers, D. (2006) . Visual attention and expertise for forensic signat ure analysis. fottmol of Forensic Sciences, 5 1(6) , 1397-404.

Ericsson, K.A. and Charness, N. ( 1994). Expert performance: Its structure and acquisition. American Psyclrologist, 49, 72S-747. Gale, A.G., Vernon,)., Miller, K., and Worthington, B.S. ( IWO). Reponing in a nash. British /oumnl of Radiology, 63,571.

Eye movements and visual expert ise in chess and medicine · 547

Gobet, F. and Simon, H.A. ( 1996a). Recall of rnpid ly presentro ra ndom chess positio ns is a function of skill. Psyc/lonomic Bnlletinmul Review, 3, 159-163.

Gobet, F. and Simon , I I.A. ( 1 \196b). Templates in chess mem~ry: A mechanism for recall ing several boRrds. Cognitive Psychology. 31, 1-40.

Gobel, F. ( 1998). Expert m e mory: a comparison of four theooies , Cugrritiun, 66(2), 115-152.

Gobet, F. and Simon, H.A. (2000). Five seconds o r sixty? Presentation time in expert memory. Olgnitive Science, 24, 651-682. Goolsb y, T. ( 1989). Computer applications to eye movement researc h in music reading. Psyc/Jomusicology, 8(2), 111-126.

Guiss, LW. and Kuenstler, P. ( 1960). A retrospective vlew of survey photo fluoroca rbons of persons with lung cancer. Cmocer, 13, 91-95.

G underman, R., Williamson, K. Fraley, R .• and Steele, }. (2001 ). Expertise: implications for radiological education. Academic Radiology. 8, 1252-6.

Heiberg Engel, P.J. (2008). Tacit knowledge and visual expertise in medical diagnostic reasoning: Implications for m edical education. Medical Tenc/Ler, 30, cl84-el88.

Hills, B.L. ( 1980). Vision, visibility, and perception in driving. Perception, 9, 183-216.

Holding, D.H. ( 1985). 1'hepsyclwlog)' of chess skill. H illsdalc, N.J: Erlb<~um.

Hu, C. H., Kundel, H.L .. Nodine, C.r., Krupinski, E.A., and Toto, L.C. ( 1994). Se-Jrching for bone fnoctures: a compariso n with pulmonary nodule search . Artldemir Radiology. 1, 25-32.

Inhoff, A.W. and Gordon, A.M. ( 1997). Eye movements a nd eye-hand coordination during typing. Current Directi011S in Psychologicnl Science. 6(6), 153-157.

Jacohs, A.M. ( 1986). Eye movement control in visual se.arch: How dired is visual span cont rol? Perception nnrl Ps)'clwphysics, 39. 47-58.

Jarodzka, H .. Scheiter, K., Gerjels, P., and Van Gog, T . (20 10). In th~ eyes of the beholder: How expe rts and novices interpret dynamic Stimuli. Leaming nnd l.llslmction, 20(2), 146-154.

Iongman, R. W. ( 1968). 1-fet oog va11 de meester ITbe eye o f the master]. A.ssen (The Netherl"nds): Van Gorcu m. Kocak, E., Ober,l .• Bcrme, N., a nd Melvin, W.S. (2005 ). Eye motion parameters correlate with level of experience in

video·assisted surgery: Objective testing of th ree tasks. /oumal ofLnpnroenrloscopic nnd Advn11ced Surgical Tedmiqnes, 1S, 575-580.

Kozbelt, A. and Seeley, W.P. (2007). Integrating nrt historical, psychologi.:al, and neuroscien tific explanations of artists'

advantages in dr.twing and perception. Psychology of Aestlwics, Cremivity, and the Arts, l ( 2), 80-90.

Krupinski, E.A. ( i996a}. Visual scanning patterns of r:odiologists searching mammogr:oms. Amdemir Radiology, 3 , 137- 144. Krupinski, E.A. ( 1996b). lnllucnce of experience on scanning strategies in mammography. SP/ E, 2712, 95-10 I. Krupinski, E.A. (2000, January 24). Medical image percept ion: Evalua ting the role of expe rien ce. Paper presented at the

Human Vision and Electronic lmaging V, San Jose, CA. USA.. Krupinski, E.A. (2005). Visual search of mammographic image;: in.tluence oflesion subtlety. Acade111ic Rndiology, 12, 965-969.

Krupinski, E. and Bo rah, J. (2006). Eye trncki11g helps improve acc uracy in radiology. Biophotonics flltc>rllntiorw/, 13(6), 44-49.

Krupinski, E.A. and Lund, P.}. ( 1997). Differences in time to interpretation for evaluat ion of bone radiographs with monitor and film viewing. Acndemic Radiolog>•, 4 , 177-182.

Krupinski, E.A. and Nishikawa, R.M. (1997). Comparison of ~ye position versus computer identified microcalcificat1on

clusters on mammograms. Medical Physics, 24, 17- 23.

Krupinski, E.A., Nodine, C. F., and Kundel, H, L. ( 1993a). Pe rceptua l enhancement of tumor targets in chest X-ray image.s. Perception and Psychophysics, 53, 519-526.

Krupinski, E.A., Nodine, C. F., and Kundel, H..L, ( 1993b). A pt rceptuaUy L>ased method for enhancing pulmonary tumor recognition. /m,e.stigative Radiology, 28, 289- 294.

Krupinski, E.A. Nodine, C.F., and Kundel, H . .L. (1998). Enhancing recogn it ion oflcsions in r:~d iographic images using perceptual feedback. Optical Engineering, 37(3), 813-818.

Krupinski, E.A. and Roehrig, H. (2000). The inllucnce of a p£'1'ceptually linearized display o n observer performa11ce and

visual search. Academic Radiology, 7, 6-13.

Krupinski, E.A. a nd Roehrig, H. (2002). Pulmonary nodule detection and visual search: P45 and P104 monochrome versus color monitor displays. Academic Ratliolog>•. 9, 638-645.

Krupinski, E., Roehrig, H .• aud Fu ru kawa, T, ( 1999). Influence offtlm and monitor display lum inance o n observer performance and visual sea rch. Academic Radiology, 6, 411-418.

Krupinski, E.A., Tillack, A.A., Richter, L., Henderson, }.T., Uhattacharyya, A.K., Scott, K.M., et a/, (2006). Eye· movem en t

study an d human performance us ing tclepathology virtual slides. Implications for med ical educa tion and differences

with experience. Human Pathology, 37, 1543-1556.

Kundel, H.L. ( 1974). Visual sampling and estimates of the location o f in formation o n chest fi lms. Invest Radio/, 9 , 87- 93.

Kundel, H.L. (2007, February 21 ). How to minimize perceptual error and m axim ize expertise fn medical imaging. Paper

presented at the Medical Imaging 2007: Image Pe rception, Observer Performance, and Technology Assessment, San Diego, CA, USA.

Kundel, H.L. and La PoUene, P.S. Jr ( 1972). Visual search patterns and experie nce with radiological images. Radiology. 103,

523-528.

Kundel H.L. and Nodine, C. F. ( 1975). Interpre ting c hest radiogra phs wi thout visual search. Radi(>/ogy, 116 (3), 527- 532.

548 · Eyal M. Reingold and Heather Sheridan

Kunde.!, H.L and Nodine, C. F. ( 1983). A visual concept shapes image perception. Radiology, 146, 363-368. Kundel, H.L. and Nodine, C. F. (2004, l'ebruary 17) . Modeling visual search during mammogram view·ing. Paper presented

at the Medical Imaging 2004: Image Perception, Observer Performance, and Technology Assessment, San Diego, CA, USA. Kundel. H.L. and Wright, D,J. ( 1969). The influence of prior knowledge on visual search strategies during the viewing of

chest radiographs. l<adiology, 93, 31 S-320.

Kundel, H.L., Nodine, C.f., and Carmody, D.P. (1978). Visu1l scamting, pattern recognition and decision making in pulmonary nodule detection. lnvesrigarive Radiology, 13, 175-181.

Kundel, H.L., Nodine, C. F., and Krupinski, E.A. ( 1989). Searching fot lung nodules: Visual dweU indicates location of false-positive and false-negative decisions. ln vesriga rive Radiology, 4, 472-478.

Kundel, H.L., Nodine, C. F., and Krupinski, E.A. ( 1990). Computer-displayed eye position as a visual aid to pulmonary nodule interpretation. Jn vesrigarive Radiology, 25, 890-896.

Kundel. H.L., Nodine, C.l'., Conant, E. F., and Weinstein, S.P. (2007). Holistic component of image perception in mammogram interpretation: Gaze· tracking study. Radiology, 242, 39H02.

Kwtdel, H.L.. Nodine, C. F., Krupinski, B.A., and Mello-Thoms, C. (2008) . Using gaze-tracking data and mixture distribution analysis to support a holistic model for the detection of cancers on mammOj,'Tams. Academic Radiology, 15, 881-886.

Kundel, ILL, Nodine, C. F., and Toto, L. (1984). Eye movements and the detection oflung tumors in chest images-Advan~es in l'sychology. In A. G. Cale, and F. Johnson (eds.) Tlreore,ical and Applied Aspects of Jiye Movemerrt Research, Selected/ Edited Proceedil1gs of1'/JeSecond European Conference on Eye Movemems (Vol. 22, pp. 297- 304). Elsevier: North ·HoUand.

Kundel. H.L. Nodine, C.F., and Toto, L. ( 1991 ). Searching for lung nodules. The guidance of visual scanning. lnvesrigati,,e Radiology, 26, 777-781.

Land, M.F. and McLeod, P. (2000). Front eye movements to actions: how babmen hillhe ball. Nature neuroscier1ce, 3(12), 1340-5.

Law, B .. Atkins, M.S., Kirkpatrick, A.£., Lomax, A.} ., and Mackenzie, C.L. (2004). Eye gaze palterns differentiate novice and experts in a virtuallaparoscopic surgery training environment. In A. Duchowsk;i, and R.. Vertegan (eds.) Proceedings of the 2004 Sympositrm on Eye Tracking Resrarc11 and Applicatiom (pp. 41-48). San Antonio, TX: Association for Computing Machinery.

Leong,}.). H., Nicolaou, M., Emery, R.]., Darzi, A.W., and Yang, G.Z. (2007). Visual search behaviour in skeletal radiographs; A cross-speciality study. Cli11iml Radiology, 62, 1069- 1077.

Litchfield, D., Ball. l.)., Donovan,T., Manning, D.j., and Crawford, T. (2008, February 20). Learning from others: Effects of viewing another person's eye movements while s~arching for chest nodules. Paper presented at the Medical Imaging 2008: Image Perception, Observer Performance, ~nd Technology Assessment, San Diego, CA, USA.

tocher, P. (2006) . The usefulness of eye movement recordings to subject an aesthetic episode with visual art to empirical scrutiny. Psychology Science, 48(2), 106-114.

MadeU, j. and Heben, S. (2008). Eye Movements and Music Reading: Where Do We Look Next? Music PerceptioiT, 26(2), 157- 170.

Mann, D.T., Williams, A.M., Ward, I'~ and Janelle, C.M. (2007). Perceptual-cognitive expertise in sport: a meta-analysis. Journal of Sport a11d Exercise Psychology, 29( 4), 457-78.

Manning, D., Ethcll, S.C., and Crawford, T. (2003, February 18). An eye-tracking AFROC study of the influence of experience and training on chest x· ray interpretation. Paper presented at the Medical Imaging 2003: Image Perception, Obse.rver Performance, and Technology Assessrnent. San Diego. CA, USA.

Manning, D .• Ethell, S., and Donovan, T, (2004, February 171. Categories of observer error from eye-tracking and AFROC data. Paper presented at the Medical Imaging 2004: Image Perception, Observer Performance, and Technology Assessment, SaJJ Diego, CA, USA.

Manning, 1)., Ethel!, S., Donovan, T., anc.l Crawford. '1'. (2006a). How do radiologists do it? The intluencc of experience and training on searching for chest nodules. Radiogtnphy, 12, 134-142.

Manning, 0., Barker-Mill, S.C., Donovan, T., and Crawford, T. (2006ll). Time-dependent observer errors in pulmonary nodule detection. British journal of Radiology, 79, 342-346.

McCarley,].$., Kramer, A.F., Wickens, C.D .• Vidoni, E.D., and Boot, W.R. (2004). Visual skills in airport-security screening. Psychological Scie11ce, 15(5 ), 302-6.

McConkie, G.W. and Rayner, K. (J 975). The span of the effective stimulus during a fixation in reading. Perception and Psychophysics, 17, 578- 58£>.

Mello-Thoms, C. (2003). PerceptiOil of breast cancer: Eye-position analysis of mammogram in terpretation. Academic Radiology, 10. 4-12.

Mello-Thoms, C., Nodine, C. F., and Kundel, H.L. (2002, February 26). Relating in1age-based features to mammogram interpretation. Paper presented at the Medical Imaging 2002: fmage Perception, Observer Performance, and Technology Assessment, San Diego, CA, USA.

Mello-Thoms, C., Nodine, C.Fry Weinstein, S.P., Ku ndel, H.l., and Toto, L.C. (2000, February 16). An unobtrusive met bod for monitoring visual attention dltring mammogram reading. Paper presented atthe Medical Imaging 2000: Image Perception, Observer Perform3Jlce, and Technology Assessment, San Diego, CA, USA.

Miller. G.A. (1956) . The magic number seven, plus or minus two. Psyc/tological Review, 63,81-87.

Eye movements and visual expertise in chess and medicine · 549

Moise, A., Atkins, M.S., and Rohling, R. (2005). Evaluating different radiology workstation interaction techniques with radiologists and laypersons. ]oumal of Digiwl Tmagii•g. 18(2), 116- 130.

Mugglestone, M.D .• Ga le, A. G., Cowley, H.C., and Wilson,I\.R.M. ( 1995). Diagnostic performance on brielly presented mammographic images. SP(E, 2436, 106- 115.

Newell, A. and Simon, H .A. ( 1972). Human pro/1lem solving, Englewood Cliffs, N. ).~ Prentice Hall. Nodine. C. F. and Krupinski, E.A. ( 1998). Perceptua l skill, radiology expertise, and visual test pcrforn1ance with NlNA and

WALDO. Academic Radiology, 5, 603-612.

Nodine, C. F. and Kundel, H.L. ( 1987). The wgnitive side of visual search in radiology. In J.K. O'Regan and A. Levy-Schoen (eds.) Eye movemems: From physiology to cognition (pp. 573-582). Amsterdam: Elsevier.

Nodine; C.F. and Kundel, H.L. ( 1997). AI Hirschfeld's NlNA as a prototype search task for studying perceptual error in radiology. SPIE, 3036, 308-3 12.

Nodine, C.F. and Mello-Thoms, C. (2000). The nature of expertise in radiolof,'Y· lo ). Beutel, 1-I.L. Kundel. ~nd R.L. Van Metter (eds.) The handbook of medical imaging: Physics a11d psyc/r(Jphysics (pp. 1!59- 894). Bellingham, WA: SPIE Press.

Nodine, C. F., Carmody, D.P. and Kundd, H.L. ( 1978). Searching for N ina. In /.W. Sanders, D.F. Fisher, and ItA. Mt)nty (eds.) Eye movements artd the hig/rer psyci!Ologiml flmctiOirs (pp. 24 1-258). Hillsdale, N/t Erlbaum.

Nodine, C. F., Kundel, H.L., Lauver, S.C., and Toto, L.C. ( 1996a). Nature of expertise in searching mammograms for breast masses. Academic Radiology, 3, 1000--1006.

Nod ine, C. F., Kundel, H.L, tauver, S.C., and Toto, L.C. ( l996b). The nature of expertise in searching mammograms for breast lesions. SPIH. 2712,89-94.

Nod ine, C.F., Kundel, H.L., Mello-Thoms, C .• and Weinstein, S.P. (2001, February 2 1 ). Role of computer-assisted visual search in mammograph icinterpretation. Paper presented at the Medica l Imaging 2001: Image Perception and Performance, San Diego, CA. USA.

Nodine, C. F .. Kw1del, H.L, Toto, L.C., and Krupinski, E.A. ( 1992). Recording and analyzing eye-position data using a microcomputer worksta tion . Behavior Research Metlwds,lnstrullre/11.5, & Computers, 24, 475-485.

Nodine, C. F., Locher, P./., aod Krupinski, E.A. ( 1993). The role offormal art training on perception and aesthetic judgment of art compositions. Leonardo, 26(3 ), 2 19-227,

Nodine, C. F .. Mello-Thoms, C., Kundel, H.L., and Weinstein, S.P. (2002). Time course of perception and decision making during mammographic intetptetation. Amerfm11 fournnlofRoelltgenology., 179,917-923.

Norman, G.R., Coblentz, C.L., Brooks, L.R., and 13abcook, C.). ( 1992). Expert ise in visual diagnosi.s; A review oft he literature. Academic Medicine, 67, $78-$83.

Ocstmann, J,W., Greene. R. , Kushner. D.C., Bourgouin, P.M., Unetsky, r.., and Urwell yn, H.J. (1988). Lung Lesions: Correlat ion between viewing time and detection. Radiology. 166,451-453.

Petrakis, E. ( l993 ). Analysis of visual sear<'h pallerru of te111ri; ter1chers. In G. d 'Y dewalle, and J. Van Renshergcn (cds.) Perception and cognitiom Advances i11 eye movement research (Vol. 4, pp. 159-168). Amsterdam: Elsevier.

Polanyi, M. ( 1969). Knowing and being. In M. Grene (ed.) JC"o •vh•g rmd being: essay.f. London: Rout ledge and K. Paul. Rayner, K. (1998). Eye movements in readinga11d information processing: 20 years of research. Psyd,ologica/ Bulleti11, 124,

372-422. Ra)'lter, K. and Pollatsek, A. ( 1997). Eye movements, the eye-hand span, and the perceptual span during sight-reading of

music. Current Directions ir1 Psychologiclll Science, 6(2), 49-53.

Reingold, E.M, and Charness, N. (2005). Perception in chess: Evidence from eye movements. lo1 G. Underwood (ed. ) Cognitive processe$ i11 ~ye guidm~ee (pp. 325- 354). Oxford: Oxford University Press.

Reingold, E.M., Charness. N., Pomplun, M .• ~nd Stampe, D M. (200 1a). Visual span in expert chess pl~yers: Evidence fi:om eye movements. Psychological Sciencf, 12, 48- 55.

Reingold, E.M .. Charness, N., Schultetu.<, R.S., and Stampe. D.M. (2001b). Perceptual automaticity in expert chess players: Parallel encoding of chess relations. Psyclronomic Bulletin and Review, 8, 504- 5 \0.

Rensink, R.A., O'Regan, ).K., and Clark, j.J. ( 1997). To see or nnt to see: The need for attention to perceive changes in scenes. Psychologiwl Science. 8, 368-373.

Reynolds, R.I. (1982). Search heuristics of chess playets of different cali bers. Amcricmr Journal of Psychology, 95, 383-392. Saariluoma, P, ( 1985). Chess players' intake of task-relevant cues. Ml'llwry twd Coguitirm, 13, 385- 391. Saner, N.B., Mumaw, R.)., and Wickens. C.D- (2007). Pilots' monitoring strategies and per·formance on automated Oi.ght

decks: An empirical study combining behaviora l and eye-tracking d<Jta. Humm• Factors, 49(3), 347-357. Saunders, R.S. and Samei, E. (2006). Lmproving mammogmphic decision ~ccuracy by incorporating observer ratings with

interpretation time. The British Journal of Radiology, 79, SI I7- S122. Scheff, T. ( 1963). Decision rules and types of error, and their consequences in medical diagnosis. Belurviournl SC'im,·e, 8,

97-107. Shinar, D. (2008). Looks are (almost) everything: '~here dri,ers look to get information. Humrw Faclors, 50(3), 380-384. Simon, H.A., and Barenfeld, M. ( 1969). rnformation. processing analysis of perceptual processes in problem solving.

Psychological Review, 76, 4 73-483.

Simon, 1-i.t\, and Chase, W.G. (1973). Skill in chess. Americmr Sciemist, 61 (4), 394-403. Simon, H.k and Gilmartin, K. (I 973 ). A simulation of memory for chess posi tions. Cognitive Psychology. 5, 29-46. Simons, D.)., and Levin, D.T, ( 1997) . Ch;111ge blindness. Treuds in Cognitive Sciet1crs, I, 26 1-267.

550 · Eyal M. Reingold and Heather Sheridan

Swensson, R.G. ( 1980). A two-stage detection model applied to skilled visual search by radiologists. Perceptior1 & Psychophysics, 27, ll - 16.

Swensson, R.G., Hessel, S.)., and Herman, P.G. (1982). Radiographic interpretation with and without search: visual search aids the recognition of chest pathology. Investigative Radiology, 17(2), 145-51.

Swensson, R.G., Hessel, S.j., and Herman, P.G. ( 1985). The value of searching films without specific preconceptions. Investigative Radiology, 20(1), 100-114.

Tai, R., Loehr, j., and Brigham, F. (2006). An exploration of the use of eye-gaze tracking to study problem-solving on standardized science assessments. International journal of Research and Method i11 Educati011, 29(2), 185-208.

Tikhomirov, O.K. and Poznyanskaya, E. ( 1966). An investigation of visual search as a means of analyzing heuristics. Soviet Psycltology. 5, 2- 15.

Underwood, G. (2007). Visual attention and the transition from novice to advanced driver. Ergonomics, 50(8), 1235-1249. Van Gog, T., Paas, F .• and Van Merrienboer, j.j. (2005). Unco•tering expertise-related differences in troubleshooting

performance: combining eye movement and concurrent verbal protocol data. Applied Ccgnitive PsyciJology, 19(2), 205-221. Vickers, j.N. (1992). Gaze control in putting. Perception, 21(1), 117-132. Win ikoff, A.W. (1967). Eye movements as an aid to protocol analysis of problem solving behavior. Unpublished doctoral

dissertation. Carnegie-Mellon University, Pittsburgh, PA. Wood, B.P. (1999). Visual expertise. Radiology, 21 1, 1-3.