Guest User

kahneman adversarial collaboration

a guest
Aug 2nd, 2018
547
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 26.60 KB | None | 0 0
  1. Expert Intuition: When Can We Trust It?
  2.  
  3.  
  4. Professional controversies bring out the worst in academics. Scientific
  5. journals occasionally publish exchanges, often beginning with someone’s
  6. critique of another’s research, followed by a reply and a rejoinder. I have
  7. always thought that these exchanges are a waste of time. Especially when
  8. the original critique is sharply worded, the reply and the rejoinder are often
  9. exercises in what I have called sarcasm for beginners and advanced
  10. sarcasm. The replies rarely concede anything to a biting critique, and it is
  11. almost unheard of for a rejoinder to admit that the original critique was
  12. misguided or erroneous in anyway. On a few occasions I have responded
  13. to criticisms that I thought were grossly misleading, because a failure to
  14. respond can be interpreted as conceding error, but I have never found the
  15. hostile exchanges instructive. In search of another way to deal with
  16. disagreements, I have engaged in a few “adversarial collaborations,” in
  17. which scholars who disagree on the science agree to write a jointly
  18. authored paper on their differences, and sometimes conduct research
  19. together. In especially tense situations, the research is moderated by an
  20. arbiter.
  21.  
  22. My most satisfying and productive adversarial collaboration was with
  23. Gary Klein, the intellectual leader of an association of scholars and
  24. practitioners who do not like the kind of work I do. They call themselves
  25. students of Naturalistic Decision Making, or NDM, and mostly work in
  26. organizations where the"0%Jb ty often study how experts work. The N
  27. DMers adamantly reject the focus on biases in the heuristics and biases
  28. approach. They criticize this model as overly concerned with failures and
  29. driven by artificial experiments rather than by the study of real people doing
  30. things that matter. They are deeply skeptical about the value of using rigid
  31. algorithms to replace human judgment, and Paul Meehl is not among their
  32. heroes. Gary Klein has eloquently articulated this position over many
  33. years.
  34.  
  35. This is hardly the basis for a beautiful friendship, but there is more to the
  36. story. I had never believed that intuition is always misguided. I had also
  37. been a fan of Klein’s studies of expertise in firefighters since I first saw a
  38. draft of a paper he wrote in the 1970s, and was impressed by his book
  39. Sources of Power, much of which analyzes how experienced professionals
  40. develop intuitive skills. I invited him to join in an effort to map the boundary
  41. that separates the marvels of intuition from its flaws. He was intrigued by
  42. the idea and we went ahead with the project—with no certainty that it would
  43. succeed. We set out to answer a specific question: When can you trust an
  44. experienced professional who claims to have an intuition? It was obvious
  45. that Klein would be more disposed to be trusting, and I would be more
  46. skeptical. But could we agree on principles for answering the general
  47. question?
  48.  
  49. Over seven or eight years we had many discussions, resolved many
  50. disagreements, almost blew up more than once, wrote many draft s,
  51. became friends, and eventually published a joint article with a title that tells
  52. the story: “Conditions for Intuitive Expertise: A Failure to Disagree.”
  53. Indeed, we did not encounter real issues on which we disagreed—but we
  54. did not really agree.
  55.  
  56.  
  57. Marvels and Flaws
  58.  
  59. Malcolm Gladwell’s bestseller Blink appeared while Klein and I were
  60. working on the project, and it was reassuring to find ourselves in
  61. agreement about it. Glad well’s book opens with the memorable story of art
  62. experts faced with an object that is described as a magnificent example of
  63. a kouros, a sculpture of a striding boy. Several of the experts had strong
  64. visceral reactions: they felt in their gut that the statue was a fake but were
  65. not able to articulate what it was about it that made them uneasy. Everyone
  66. who read the book—millions did—remembers that story as a triumph of
  67. intuition. The experts agreed that they knew the sculpture was a fake
  68. without knowing how they knew—the very definition of intuition. The story
  69. appears to imply that a systematic search for the cue that guided the
  70. experts would have failed, but Klein and I both rejected that conclusion.
  71. From our point of view, such an inquiry was needed, and if it had been
  72. conducted properly (which Klein knows how to do), it would probably have
  73. succeeded.
  74.  
  75. Although many readers of the kouros example were surely drawn to an
  76. almost magical view of expert intuition, Gladwell himself does not hold that
  77. position. In a later chapter he describes a massive failure of intuition:
  78. Americans elected President Harding, whose only qualification for the
  79. position was that he perfectly looked the part. Square jawed and tall, he
  80. was the perfect image of a strong and decisive leader. People voted for
  81. someone who looked strong and decisive without any other reason to
  82. believe that he was. An intuitive prediction of how Harding would perform
  83. as president arose from substituting one question for another. A reader of
  84. this book should expect such an intuition to be held with confidence.
  85.  
  86. Intuition as Recognition
  87.  
  88. The early experiences that shaped Klein’s views of intuition were starkly
  89. different from mine. My thinking was formed by observing the illusion of
  90. validity in myself and by reading Paul Meehl’s demonstrations of the
  91. inferiority of clinical prediction. In contrast, Klein’s views were shaped by
  92. his early studies of fireground commanders (the leaders of firefighting
  93. teams). He followed them as they fought fires and later interviewed the
  94. leader about his thoughts as he made decisions. As Klein described it in
  95. our joint article, he and his collaborators
  96.  
  97. investigated how the commanders could make good decisions
  98. without comparing options. The initial hypothesis was that
  99. commanders would restrict their analysis to only a pair of options,
  100. but that hypothesis proved to be incorrect. In fact, the
  101. commanders usually generated only a single option, and that was
  102. all they needed. They could draw on the repertoire of patterns that
  103. they had compiled during more than a decade of both real and
  104. virtual experience to identify a plausible option, which they
  105. considered first. They evaluated this option by mentally simulating
  106. it to see if it would work in the situation they were facing.... If the
  107. course of action they were considering seemed appropriate, they
  108. would implement it. If it had shortcomings, they would modify it. If
  109. they could not easily modify it, they would turn to the next most
  110. plausible option and run through the same procedure until an
  111. acceptable course of action was found.
  112.  
  113. Klein elaborated this description into a theory of decision making that he
  114. called the recognition-primed decision (RPD) model, which applies to
  115. firefighters but also describes expertise in other domains, including chess.
  116. The process involves both System 1 and System 2. In the first phase, a
  117. tentative plan comes to mind by an automatic function of associative
  118. memory—System 1. The next phase is a deliberate process in which the
  119. plan is mentally simulated to check if it will work—an operation of System
  120. 2. The model of intuitive decision making as pattern recognition develops
  121. ideas presented some time ago by Herbert Simon, perhaps the only
  122. scholar who is recognized and admired as a hero and founding figure by
  123. all the competing clans and tribes in the study of decision making. I quoted
  124. Herbert Simon’s definition of intuition in the introduction, but it will make
  125. more sense when I repeat it now: “The situation has provided a cue; this
  126. cue has given the expert access to information stored in memory, and the
  127. information provides the answer. Intuition is nothing more and nothing less
  128. than recognition.”
  129.  
  130. This strong statement reduces the apparent magic of intuition to the
  131. everyday experience of memory. We marvel at the story of the firefighter
  132. who has a sudden urge to escape a burning house just before it collapses,
  133. because the firefighter knows the danger intuitively, “without knowing how
  134. he knows.” However, we also do not know how we immediately know that a
  135. person we see as we enter a room is our friend Peter. The moral of
  136. Simon’s remark is that the mystery of knowing without knowing is not a
  137. distinctive feature of intuition; it is the norm of mental life.
  138.  
  139. Acquiring Skill
  140.  
  141. How does the information that supports intuition get “stored in memory”?
  142. Certain types of intuitions are acquired very quickly. We have inherited
  143. from our ancestors a great facility to learn when to be afraid. Indeed, one
  144. experience is often sufficient to establish a long-term aversion and fear.
  145. Many of us have the visceral memory of a single dubious dish tto hat still
  146. leaves us vaguely reluctant to return to a restaurant. All of us tense up when
  147. we approach a spot in which an unpleasant event occurred, even when
  148. there is no reason to expect it to happen again. For me, one such place is
  149. the ramp leading to the San Francisco airport, where years ago a driver in
  150. the throes of road rage followed me from the freeway, rolled down his
  151. window, and hurled obscenities at me. I never knew what caused his
  152. hatred, but I remember his voice whenever I reach that point on my way to
  153. the airport.
  154.  
  155. My memory of the airport incident is conscious and it fully explains the
  156. emotion that comes with it. On many occasions, however, you may feel
  157. uneasy in a particular place or when someone uses a particular turn of
  158. phrase without having a conscious memory of the triggering event. In
  159. hindsight, you will label that unease an intuition if it is followed by a bad
  160. experience. This mode of emotional learning is closely related to what
  161. happened in Pavlov’s famous conditioning experiments, in which the dogs
  162. learned to recognize the sound of the bell as a signal that food was
  163. coming. What Pavlov’s dogs learned can be described as a learned hope.
  164. Learned fears are even more easily acquired.
  165.  
  166. Fear can also be learned—quite easily, in fact—bywords rather than by
  167. experience. The fireman who had the “sixth sense” of danger had certainly
  168. had many occasions to discuss and think about types of fires he was not
  169. involved in, and to rehearse in his mind what the cues might be and how he
  170. should react. As I remember from experience, a young platoon
  171. commander with no experience of combat will tense up while leading
  172. troops through a narrowing ravine, because he was taught to identify the
  173. terrain as favoring an ambush. Little repetition is needed for learning.
  174.  
  175. Emotional learning may be quick, but what we consider as “expertise”
  176. usually takes a long time to develop. The acquisition of expertise in
  177. complex tasks such as high-level chess, professional basketball, or
  178. firefighting is intricate and slow because expertise in a domain is not a
  179. single skill but rather a large collection of miniskills. Chess is a good
  180. example. An expert player can understand a complex position at a glance,
  181. but it takes years to develop that level of ability. Studies of chess masters
  182. have shown that at least 10,000 hours of dedicated practice (about 6 years
  183. of playing chess 5 hours a day) are required to attain the highest levels of
  184. performance. During those hours of intense concentration, a serious chess
  185. player becomes familiar with thousands of configurations, each consisting
  186. of an arrangement of related pieces that can threaten or defend each
  187. other.
  188.  
  189. Learning high-level chess can be compared to learning to read. A first
  190. grader works hard at recognizing individual letters and assembling them
  191. into syllables and words, but a good adult reader perceives entire clauses.
  192. An expert reader has also acquired the ability to assemble familiar
  193. elements in a new pattern and can quickly “recognize” and correctly
  194. pronounce a word that she has never seen before. In chess, recurrent
  195. patterns of interacting pieces play the role of letters, and a chess position
  196. is a long word or a sentence.
  197.  
  198. A skilled reader who sees it for the first time will be able to read the
  199. opening stanza of Lewis Carroll’s “Jabberwocky” with perfect rhythm and
  200. intonation, as well as pleasure:
  201.  
  202.  
  203. ’Twas brillig, and the slithytoves
  204. Did gyre and gimble in the wabe:
  205. All mimsy were the borogoves,
  206. And the mome raths outgrabe.
  207.  
  208.  
  209. Acquiring expertise in chess is harder and slower than learning to read
  210. because there are many more letters in the “alphabet” of chess and
  211. because the “words” consist of many letters. After thousands of hours of
  212. practice, however, chess masters are able to read a chess situation at a
  213. glance. The few moves that come to their mind are almost always strong
  214. and sometimes creative. They can deal with a “word” they have never
  215. encountered, and they can find a new way to interpret a familiar one.
  216.  
  217.  
  218. The Environment of Skill
  219.  
  220.  
  221. Klein and I quickly found that we agreed both on the nature of intuitive skill
  222. and on how it is acquired. We still needed to agree on our key question:
  223. When can you trust a self-confident professional who claims to have an
  224. intuition?
  225.  
  226. We eventually concluded that our disagreement was due in part to the
  227. fact that we had different experts in mind. Klein had spent much time with
  228. fireground commanders, clinical nurses, and other professionals who have
  229. real expertise. I had spent more time thinking about clinicians, stock
  230. pickers, and political scientists trying to make unsupportable long-term
  231. forecasts. Not surprisingly, his default attitude was trust and respect; mine
  232. was skepticism. He was more willing to trust experts who claim an intuition
  233. because, as he told me, true experts know the limits of their knowledge. I
  234. argued that there are many pseudo-experts who have no idea that they do
  235. not know what they are doing (the illusion of validity), and that as a general
  236. proposition subjective confidence is commonly too high and often
  237. uninformative.
  238.  
  239. Earlier I traced people’s confidence in a belief to two related
  240. impressions: cognitive ease and coherence. We are confident when the
  241. story we tell ourselves comes easily to mind, with no contradiction and no
  242. competing scenario. But ease and coherence do not guarantee that a
  243. belief held with confidence is true. The associative machine is set to
  244. suppress doubt and to evoke ideas and information that are compatible
  245. with the currently dominant story. A mind that follows WY SIATI will achieve
  246. high confidence much too easily by ignoring what it does not know. It is
  247. therefore not surprising that many of us are prone to have high confidence
  248. in unfounded intuitions. Klein and I eventually agreed on an important
  249. principle: the confidence that people have in their intuitions is not a reliable
  250. guide to their validity. In other words, do not trust anyone—including
  251. yourself—to tell you how much you should trust their judgment.
  252.  
  253. If subjective confidence is not to be trusted, how can we evaluate the
  254. probable validity of an intuitive judgment? When do judgments reflect true
  255. expertise? When do they display an illusion of validity? The answer comes
  256. from the two basic conditions for acquiring a skill:
  257.  
  258.  
  259. • an environment that is sufficiently regular to be predictable
  260.  
  261. • an opportunity to learn these regularities through prolonged practice
  262.  
  263.  
  264. When both these conditions are satisfied, intuitions are likely to be skilled.
  265. Chess is an extreme example of a regular environment, but bridge and
  266. poker also provide robust statistical regularities that can support skill.
  267. Physicians, nurses, athletes, and firefighters also face complex but
  268. fundamentally orderly situations. The accurate intuitions that Gary Klein has
  269. described are due to highly valid cues that es the expert’s System 1 has
  270. learned to use, even if System 2 has not learned to name them. In contrast,
  271. stock pickers and political scientists who make long-term forecasts
  272. operate in a zero-validity environment. Their failures reflect the basic
  273. unpredictability of the events that they try to forecast.
  274.  
  275. Some environments are worse than irregular. Robin Hogarth described
  276. “wicked” environments, in which professionals are likely to learn the wrong
  277. lessons from experience. He borrows from Lewis Thomas the example of
  278. a physician in the early twentieth century who often had intuitions about
  279. patients who were about to develop typhoid. Unfortunately, he tested his
  280. hunch by palpating the patient’s tongue, without washing his hands
  281. between patients. When patient after patient became ill, the physician
  282. developed a sense of clinical infallibility. His predictions were accurate—
  283. but not because he was exercising professional intuition!
  284.  
  285.  
  286. Meehl’s clinicians were not inept and their failure was not due to lack of
  287. talent. They performed poorly because they were assigned tasks that did
  288. not have a simple solution. The clinicians’ predicament was less extreme
  289. than the zero-validity environment of long-term political forecasting, but they
  290. operated in low-validity situations that did not allow high accuracy. We
  291. know this to be the case because the best statistical algorithms, although
  292. more accurate than human judges, were never very accurate. Indeed, the
  293. studies by Meehl and his followers never produced a “smoking gun”
  294. demonstration, a case in which clinicians completely missed a highly valid
  295. cue that the algorithm detected. An extreme failure of this kind is unlikely
  296. because human learning is normally efficient. If a strong predictive cue
  297. exists, human observers will find it, given a decent opportunity to do so.
  298. Statistical algorithms greatly outdo humans in noisy environments for two
  299. reasons: they are more likely than human judges to detect weakly valid
  300. cues and much more likely to maintain a modest level of accuracy by using
  301. such cues consistently.
  302.  
  303. It is wrong to blame anyone for failing to forecast accurately in an
  304. unpredictable world. However, it seems fair to blame professionals for
  305. believing they can succeed in an impossible task. Claims for correct
  306. intuitions in an unpredictable situation are self-delusional at best,
  307. sometimes worse. In the absence of valid cues, intuitive “hits” are due
  308. either to luck or to lies. If you find this conclusion surprising, you still have a
  309. lingering belief that intuition is magic. Remember this rule: intuition cannot
  310. be trusted in the absence of stable regularities in the environment.
  311.  
  312.  
  313. Feedback and Practice
  314.  
  315. Some regularities in the environment are easier to discover and apply than
  316. others. Think of how you developed your style of using the brakes on your
  317. car. As you were mastering the skill of taking curves, you gradually learned
  318. when to let go of the accelerator and when and how hard to use the brakes.
  319. Curves differ, and the variability you experienced while learning ensures
  320. that you are now ready to brake at the right time and strength for any curve
  321. you encounter. The conditions for learning this skill are ideal, because you
  322. receive immediate and unambiguous feedback every time you go around
  323. a bend: the mild reward of a comfortable turn or the mild punishment of
  324. some difficulty in handling the car if you brake either too hard or not quite
  325. hard enough. The situations that face a harbor pilot maneuvering large
  326. ships are no less regular, but skill is much more difficult to acquire by sheer
  327. experience because of the long delay between actions and their
  328. noticeable outcomes. Whether professionals have a chance to develop
  329. intuitive expertise depends essentially on the quality and speed of
  330. feedback, as well as on sufficient opportunity to practice.
  331.  
  332. Expertise is not a single skill; it is a collection of skills, and the same
  333. professional may be highly expert in some of the tasks in her domain while
  334. remaining a novice in others. By the time chess players become experts,
  335. they have “seen everything” (or almost everything), but chess is an
  336. exception in this regard. Surgeons can be much more proficient in some
  337. operations than in others. Furthermore, some aspects of any
  338. professional’s tasks are much easier to learn than others.
  339. Psychotherapists have many opportunities to observe the immediate
  340. reactions of patients to what they say. The feedback enables them to
  341. develop the intuitive skill to find the words and the tone that will calm anger,
  342. forge confidence, or focus the patient’s attention. On the other hand,
  343. therapists do not have a chance to identify which general treatment
  344. approach is most suitable for different patients. The feedback they receive
  345. from their patients’ long-term outcomes is sparse, delayed, or (usually)
  346. nonexistent, and in any case too ambiguous to support learning from
  347. experience.
  348.  
  349. Among medical specialties, anesthesiologists benefit from good
  350. feedback, because the effects of their actions are likely to be quickly
  351. evident. In contrast, radiologists obtain little information about the accuracy
  352. of the diagnoses they make and about the pathologies they fail to detect.
  353. Anesthesiologists are therefore in a better position to develop useful
  354. intuitive skills. If an anesthesiologist says, “I have a feeling something is
  355. wrong,” everyone in the operating room should be prepared for an
  356. emergency.
  357.  
  358. Here again, as in the case of subjective confidence, the experts may not
  359. know the limits of their expertise. An experienced psychotherapist knows
  360. that she is skilled in working out what is going on in her patient’s mind and
  361. that she has good intuitions about what the patient will say next. It is
  362. tempting for her to conclude that she can also anticipate how well the
  363. patient will do next year, but this conclusion is not equally justified. Short¬
  364. term anticipation and long-term forecasting are different tasks, and the
  365. therapist has had adequate opportunity to learn one but not the other.
  366. Similarly, a financial expert may have skills in many aspects of his trade
  367. but not in picking stocks, and an expert in the Middle East knows many
  368. things but not the future. The clinical psychologist, the stock picker, and the
  369. pundit do have intuitive skills in some of their tasks, but they have not
  370. learned to identify the situations and the tasks in which intuition will betray
  371. them. The unrecognized limits of professional skill help explain why experts
  372. are often overconfident.
  373.  
  374.  
  375. Evaluating Validity
  376.  
  377. At the end of our journey, Gary Klein and I agreed on a general answer to
  378. our initial question: When can you trust an experienced professional who
  379. claims to have an intuition? Our conclusion was that for the most part it is
  380. possible to distinguish intuitions that are likely to be valid from those that
  381. are likely to be bogus. As in the judgment of whether a work of art is
  382. genuine or a fake, you will usually do better by focusing on its provenance
  383. than by looking at the piece itself. If the environment is sufficiently regular
  384. and if the judge has had a chance to learn its regularities, the associative
  385. machinery will recognize situations and generate quick and accurate
  386. predictions and decisions. You can trust someone’s intuitions if these
  387. conditions are met.
  388.  
  389. Unfortunately, associative memory also generates subjectively
  390. compelling intuitions that are false. Anyone who has watched the chess
  391. progress of a talented youngster knows well that skill does not become
  392. perfect all at once, and that on the way to near perfection some mistakes
  393. are made with great confidence. When evaluating expert intuition you
  394. should always consider whether there was an adequate opportunity to
  395. learn the cues, even in a regular environment.
  396.  
  397. In a less regular, or low-validity, environment, the heuristics of judgment
  398. are invoked. System 1 is often able to produce quick answers to difficult
  399. questions by substitution, creating coherence where there is none. The
  400. question that is answered is not the one that was intended, but the answer
  401. is produced quickly and may be sufficiently plausible to pass the lax and
  402. lenient review of System 2. You may want to forecast the commercial future
  403. of a company, for example, and believe that this is what you are judging,
  404. while in fact your evaluation is dominated by your impressions of the
  405. energy and competence of its current executives. Because substitution
  406. occurs automatically, you often do not know the origin of a judgment that
  407. you (your System 2) endorse and adopt. If it is the only one that comes to
  408. mind, it may be subjectively undistinguishable from valid judgments that
  409. you make with expert confidence. This is why subjective confidence is not
  410. a good diagnostic of accuracy: judgments that answer the wrong question
  411. can also be made with high confidence.
  412.  
  413. You may be asking, Why didn’t Gary Klein and I come up immediately
  414. with the idea of evaluating an expert’s intuition by assessing the regularity
  415. of the environment and the expert’s learning history—mostly setting aside
  416. the expert’s confidence? And what did we think the answer could be?
  417. These are good questions because the contours of the solution were
  418. apparent from the beginning. We knew at the outset that fireground
  419. commanders and pediatric nurses would end up on one side of the
  420. boundary of valid intuitions and that the specialties studied by Meehl would
  421. be on the other, along with stock pickers and pundits.
  422.  
  423. It is difficult to reconstruct what it was that took us years, long hours of
  424. discussion, endless exchanges of draft s and hundreds of e-mails
  425. negotiating over words, and more than once almost giving up. But this is
  426. what always happens when a project ends reasonably well: once you
  427. understand the main conclusion, it seems it was always obvious.
  428.  
  429. As the title of our article suggests, Klein and I disagreed less than we
  430. had expected and accepted joint solutions of almost all the substantive
  431. issues that were raised. However, we also found that our early differences
  432. were more than an intellectual disagreement. We had different attitudes,
  433. emotions, and tastes, and those changed remarkably little over the years.
  434. This is most obvious in the facts that we find amusing and interesting. Klein
  435. still winces when the word bias is mentioned, and he still enjoys stories in
  436. which algorithms or formal procedures lead to obviously absurd decisions.
  437. I tend to view the occasional failures of algorithms as opportunities to
  438. improve them. On the other hand, I find more pleasure than Klein does in
  439. the come-uppance of arrogant experts who claim intuitive powers in zero-
  440. validity situations. In the long run, however, finding as much intellectual
  441. agreement as we did is surely more important than the persistent
  442. emotional differences that remained.
Add Comment
Please, Sign In to add comment