Visions of rationality
View Dublin Core Metadata for this article external link

Chase, Valerie M.
Hertwig, Ralph
Gigerenzer, Gerd

 

Please note:
This paper is a preprint of an article published in Trends in Cognitive Sciences, 2 (1998) 6 (pp. 206-214), therefore there may be minor differences between the two versions.
The published version is available at Trends in Cognitive Sciences, 2 (1998) 6.
Access to the article may require a contract or license with the publisher.
The copyright of this electronic version remains with the authors and the Max Planck Institute for Human Development.

   

Summary

The classical view that equates rationality with adherence to the laws of probability theory and logic has driven much research on human inference. Recently, an increasing number of researchers have begun to espouse a view of rationality that takes account of organisms' adaptive goals, natural environment, and cognitive constraints. We argue that inference is carried out using boundedly rational heuristics, that is, heuristics that allow organisms to reach their goals under conditions of limited time, information, and computational capacity. These heuristics are ecologically rational in that they exploit aspects of both the physical and social environment in order to make fast and frugal inferences. We review recent work exploring this multifaceted conception of rationality.

Keywords: bounded rationality, environmental structure, social cognition, judgment and decision making, problem solving

Humans and animals alike make inductive inferences. Firefighters predict how fires will progress from cues such as smoke and roof "sponginess"[1], while peahens use the elaborateness of peacocks' tails to infer their fitness before deciding whether to mate with them[2]. The cues on which organisms base their inductive inferences are typically uncertain: the old adage aside, sometimes there's no smoke even where there's fire.

How do people make inferences, and are their inferences rational? Most researchers of inference share a vision of rationality whose roots trace back to the Enlightenment. This now classical view holds that the laws of human inference are equivalent to the laws of probability and logic. For French astronomer Pierre Laplace, for example, probability theory embodied human intuition: "The theory of probability is at bottom nothing more than good sense reduced to a calculus"[3]. Nineteenth-century German philosopher Theodor Lipps wrote that logic "is nothing if not the physics of thought"[4]. So fundamental was the belief that the mind worked by the rules of probability and logic that when human intuition was observed to deviate from them, the rules were questioned[5]. In short, many pre-twentieth-century thinkers believed that the psychological defines the rational.

Variants of the classical view have flourished in twentieth-century psychology. In particular, many researchers believe that the laws of probability theory and logic at least approximately describe human inference. In Cameron Peterson and Lee Beach's view, for example: "Probability theory and statistics can be used as the basis for psychological models that integrate and account for human performance in a wide range of inferential tasks"[6]. According to Jean Piaget, cognitive development culminates in a set of logico-mathematical abilities that essentially reflect the laws of probability and logic. More recently, Lance Rips has argued for the existence of "mental logic"[7]. Finally, rational choice theorists and economists often model people's decisions using probability theory as an approximation (e.g.,[8],[9]). Unlike their Enlightenment predecessors, however, these modern researchers see classical models as norms against which human reasoning can be evaluated rather than as codifications of it: When the two diverge, it is concluded that there is something wrong with the reasoning, not with the norms.

In the past 25 years, the idea that human inference can be either defined or described by probability theory and logic has been increasingly challenged. Proponents of the heuristics-and-biases program have argued that inference is systematically biased and error-prone, powered by quick and dirty cognitive heuristics[10]. Numerous departures from classical norms in inductive reasoning - "cognitive illusions" such as overconfidence, base-rate neglect, and the conjunction fallacy (all discussed further below) - have been attributed to application of these heuristics. A parallel research program has been devoted to accounting for departures of deductive inference from logical norms (for a review of this literature, see [11]).

As the heuristics-and-biases program grew, the view that human reasoning is fundamentally irrational supplanted the belief that it accords with classical rational norms within and outside of psychology[12]. In the words of Slovic, Fischhoff, and Lichtenstein: "It appears that people lack the correct programs for many important judgmental tasks.... It may be argued that we have not had the opportunity to evolve an intellect capable of dealing conceptually with uncertainty"[13]. The conjunction fallacy impelled paleontologist Stephen Jay Gould to speculate: "Our minds are not built (for whatever reason) to work by the rules of probability"[14]. Some have even argued that deviations from rational norms "should be considered the rule rather than the exception"[15].

Are of rational norms really the rule? Given the analogy between inference anviolationsd perception behind the illusion metaphor[10], they should be considered an exception. Just as vision researchers construct situations in which the functioning of the visual system leads to incorrect inferences about the world (e.g., about line lengths in the Müller-Lyer illusion), researchers in the heuristics-and-biases program select problems in which reasoning by cognitive heuristics leads to violations of probability theory[12]. However, the conclusions they draw from such unrepresentative designs differ sharply from those drawn by researchers of perception. Vision scientists do not conclude from the robustness of the Müller-Lyer illusion, for instance, that people are generally poor at inferring object lengths. However, many advocates of the heuristics-and-biases program conclude from the cognitive illusions found in laboratory tasks that human judgment is subject to severe and systematic biases that compromise its general functioning[16],[17].

How does judgment look when one does not select problems systematically? The use of representative design, which entails simulating real-world conditions of interest in order to test and evaluate human performance[18]-[20], casts new light on inference. For instance, when one uses a representative sample of general knowledge questions, the overconfidence bias found in selected samples of questions disappears (e.g., [21]-[23]; but see also [24]). In a recent meta-analysis of more than 40 general knowledge tasks, Peter Juslin and colleagues[25] found an average overconfidence of practically zero. In addition, people's estimates of the frequency with which letters appear in various positions within words are better-calibrated when participants judge a large, representative sample of letters[26] than a small, selected sample[27]. While systematic design is often desirable when we want to decide between cognitive models, only representative design allows us to draw conclusions about the quality of human judgment in the real world.

Problems with the Classical Definition of Rationality

Despite their disagreements, proponents of the neo-Enlightenment view and the heuristics-and-biases view agree on one critical point: Rationality requires reasoning in accord with the rules of probability theory. Even if we accept this definition (which we will challenge shortly), we still see three major problems with it. First, no single conception of probability is shared by all statisticians and philosophers. The applicability of the rules of probability theory to unique events is hotly disputed, with some contending that they apply to unique events and others arguing that they apply only to classes of events[28]. For someone who interprets probability in a strictly frequentistic sense, these rules are irrelevant to the many tasks involving unique events that have been studied in the heuristics-and-biases program. In our view, wherever a norm's applicability depends on our interpretation of probability in this way, we are not justified in treating it as an unequivocal norm of sound reasoning ([29], [30]; for a recent debate on this point, see [31], [32]).

The second problem with the classic definition of rationality is its blindness to content and context. In much research on inference, the rules of probability are taken a priori as normative, and content is only later filled in. In other words, rather than following the practices of good statisticians, who tailor statistical models to suit specific problems, those who subscribe to this definition of rationality sometimes fail to analyze problem content and people's assumptions about it. Unless this is done, we cannot know how to interpret their judgments. In studies of Bayesian inference, for example, participants may make intelligent assumptions that render some of the given information irrelevant to their judgments, which can be mistaken for neglect of base-rate information[33].

The third and most serious problem we see with the classical definition of rationality is that outside of the simple problems used in most research, it makes unrealistic demands of the mind. In the real world, matters are more complicated than the simple content-blind norms tested in most laboratory problems assume. Here, Bayes' theorem and subjective expected utility maximization often become mathematically complex and computationally intractable. Moreover, in many situations, a rational model cannot even be specified because the problem space is unbounded (see [34] Appendix, [35]). Expecting people's inferences to conform to classical rational norms in such complex environments requires believing that the human mind is a Laplacean demon[36]: a supercalculator with unlimited time, knowledge, and computational power.

Is there any view between the two extremes we have so far considered: that the mind is an omniscient, omnipotent Laplacean demon or that it simply "lacks the correct programs"[13] for making many important judgments? Herbert Simon set the stage for what we consider the most promising alternative: "Human rational behavior ... is shaped by a scissors whose two blades are the structure of task environments and the computational capabilities of the actor"[37]. In other words, rationality cannot be defined except by reference to environmental and cognitive constraints. Moreover, rationality is a tool for helping organisms to reach their real-world goals, not necessarily to conform to rational norms. In Simon's words: "Reason is.... a gun for hire that can be employed in any goals we have..."[38]. In the remainder of this review, we describe how recent researchers have used Simon's scissors to fashion a new area of research on human inference.

Bounded Rationality

The human mind has to solve complex problems under difficult working conditions: limited time, knowledge, and computational capacity. Consider Charles Darwin, who methodically listed the pros and cons of marriage and bachelorhood before deciding to marry Emma Wedgewood. Despite his willingness to take such an analytic approach to deciding affairs of the heart, Darwin could not have hoped to make this decision rationally in any classical sense. Suppose that he had attempted to maximize his subjective expected utility. While he deliberated about whether marrying was the right choice, listing each of the infinite conceivable consequences of marrying and not marrying, assigning probabilities to each, and searching for more information about his prospective wives, they would all most likely have married other men (not to mention had children and died). Even if they were infinitely patient, he would still need an infinite number of supercomputers to integrate all of this information for him.

What does Darwin's dilemma illustrate? First, life's important problems cannot necessarily be solved by optimization because the space of possibilities that must be taken into account is often unlimited. Second, even when this space is limited and knowledge is complete, optimization is often impossible to achieve in any real system due to the computational demands it poses; after all, even Gary Kasparov's archrival Deep Blue is unable to fully optimize its moves in the well-defined problem space of chess.

Simon proposed that, because of the above constraints, human inference in the real world exhibits "bounded rationality" rather than the classical rationality assumed by optimal models in psychology, economics, and biology (for a relevant debate, see [35], [39]). The key feature of bounded rationality is limited search, which requires some kind of stopping rule. Note that here search can refer either to search for alternatives (e.g., mates) or search for each alternative's values on particular cues (e.g., a potential mate's age, sense of humor, etc.). We now describe various interpretations of bounded rationality, saving the one we favor for last[40].

Some researchers in psychology and economics define bounded rationality as constrained optimization, that is, optimizing relative to a criterion while taking the costs of time, information search, and computation into account (e.g., [34], [41]-[43]). This stopping rule - to terminate search when its costs outweigh its benefits - is deceptively simple. In fact, the optimization is simply shifted to the problem of determining when to terminate search, which means that this brand of bounded rationality is saddled with the very intractability that it is intended to eliminate[44]. Unsurprisingly, most economists have not embraced bounded rationality in this form because they "are in the market for methods for reducing the number of parameters to explain data, and a reduction is not what bounded rationality promises"[42].

The most prominent approach of this type, "rational analysis" [34], [41], is predicated on the assumptions that human cognition is adaptive and that adaptation amounts to optimization. It entails specifying the goals of the cognitive system, developing a formal model of the environment, and deriving the optimal behavioral function based on the goals, formal model, and minimal cognitive constraints. This function is then compared to human performance, and the model duly refined to bring the two into closer correspondence. The cognitive constraints that rational analysis takes into account include deliberation costs and short-term memory limitations. Rational heuristics can conserve cognitive resources by exploiting environmental regularities (e.g., the rarity of most cues and target variables; see [45], [46]) to simplify the task of optimization.

The rational analysis approach has made impressive progress in developing ecologically appropriate norms to which one can compare human performance in memory, categorization, hypothesis testing, and causal inference [34], [41], [45]. Furthermore, it has demonstrated that some of the most robust findings in cognitive psychology (e.g., power-law learning) can be illuminated by ecological analysis. Still, it has three important limitations. First, it can be performed only in those relatively uncommon situations in which an optimal solution can be worked out. Second, devising a computationally tractable rational model of a real-world environment requires making dramatically simplifying assumptions (e.g., assuming that cues are independent to trim down Bayesian computations, see [34], [41]). Moreover, because rational analysis starts with a full-blown optimal model made up of mathematical rather than psychological components, it is not well suited to building plausible models of human cognitive processes[35]. As John R. Anderson, who spearheaded rational analysis, has himself observed: "It is in the spirit of a rational analysis to prescribe what the behavior of a system should be rather than how to compute it"[41].

Some have suggested that the cognitive heuristics identified in the heuristics-and-biases program, such as representativeness and availability, exhibit bounded rationality[10], [13], [47]. Early research on cognitive heuristics certainly served to demonstrate that human inference does not always conform to classical rational norms. It also encouraged researchers to explore the idea that people rely on cognitive heuristics made up of simple psychological processes rather than formal procedures to make inferences. However, to date, the cognitive heuristics posited in this literature have not been formalized such that one could either simulate or mathematically analyze their behavior (for a counterexample, see [48]), leaving them free to account for all kinds of performance post hoc (for a rebuttal of this point, see [31]). Furthermore, it has not been specified whether or how such heuristics capitalize on environmental structure to make inferences.

We now describe some models of bounded rationality that capture both the environmental and the cognitive blades of Simon's scissors. Their most critical feature is that they include smart, simple rules for stopping information search. As a convention, we refer to heuristics based on limited (cue) search as "fast and frugal"[40].

A Fast and Frugal Heuristic: Take The Best

Which has a larger population: San Diego or San Antonio? If you are not American, you will probably guess San Diego. Why? Because you have heard of it, and chances are that you have never heard of San Antonio. If you are American, however, you probably recognize both cities, and thus cannot rely on the recognition heuristic to make your choice[40]. In this context, the recognition heuristic can be summarized in one sentence: If you recognize one object and not the other, then infer that the recognized object has the higher value on the target variable. If you do not recognize either object, then guess.

What happens if you recognize both cities? In that case, you have to retrieve information from memory to make an inference. Take The Best is a heuristic for using this information[49]. Imagine that we have a set of objects, all German cities with more than 100,000 inhabitants, and a target variable, population size. Each city can be characterized on a number of binary (or dichtomized) cues, each of which predicts population size to varying degrees. For instance, cities with major league soccer teams tend to be larger. In Take The Best, the objects are compared on the most valid cue, the second most valid cue, and so on until a cue on which the objects differ is found (see Figure 1). All that Take The Best needs to learn - or to estimate - is the rank order of cues by validity. Moreover, its stopping rule for information search is very simple: Take the best cue (i.e., the most valid one that discriminates) and ignore the rest. The first step of Take The Best is always the recognition heuristic, which enables it to exploit ignorance to make smart inferences. (The recognition heuristic could be the first step of other inference strategies as well; see Ref. [37]).

Figure 1

Using the recognition heuristic might be smart, but do people actually use it? Empirical results suggest that they do. In one study, American students were asked to make hundreds of inferences about which member of pairs of German cities (e.g., Bielefeld or Munich) was larger. In over 90% of the choices in which recognition discriminated between alternatives, participants in this study opted for the recognized city[40]. Clearly, a person who recognizes all objects cannot use the recognition heuristic (which in the German city environment generally discriminates well between larger and small cities for this American sample). A counter-intuitive implication of this is that someone with less knowledge can actually make more accurate inferences under some conditions, a prediction supported by empirical results (the "less-is-more effect"; see [40], [49]).

Table 1            
 
Algorithm
Average number of cues looked up
% correct when test set same as training set
% correct when test set different from training set
Minimalist
 
2.1
 
61
 
56
Take The Best
2.4
69
63
Dawes' Rule
 
6
 
66
 
58
Multiple Regression
 
6
 
70
 
61

Inferring homelessness: a competition between algorithms (a); (a) = modified from [50]

When inferring target variables as diverse as population sizes and high school drop-out rates, computer simulation results show that Take The Best roughly matches or outperforms in inferential accuracy a variety of linear models that integrate across all cues, such as multiple regression and a unit-weighted linear model called "Dawes' Rule". Even more surprisingly, the "Minimalist" heuristic, a poor cousin of Take The Best that selects cues in random order, also fares well relative to these computationally more expensive algorithms. Take The Best thrives particularly well compared to integration algorithms when generalizations have to be made (i.e., when test set NOTtraining set) rather than when data have to be fitted (i.e., when test set = training set). This is because algorithms that integrate all available information, such as multiple regression, tend to suffer from overfitting, whereas Take The Best relies disproportionately on cues that exhibit greater invariance, at least in the data environments in which it has so far been tested.

Table 1 shows the results of a simulated competition (excluding the recognition heuristic) between Take The Best, Minimalist, Dawes' Rule, and multiple regression in which the target variable was rates of homelessness in 50 U. S. cities[50]. The left column indicates the average number of cues that each algorithm had to look up (out of a total possible of six), which was roughly the same in both types of competition. The center and right columns show the percentages of correct binary inferences each algorithm made when the test set was equal to and not equal to the training set, respectively.

Take The Best and Minimalist are clearly more frugal in their use of information than the two integration algorithms, yet are about as accurate as the others. Even more remarkably, Take The Best actually outperforms multiple regression when generalization is required.

Ecological Rationality

Fast and frugal heuristics can perform about as well as algorithms that require much more information - and in a serial architecture - more time. What is their secret? The answer lies in their "ecological rationality". They use environmental regularities to make smart inferences. For instance, the recognition heuristic exploits the fact that our ignorance is often systematically related to variables we want to infer (for example, we are more likely to recognize big cities, companies, and universities than small ones).

Mathematical analysis can help us to understand where and why Take The Best can (or cannot) be more accurate than a weighted linear model in which the weights are the correlations between cues and the target variable[40], [51]. The set of non-redundant binary cues in any environment is finite. Where known cues are abundant (i.e., their number approaches this finite maximum), weighted linear models tend to be more accurate (and their accuracy approaches 100%), whereas where known cues are scarce (i.e., their number is small relative to log2N, where N is the total number of objects), Take The Best is on average more accurate. In addition, when the weights are noncompensatory; that is, the weight of each cue exceeds the weights of all other below it in the cue order, the faster and more frugal Take The Best cannot be surpassed by any weighted linear model.

Because the information available in the environment (and in the organism's memory) is often scarce, Take The Best may do well in a wide range of real-world situations. However, no single heuristic can make good decisions in every environment because ecological rationality necessarily implies specificity. The more ecological assumptions are built into a heuristic, the less well it will generalize to environments in which those assumptions are not met. Thus, the mind’s "adaptive toolbox" most likely includes a panoply of heuristics suited for use in different situations[52]-[54]. As candidate tools, we and our colleagues have developed fast and frugal heuristics for a variety of problems[40], including categorization (see also [55]), mate choice, and quantitative estimation.

In natural environments, information comes in some forms and not others. Ecologically rational heuristics not only take specific environmental structures for granted, but are tuned to work on specific information representations. Unlike probability theory, real systems - whether computers or brains - are not indifferent to how numerical information is represented. For instance, a pocket calculator has an algorithm for multiplication. However, because it is designed to work on numbers entered in base 10 rather than base 2, it would appear to have no algorithm for multiplication at all if one gave it binary numbers as input[56].

To what representations of numerical information might our cognitive algorithms be tuned? The problems typically used in research on inductive inference express information in terms of probabilities or percentages, which are historically a very recent invention. They would not have been encountered in any form in the environments of our evolutionary ancestors, nor can they be directly experienced today, notwithstanding their ubiquity in the media. A more naturalistic way to represent numerical information is in natural frequencies: absolute frequencies that have not been normalized with respect to the base rates (see Box 1). From these considerations we can predict that our cognitive algorithms are probably designed to reason about numerical information in the form of natural frequencies rather than probabilities.

Box 1

One of the key findings of the heuristics-and-biases program is that people overweight new data relative to base rates in judging posterior probabilities in Bayesian inference problems (e.g., the probability that a person who tests positive for HIV really has it), which is generally referred to as "base-rate neglect"[10]. Some of these results may be attributable to the fact that the Bayesian model taken as normative ignores relevant aspects of content or context[33], [61]. Still, there are problems in which the Bayesian answer seems appropriate and yet to which people give decidedly non-Bayesian responses. How well would people solve these problems if the information were presented in terms of natural frequencies rather than probabilities? As it turns out, they look much more like Bayesians (see Box 1 and [58]). The reason seems to be computational simplicity: whereas plugging the necessary probabilities into Bayes' theorem requires several steps of multiplication and division, computing posterior probabilities from natural frequencies boils down to simply dividing the number of hits (e.g., people who test positive and really have HIV) by the sum of hits and false alarms (e.g., all people who test positive). In other words, the frequency representation does part of the work for us.

Teaching people to convert probabilities into natural frequencies has been shown to be a powerful tool for training students in Bayesian reasoning[59]. Natural frequency representations also help experts, such as physicians, to make diagnostic inferences[60] and have immediate applications in other contexts, for instance, in helping AIDS counselors and their patients interpret HIV test results[61].

Social Rationality

So far we have characterized Simon's ecological blade strictly in terms of the physical environment, but it reflects the social environment (the world of other organisms) as well. We begin our discussion of "social rationality" by describing some situations in which adhering to social norms is rational although it conflicts with internal consistency, which is often seen as the defining feature of rational choice in decision theory and behavioral economics.

In real-world social contexts, consistency in choice is not always in one's best interest. In competitive situations, it is sometimes desirable to exhibit adaptively unpredictable, or protean, behavior, so that other people and animals cannot predict what one will do[62], [63]. For example, our chances of winning a tennis match would be compromised if our opponent knew a stable, consistent order in which we chose to serve to the left or the right during a match. Similarly, when being pursued by a predator that might be able to outrun it, a prey animal would be unwise to flee along a straight, predictable path, even if not doing so means taking longer to cover the same distance[62].

One of the basic principles of internal consistency in choice is known as "Property a". Informally speaking, it requires that if you choose A over B, you should do so independent of the other alternatives in the choice set. At first blush, one might believe that all violations of Property aare irrational. However, our social values and goals may sometimes conflict with this principle. Imagine, for example, that you are at a dinner party. At dessert, it looks as if there are fewer pastries than there are guests. By the time the dessert tray gets to you, there is only one pastry left, a chocolate eclair. If you know that another of the guests has not yet taken a dessert, out of politeness you might choose to have nothing over having the eclair. However, if the host replenishes the pastry supply, you might well choose to eat that same eclair over having nothing. In other words, after choosing B (nothing) over A (the eclair), you might choose A over B just because other items were added to the choice set.

We have probably all violated Property ain similar situations. Does this make us irrational? Not if we take the social environment into account[64]: Being polite pays off. Not being so could anger others and lessen the chances that other people will cooperate with us in the future. Thus, for many of us, violating the social rule "Don't take the last piece of cake" carries higher costs than violating Property a, which, after all, will only offend a handful of decision theorists and economists. Of course, we can always account for this example of inconsistency by arguing that the nature of the eclair changed with the choice set, that is, it ceased to be the last dessert. But the point here is that if we allow the meeting of social expectations into our definition of rationality, then we can predict such choices rather than having to explain them post hoc.

Another context in which being socially rational requires deviating from a content-blind norm is in the famous Linda problem. In it participants read: "Linda is 31 years old, single, outspoken, and very bright.... As a student, she was deeply concerned with issues of discrimination and social justice.... " They are then asked to choose which hypothesis is more "probable": that Linda is a bank teller (T) or that Linda is a bank teller and is active in the feminist movement (T+F). In most studies, 80%-90% of participants rank T+F as more probable than T [65]. This effect - known as the conjunction fallacy - is widely interpreted as a violation of the conjunction rule, according to which the probability of a conjoint event cannot exceed the probability of any of its constituents.

In the Linda problem, participants have to infer what the experimenter means by "probable," a term that in natural language has multiple, related meanings, most of which cannot be reduced to mathematical probability (e.g., "plausible" or "conceivable"). Which of these meanings do they infer? One possible answer can be derived from Paul Grice's[66] theory of conversational reasoning, which holds that it is reasonable for the audience (participant) to assume that the communicator (experimenter) will follow certain social rules governing communication. If they assume that the "relevance maxim, " by which the audience expects the communicator's contribution to be relevant to the conversation, applies in the Linda problem, participants should infer that "probable" does not refer to mathematical probability, because a mathematical interpretation would render the description of Linda irrelevant to the requested judgment[67].

Based on this analysis, one can construct a social context in which people following the relevance maxim are more likely to infer a mathematical meaning. If, immediately before the probability judgment, participants are asked for a judgment that renders Linda's description relevant - such as a typicality judgment (i.e., "How good an example of a bank teller is Linda?") - then adherence to the conjunction rule should increase. Indeed, this is what Hertwig[68] observed: among participants asked to make a typicality judgment first, the percentage who followed the conjunction rule was on average more than 40 percentage points higher than that among participants who made the probability judgment immediately. Other researchers have shown that asking participants to estimate frequencies in conjunction problems (e.g., "How many people like Linda are bank tellers") dramatically increases the percentage of judgments consistent with the conjunction rule[65], [69], perhaps because the frequency representation eschews the ambiguity of the term "probable"[68]. Conversational analysis has revealed still other relevance-preserving inferences drawn in the Linda problem that lead participants to violate the conjunction rule (e.g., [70]).

Such re-analyses of apparent cognitive biases highlight the hazards of confusing with irrationality the human ability to make intelligent semantic and pragmatic inferences[71]. Many researchers are pushing the limits of our knowledge about social rationality in still other ways by studying, for instance, how people reason about deontic conditionals and social contracts[53], [72]-[74] and how emotions, traditionally thought to undermine reason, might actually help people to think and decide rationally[75], [76]. This research demonstrates that social values and goals deserve a place in cognitive explanations and definitions of rationality.

Box 2

Conclusion

We began by challenging the classical vision of human rationality as adherence to the norms of probability theory and logic. Not only are these norms inherently problematic when applied without regard to content and context, but they fail to capture what it means to be rational either in ancestral environments or the modern world. The mind has evolved to tackle important adaptive problems, not to solve mathematical brain-teasers. We argue that to discover how the mind works, and how well, we need to understand how the mind functions under its own constraints - its bounded rationality - and how it exploits the structure of the social and physical environments in which it must reach its goals - its ecological rationality. By adding these perspectives to our theoretical scope, we broaden and deepen our vision of rationality.


Figure captions

Figure 1

Figure 1: Flow diagram of a fast and frugal heuristic: Take The Best (Copyright 1996, American Psychological Association; modified, with persmission, from [49].)
<back to the text>

 

Boxes

Box 1. Anyone can be a Bayesian

Consider the way in which information is represented in the mammography problem (adapted from [77]):

The probability of breast cancer is 1% for a woman at age forty who participates in routine screening. If a woman has breast cancer, the probability is 80% that she will have a positive mammography. If a woman does not have breast cancer, the probability is 9.6% that she will also have a positive mammography.

A woman in this age group had a positive mammography in a routine screening. What is the probability that she actually has breast cancer? ____%

Inserting the given numbers into Bayes' thereom (see Box 1) gives a posterior probability, p(cancer|positive), of 7.8%. However, 95 out of 100 physicians who solved this problem estimated the probability to be between 70% and 80%, an order of magnitude greater than that given by Bayes' theorem. This is a gross error, one that in a real medical context could have serious consequences for patients' well being (at least until a biopsy can be performed). This apparent example of base-rate neglect seems to justify the dire conclusions about human rationality drawn in the heuristics-and-biases program.

However, notice that the mammography problem above is presented in terms of single-event probabilities (e.g., that a particular woman has breast cancer). Human cognitive algorithms for this type of inference, if they exist, are most likely designed to operate on numerical information in the form in which humans have gathered it over evolutionary history: natural frequencies, that is, absolute frequencies that have not been normalized with respect to the base rates. Here is how the mammography problem looks when expressed in natural frequencies:

Ten out of every 1,000 women at age 40 who participate in routine screening have breast cancer. Eight out of these 10 women with breast cancer will get a positive mammography. Of the 990 women without breast cancer, 95 will also get a positive mammography.

Here is a new representative sample of women at age 40 who got a positive mammography in routine screening. How many of these women do you expect actually to have breast cancer? ____

When students who had never heard of Bayesian inference were given problems like this one in frequencies, they responded like Bayesians in 46% of problems, whereas students who received them in probabilities solved only 16% correctly[76]. Among physicians with an average of 14 years of professional experience, the benefit of frequencies was equally strong: 46% Bayesian responses (again) compared to 10% with probabilities[58]. In a less complex medical decision-making problem, the percentage of Bayesian solutions found with frequencies rose to 76%[56].
<back to the text>

 

Figure 2

 

Figure 2: Differential complexity of Bayesian computations based on probabilities (left) and natural frequencies (right). (see text for explanation.) (Copyright 1995, American Psychological Association; modified, with permission, from from Ref b

Figure 2 shows two people trying to solve the mammography problem. While the person on the left struggles to combine the probabilities according to Bayes' theorem, the person on the right simply has to divide the number of hits (women with cancer and positive mammography) by the total number of hits and false alarms (all women with positive mammography). Thus, instead of taking in frequency information, converting it to probabilities, and plugging them into Bayes' theorem, the mind can simply tally the frequencies and perform a simpler computation. This is an example of how cognitive algorithms make the environment do some of the work for them: Natural frequencies carry base-rate information without explicitly representing it.
<back to the text>

 

Box 2. Outstanding questions

It has been proposed that the mind has an "adaptive toolbox" of specialized cognitive heuristics suited to different problems[52]-[54]. What heuristics are in the adaptive toolbox? What environmental cues might determine whether one heuristic is triggered as opposed to another?

What are the structures of the environments in which specific cognitive heuristics perform well or badly, and why? To what extent are people sensitive to these structures and in what terms can we describe them?

The recognition heuristic relies on recognition memory, which develops early ontogenetically and arose early phylogenetically. What other fundamental psychological abilities might serve as building blocks for fast and frugal heuristics?

What criteria other than accuracy (e.g., speed, computational complexity) are relevant to evaluating the performance of different heuristics? How well do fast and frugal heuristics fare relative to optimal ones by these criteria? Which heuristics come closest to mimicking human performance on these various measures?

What role do emotions and culture play in bounded rationality? How might specific emotions, such as love and disgust, help people to make adaptive decisions (e.g., by stopping information search)? Does following social norms provide a fast and frugal way to bypass deliberation by the individual?
<back to the text>



Acknowledgments

We are grateful to Peter Ayton, Seth Bullock, L. Elizabeth Crawford, David E. Over, and an anonymous reviewer for helpful comments on an earlier draft of this review, and to Laura Martignon and Peter M. Todd for fruitful discussion of some of its major points.

 

References

[1] Klein, G. (1998) Sources of power: How people make decisions. MIT Press
[2] Petrie, M. (1994) Improved growth and survival of offspring of peacocks with more elaborate trains. Nature, 371, 598-599
[3] Laplace, P.S. (1951) A philosophical essay on probabilities (F. W. Truscott & F. L. Emory, Trans.), Dover. (Original work published 1814)
[4] Lipps, T. (1880) Die Aufgabe der Erkenntnistheorie und die Wundt'sche Logik I. (The task of epistemology and Wundtian logic I.) Philosophische Monatshefte, 16, 529-539
[5] Daston, L. (1988) Classical probability in the Enlightenment. Princeton University Press
[6] Peterson, C. R. & Beach, L. R. (1967) Man as an intuitive statistician. Psychological Bulletin, 68, 29-46
[7]

Rips, L.J. (1994) The psychology of proof, MIT Press

[8] Kreps, D. M. (1990) A course in microeconomic theory. Princeton University Press
[9] von Winterfeldt, D., & Edwards, W. (1986) Decision analysis and behavioral research. Cambridge University Press
[10] Kahneman, D., Slovic, P., & Tversky, A. (1982) Judgment under uncertainty: Heuristics and biases. Cambridge University Press
[11] Evans, J. St. B. T., Newstead, S. E., & Byrne, R. M. J. (1993). Human reasoning: The psychology of deduction. Hove, England: Lawrence Erlbaum
[12] Lopes, L.L. (1991) The rhetoric of irrationality. Theory & Psychology, 1, 65-82
[13] Slovic, P., Fischhoff, B., & Lichtenstein, S. (1976) Cognitive processes and societal risk taking. In J. S. Carrol & J. W. Payne (Eds.), Cognition and social behavior. Erlbaum
[14] Gould, S. J. (1992) Bully for brontosaurus: Further reflections in natural history. Penguin Books
[15] Thaler, R. (1991) Quasi rational economics, Russell Sage Foundation
[16] Piattelli-Palmarini, M. (1994) Inevitable illusions: How mistakes of reason rule our minds, John Wiley & Sons
[17] Tversky, A., & Kahneman, D. (1974) Judgment under uncertainty: Heuristics and biases. Science, 185, 1124-1131
[18] Brunswik, E. (1956) Perception and the representative design of experiments (2nd Ed). University of California Press
[19] Cooksey, R. W. (1996) Judgment analysis: Theory, methods, and applications. Academic Press
[20] Hammond, K. (1996). Human judgment and social policy. Oxford University Press
[21] Gigerenzer, G., Hoffrage, U., & Kleinbölting, H. (1991) Probabilistic mental models: A Brunswikian theory of confidence. Psychological Review, 98, 506-528
[22] Juslin, P. (1994) The overconfidence phenomenon as a consequence of informal experimenter-guided selection of almanac items. Organizational Behavior and Human Decision Processes, 57, 226-246
[23] Soll, J.B. (1996) Determinants of overconfidence and miscalibration: The roles of random error and ecological structure. Organizational Behavior and Human Decision Processes, 65, 117-137
[24] Griffin, D., & Tversky, A. (1992) The weighing evidence and the determinants of confidence. Cognitive Psychology, 24, 411-435
[25] Juslin, P., Olsson, H., & Winman, A. (in press) The calibration issue: Theoretical comments on Suantak, Bolger, and Ferrell (1996). Organizational Behavior and Human Decision Processes
[26] Sedlmeier, P., Hertwig, R., & Gigerenzer, G. (in press) Are judgments of the positional frequencies of letters systematically biased due to availability? Journal of Experimental Psychology: Learning, Memory, and Cognition
[27] Tversky, A., & Kahneman, D. (1973) Availability: A heuristic for judging frequency and probability. Cognitive Psychology, 5, 207-232
[28] Gigerenzer, G., et al. (1989) The empire of chance: How probability changed science and everyday life. Cambridge University Press
[29] Gigerenzer, G. (1991) How to make cognitive illusions disappear: Beyond "heuristics and biases." In W. Stroebe & M. Hewstone (Eds.), European Review of Social Psychology, 2, 83-115
[30] Gigerenzer, G. (1994) Why the distinction between single-event probabilities and frequencies is important for psychology (and vice versa). In G. Wright and P. Ayton, Subjective probability, pp 129-161, John Wiley & Sons
[31] Kahneman, D., & Tversky, A. (1996) On the reality of cognitive illusions: A reply to Gigerenzer's critique. Psychological Review, 103, 582-591
[32] Gigerenzer, G. (1996) On narrow norms and vague heuristics: A reply to Kahneman and Tversky (1996). Psychological Review, 103, 592-596
[33] Birnbaum, M. H. (1983) Base rates in Bayesian inference: Signal detection analysis of the cab problem. American Journal of Psychology, 96, 85-94
[34] Anderson, J. R. (1990) The adaptive character of thought. Erlbaum
[35] Simon, H. A. (1991) Cognitive architectures and rational analysis: Comment. In K. van Lehn (Ed.), Architectures for intelligence, pp 25-40, Lawrence Erlbaum Associates
[36] Wimsatt, W. C. (1976) Reductionism, levels of organization, and the mind-body problem. In G. G. Globus, G. Maxwell, & I Savodnik (Eds.), Consciousness and the brain: A scientific and philosophical inquiry, pp 199-267, Plenum
[37] Simon, H. A. (1990a) Invariants of human behavior. Annual Review of Psychology, 41, 1-19

[38]

Simon, H. A. (1990b) Alternative visions of rationality. In P. K. Moser (Ed.), Rationality in action: Contemporary approaches, pp. 189-204, Cambridge University Press
[39] Anderson, J. R. (1991a) The place of cognitive architecture in a rational analysis. In K. van Lehn (Ed.), Architectures for intelligence, pp 1-24, Lawrence Erlbaum Associates
[40] Gigerenzer, G., Todd, P. M., & the ABC Research Group (1999) Simple heuristics that make us smart. Oxford University Press
[41] Anderson, J. R. (1991b) Is human cognition adaptive? Behavioral and Brain Sciences, 14, 471-517
[42] Sargent, T.J. (1993) Bounded Rationality in Macroeconomics, Clarendon Press
[43] Stigler, G.J. (1961) The economics of information. The Journal of Political Economy, 69, 213-225
[44] Vriend, N. J. (1996) Rational behavior and economic theory. Journal of Economic Behavior and Organization, 29, 263-285
[45] Oaksford, M., & Chater, N. (1994) A rational analysis of the selection task as optimal data selection. Psychological Review, 101, 608-631
[46] Klayman, J., Ha, Y.-W. (1987) Confirmation, disconfirmation, and information in hypothesis testing. Psychological Review, 94, 211-228
[47] Oaksford, M., & Chater, N. (1992) Bounded rationality in taking risks and drawing inferences. Theory & Psychology, 2, 225-230
[48] Smith, E. E., & Osherson, D. N. (1989) Similarity and decision making. In S. Vosniadou & A. Ortony (Eds.), Similarity and analogical reasoning (pp. 60-75). Cambridge University Press
[49] Gigerenzer, G., & Goldstein, D. G. (1996) Reasoning the fast and frugal way: Models of bounded rationality. Psychological Review, 103, 650-669
[50] Gigerenzer, G., Czerlinski, J., & Martignon, L. (in press) How good are fast and frugal heuristics? In J. Shanteau, B. Mellers, & D. Schum, Decision research from Bayesian approaches to normative systems: Reflections on the contributions of Ward Edwards. Kluwer
[51] Martignon, L. & Hoffrage, U. (1999) Why does one-reason decision making work? In G. Gigerenzer, P. M. Todd, and the ABC Research Group, Simple heuristics that make us smart. Oxford University Press
[52] Payne, J. W., Bettman, J. R., & Johnson, E. J. (1993) The adaptive decision maker. Cambridge University Press
[53] Barkow, J. H., Cosmides, L., & Tooby, J. (1992) The adapted mind: Evolutionary psychology and the generation of culture. Oxford University Press
[54] Pinker, S. (1997). How the mind words. W. W. Norton
[55] Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1993) Classification and regression trees. Chapman and Hall
[56] Marr, D. (1982) Vision: A computational investigation into the human representation and processing of visual information. Freeman
[57] Koehler, J. J. (1996) The base rate fallacy reconsidered: Descriptive, normative, and methodological challenges. Behavioral and Brain Sciences, 19, 1-53
[58] Cosmides, L. & Tooby, J. (1997) Are humans good intuitive statisticians after all? Rethinking some conclusions from the literature on judgment under uncertainty. Cognition, 58, 1-73
[59] Sedlmeier, P. (1997) BasicBayes: A tutor system for simple Bayesian inference. Behavior Research Methods, Instruments, & Computers, 29, 328-336
[60] Hoffrage, U., & Gigerenzer, G. (in press) Using natural frequencies to improve diagnostic inferences. Academic Medicine
[61] Gigerenzer, G., Hoffrage, U., & Ebert, A. (1998) AIDS counseling for low-risk clients. AIDS Care, 10, 197-211
[62] Driver, P. M., & Humphries, D. A. (1988) Protean behavior: The biology of unpredictability. Oxford University Press
[63] Miller, G. F. (1997) Protean primates: The evolution of adaptive unpredictability in competition and courtship. In White, A., & Byrne, R. W. (Eds.), Machiavellian intelligence, II. Cambridge University Press
[64] Sen, A. (1993) Internal consistency of choice. Econometrica, 61, 495-521
[65] Tversky, A., & Kahneman, D. (1983) Extensional versus intuitive reasoning: T he conjunction fallacy in probability judgment. Psychological Review, 90, 293-315
[66]

Grice, H. P. (1989) Studies in the way of words. Harvard University Press

[67] Adler, J. E. (1991) An optimist's pessimism: Conversation and conjunction. Posnan Studies in the Philosophy of the Sciences and Humanities, 21, 251-282
[68] Hertwig, R. (1995) Why Dr. Gould's homunculus doesn't think like Dr. Gould: The "conjunction fallacy" reconsidered. Hartung-Gorre
[69] Fiedler, K. (1988) The dependence of the conjunction fallacy on subtle linguistic factors. Psychological Research, 50, 123-129
[70] Politzer, G., & Noveck, I. A. (1991) Are conjunction rule violations the result of conversational rule violations? Journal of Psycholinguistic Research, 20, 83-103
[71] Hilton, D. J. (1995) The social context of reasoning: Conversational inference and rational judgment. Psychological Bulletin, 118, 248-271
[72] Gigerenzer, G., & Hug, K. (1992) Domain-specific reasoning: Social contracts, cheating, and perspective change. Cognition, 43, 127-171
[73] Holyoak, K. J., & Cheng, P. W. (1995) Pragmatic reasoning with a point of view. Thinking & Reasoning, 1, 289-313
[74] Manktelow, K. I., & Over, D. E. (1991) Social roles and utilities in reasoning with deontic conditionals. Cognition, 43, 183-188
[75]

Damasio, A. R. (1994) Descartes' error: Emotion, reason, and the human brain. G. P. Putnam's Sons

[76] Frank, R. H. (1988) Passion within Reason, Norton & Company

   

 

   
         
         
  Contact Author    
This paper is an electronic archival version of an article published in a print journal.
Please cite according to the published journal version.
   
       
    » Home   » The Institute   » Electronic Full Texts   
  Update 6/2001   » webmaster-library(at)mpib-berlin.mpg.de
» ©Copyright