Royal Society Publishing

Natural selection can favour ‘irrational’ behaviour

J. M. McNamara , P. C. Trimmer , A. I. Houston

Abstract

Understanding decisions is the fundamental aim of the behavioural sciences. The theory of rational choice is based on axiomatic principles such as transitivity and independence of irrelevant alternatives (IIA). Empirical studies have demonstrated that the behaviour of humans and other animals often seems irrational; there can be a lack of transitivity in choice and seemingly irrelevant alternatives can alter decisions. These violations of transitivity and IIA undermine rational choice theory. However, we show that an individual that is maximizing its rate of food gain can exhibit failure of transitivity and IIA. We show that such violations can be caused because a current option may disappear in the near future or a better option may reappear soon. Current food options can be indicative of food availability in the near future, and this key feature can result in apparently irrational behaviour.

1. Introduction

Rational choice is typically assumed to conform to axiomatic principles such as transitivity (if A is preferred to B and B preferred to C, then A is preferred to C) and independence of irrelevant alternatives (IIA; if A is the preferred option when A, B and C are all available, then A will be preferred to B when C is not available; see IIA(R-M) in [1]. Empirical studies have demonstrated that the behaviour of humans and other animals can violate transitivity [2,3] and IIA [4,5], leading to the claim that such behaviour is irrational.

We use simple models to show that such behaviour can be optimal; i.e. can be favoured by natural selection. The models are based on the standard prey-choice paradigm in which a forager chooses items in such a way that its long-term rate of energy gain is maximized [6]. Our aim is not to claim that real animals always behave so as to maximize their long-term rate of energy gain; this is simply a plausible and straightforward currency that enables us to expose the logic of behaviour which might appear to be irrational.

2. Modelling approach

We consider a foraging individual that is attempting to maximize its average rate of energy gain over an extended foraging period. Three sources of food are labelled options A, B and C. In contrast to standard models of prey choice (e.g. [6]) options are not always available. Sometimes there may be no options available to the individual, sometimes one option, sometimes two to choose between and sometimes all three options are available. If option X is chosen at a given time, the individual obtains a food item of energy content eX that takes time hX to handle. While it is handling a food item the individual is not able to forage; it is only when it has finished handling that it again chooses from the available options. This means that there is an opportunity cost associated with choosing an item, since a better option may appear while it is handling the current item. Consequently, the decision about whether to accept an option needs to take into account the alternative options currently available, the alternatives that may be missed as a result of handling, and what is likely to be available after handling the current option that is chosen.

Without loss of generality, we rank the items according to their profitability, e/h: eA/hA < eB/hB < eC/hC.

When the individual is not handling it can choose an available item (if there are any) or choose to do nothing for a unit of time.

Suppose all options are always present. When a choice is made between simultaneously available items, the behaviour that maximizes the rate of energetic gain depends on the time between decisions [7]. In our model, the next choice is made at the end of the current handling time. Under such conditions (i.e. with no delay between handling one item and starting the next), rate of gain is maximized by repeatedly choosing the option with the highest profitability; i.e. C is chosen, the individual handles the resultant item for time hC, chooses C again, and so on. Similarly, if the set of available options is the same fixed subset of those that are possible, it is always optimal to take the option with the greatest profitability. Thus there is a strict preference order A < B < C. Consequently, there are no IIA or transitivity violations.

However, in the real world possibilities change over time. It may be worth taking an option before it becomes unavailable or ignoring a profitable option in the expectation that a better one will soon appear. To allow for such possibilities we suppose that an option that is present at a given time has a probability of disappearing in each unit of time. Similarly, if an option is not present it may reappear and become available in the next unit of time. We assume that when the individual has chosen not to take an item (or been forced by lack of availability), a unit of time elapses before options are assessed again. Each of the three options A, B, C is assumed to appear and disappear independently the others, and independently of choices made (i.e. there is no depletion effect from taking an option).

When the options that are available can change, both the profitability of an item and the absolute amount of energy that the item yields are relevant to choice, as the two following examples illustrate.

3. Disappearance of options

Consider the options as shown in table 1.

View this table:
Table 1.

Baseline parameters (disappearance case).

A strategy specifies which option to choose for each possible combination of available options; i.e. it is a contingent rule for choice. The long term mean rate of energy gain to an individual depends on the strategy that the individual follows. For the options shown in table 1, computations of the optimal strategy (see the electronic supplementary material) reveal the preferences given in table 2. The table also gives the intuitive reasons why these preferences occur.

View this table:
Table 2.

Preferences between options from table 1.

When only two options are available at a time, it can be seen that B is preferred to A, C is preferred to B and A is preferred to C. Thus transitivity is violated.

It can be seen that A is preferred to C when B is not available but C is preferred when B is also available. Thus, the introduction of a non-preferred option (B) changes the preference between A and C; i.e. the principle of IIA is violated.

By varying the energy content and handling time of option B, we obtain figure 1.

Figure 1.

Energy and handling time parameters associated with option B that produce violations of transitivity and IIA. Above the region of transitivity violation, option B is always preferred. Below the region of IIA violation, B is never preferred. Options A and C are shown as black squares on the graph.

The violations in figure 1 are driven by the different probabilities of options disappearing.

4. Reappearance of options

Table 3 shows parameter values for the options which result in the optimal preferences that are shown in table 4. Again, this shows violations of transitivity and IIA, but rather than being driven by the probable disappearance of currently available options (as above), the effect is largely driven by the probable reappearance of option B (when it is not currently available).

View this table:
Table 3.

Baseline parameters (reappearance case).

View this table:
Table 4.

Preferences between options from table 3.

Figure 2 shows that violations of IIA and transitivity occur over a range of parameter values. If the probability of option B reappearing is low, option A is preferred to option C since it gives the greater amount of energy and the system is transitive. If option B is almost certain to reappear, it is preferred to option A since it has a greater rate of gain, again resulting in a transitive system.

Figure 2.

Regions of violation with respect to probability of reappearance and energy content of option B, holding all other values at those of table 3.

5. Discussion

Individuals sometimes make decisions which conform with transitivity and IIA [8] but violations of these principles are reported in many situations [4,912]. We have shown that it can be adaptive for these principles to be broken.

The long-term mean rate of energy gain to an individual depends on the strategy that the individual follows. Different strategies can be ranked in order of their mean rate of energy gain. Thus, in comparing different strategies, there cannot be any violations of transitivity or IIA. We have based our analysis on the choices made under the optimal strategy, i.e. the strategy maximizing mean gain rate. Although there is a strict ordering of strategies, it does not mean that there is a strict ordering of preference for food options under the optimal strategy; transitivity and IIA can fail to hold when an individual follows this strategy.

Our results are stronger than any previously obtained in this area. Houston [13] shows that stochastic transitivity can be violated when an individual maximizes the rate subject to errors and costly errors are rare. We obtain violations when choice is deterministic and not subject to errors. Houston et al. [14] assume that an individual makes optimal choices between options on the basis of its level of energy reserves. They show that transitivity can be violated at particular (intermediate) levels of reserves. Trimmer [15] uses the same approach to show that IIA can be violated. In both cases decisions are context-dependent because the available options influence the individual's future expectations. The effect of the future arises because future levels of energy reserves might require the non-selected option to be chosen. We have shown in the simple context of rate maximization that violations of transitivity and IIA do not require errors or decisions that depend on energy reserves.

IIA is very similar (or equivalent) to principles which go by other names: the principle of regularity, Sen's property, α, the constant-ratio-rule and Luce's choice axiom. For a summary of various definitions and their logical differences, see Ray [1]. Behaviour which does not conform to IIA also goes by more than one name, such as the violation of regularity, and the asymmetrically dominated decoy effect. Others (e.g. Tsetsos et al. [16]) have attempted to explain such violations by referring to mechanistic models of computational processes—i.e. they assume that such behaviour comes about through constraints. By dealing with cases where individuals always make the same choice given the same range of options, we have demonstrated that all forms of IIA can be violated by an optimal decision-maker.

A formal logician might argue that neither transitivity nor IIA has necessarily been violated, because the choice sets can alter over time. However, it would not be possible to test for transitivity or IIA if the choice set could not alter, so from an operational perspective (i.e. measuring the preferences of animals by giving them options), it is clear that the principles of transitivity and IIA should sometimes be violated if fitness is to be maximized.

Rationality is a fundamental principle of decision-making in economics, psychology and philosophy. Economic rationality is based on the maximization of utility [17]. The existence of a utility function is guaranteed given certain consistency conditions on decisions. The consequence is that utility is inferred from the data. In evolutionary biology, there is always a utility measure known as reproductive value. The reproductive value of an individual is defined as its expected future lifetime reproductive success; this typically depends on the state of the individual [18]. In our model, the state of the animal is the set of options which are currently available. Each possible action has an associated reproductive value. Natural selection favours individuals that behave as if they are maximizing reproductive value [18]. This means that in contrast to the descriptive utility function in economics, reproductive value is a normative measure of utility, i.e. it specifies the decision that should be made [17,19]. As we show, even individuals that behave optimally can violate IIA and transitivity. This can occur because options do not have absolute reproductive values; the value of an option depends on the future [13]. The key point is that reproductive value is a future expectation and currently available options provide information about the future and hence affect reproductive value. In other words reproductive value cannot be assigned to options individually; the value of choosing an option depends on the other options that are present. The options presented to humans and other animals often give information about an individual's future prospects. Thus, the insight gained from our particular model applies to a whole range of choice phenomena, with widespread implications for how human choices—and the behaviour of other animals—should be analysed.

Funding statement

This work was supported by the European Research Council (Evomech Advanced grant no. 250209 to A.I.H.).

Acknowledgements

Thanks to Sean Collins for mathematical help with the appendix (available as the supplementary material).

  • Received November 6, 2013.
  • Accepted December 19, 2013.

References

View Abstract