A counterfactual conditional, subjunctive conditional, or remote conditional, abbreviated cf, is a conditional (or "if-then") statement indicating what would be the case if its antecedent were true (although it is not true). This is to be contrasted with an indicative conditional, which indicates what is (in fact) the case if its antecedent is (in fact) true (which it may or may not be).
The difference between indicative and counterfactual conditionals, in a context of past time reference, can be illustrated with a pair of examples in which the if clause is in the past indicative in the first example but in the pluperfect subjunctive in the second:
If Oswald did not shoot Kennedy, then someone else did.
If Oswald had not shot Kennedy, then someone else would have.
The protasis (the if clause) of the first sentence may or may not be true according to the speaker, so the apodosis (the then clause) also may or may not be true; the speaker asserts only that the apodosis is true if the protasis is true. In this sentence the if clause and the then clause are both in the past tense of the indicative mood. In the second sentence, the speaker takes it as certain that Oswald did shoot Kennedy (so, according to the speaker, the protasis is false), and the main clause therefore describes the counterfactual result: what would have happened. In this sentence the if clause is in the pluperfect subjunctive, and the then clause is in the conditional perfect.
A corresponding pair of examples with present time reference uses the present indicative in the if clause of the first sentence but the past subjunctive in the second sentence's if clause:
Here again, in the first sentence the if clause may or may not be true; the then clause may or may not be true but certainly (according to the speaker) is true conditional on the if clause being true. Here both the if clause and the then clause are in the present indicative. In the second sentence, the if clause is not true, while the then clause may or may not be true but certainly would be true in the counterfactual circumstance of the if clause being true. In this sentence the if clause is in the past subjunctive, and the then clause is in the conditional mood.
People engage in counterfactual thinking frequently. Experimental evidence indicates that people's thoughts about counterfactual conditionals differ in important ways from their thoughts about indicative conditionals.
Participants in experiments were asked to read sentences, including counterfactual conditionals, e.g., 'if Mark had left home early he would have caught the train'. Afterwards they were asked to identify which sentences they had been shown. They often mistakenly believed they had been shown sentences corresponding to the presupposed facts, e.g., 'Mark did not leave home early' and 'Mark did not catch the train' (Fillenbaum, 1974). In other experiments, participants were asked to read short stories that contained counterfactual conditionals, e.g., 'if there had been roses in the flower shop then there would have been lilies'. Later in the story they read sentences corresponding to the presupposed facts, e.g., 'there were no roses and there were no lilies'. The counterfactual conditional 'primed' them to read the sentence corresponding to the presupposed facts very rapidly; no such priming effect occurred for indicative conditionals (Santamaria, Espino, and Byrne, 2005). Readers also spend different amounts of time 'updating' a story that contains a counterfactual conditional compared to one that contains factual information (De Vega, Urrutia, and Riffo, 2007), and they focus on different parts of counterfactual conditionals (Ferguson and Sanford, 2008).
Experiments have compared the inferences people make from counterfactual conditionals and indicative conditionals. Given a counterfactual conditional, e.g., 'If there had been a circle on the blackboard then there would have been a triangle', and the subsequent information 'in fact there was no triangle', participants make the modus tollens inference 'there was no circle' more often than they do from an indicative conditional (Byrne and Tasso, 1999). Given the counterfactual conditional and the subsequent information 'in fact there was a circle', participants make the modus ponens inference as often as they do from an indicative conditional.
Ruth M.J. Byrne proposed that people construct mental representations that encompass two possibilities when they understand, and reason from, a counterfactual conditional, e.g., 'if Oswald had not shot Kennedy, then someone else would have'. They envisage the conjecture 'Oswald did not shoot Kennedy and someone else did' and they also think about the presupposed facts 'Oswald did shoot Kennedy and someone else did not' (Byrne, 2005). According to the mental model theory of reasoning, they construct mental models of the alternative possibilities (Johnson-Laird and Byrne, 1991).
In order to distinguish counterfactual conditionals from material conditionals, a new logical connective '>' is defined, where A > B can be interpreted as "If it were the case that A, then it would be the case that B."
The truth value of a material conditional, A → B, is determined by the truth values of A and B. This is not so for the counterfactual conditional A > B: different situations can agree on the truth values of A and B yet yield different evaluations of A > B. For example, if Keith is in Germany, the following two conditionals have both a false antecedent and a false consequent:
(1) If Keith were in Mexico, then he would be in Africa.
(2) If Keith were in Mexico, then he would be in North America.
Indeed, if Keith is in Germany, then all three conditions "Keith is in Mexico", "Keith is in Africa", and "Keith is in North America" are false. However, (1) is obviously false, while (2) is true as Mexico is part of North America.
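The point about truth-functionality can be checked mechanically. The following is a minimal sketch, assuming that conditional (1) has the consequent "Keith is in Africa" and conditional (2) the consequent "Keith is in North America", both with the antecedent "Keith is in Mexico". The function name `material` and the dictionary keys are illustrative choices, not part of any standard notation.

```python
def material(a: bool, b: bool) -> bool:
    """Material conditional A -> B: false only when A is true and B is false."""
    return (not a) or b

# The actual situation: Keith is in Germany, so all three statements are false.
facts = {"in_mexico": False, "in_africa": False, "in_north_america": False}

# Read as material conditionals, (1) and (2) receive the same inputs
# (false antecedent, false consequent) and therefore the same truth value.
cond1 = material(facts["in_mexico"], facts["in_africa"])
cond2 = material(facts["in_mexico"], facts["in_north_america"])

print(cond1, cond2)
```

Because both calls receive identical arguments, no truth-functional connective could make (1) false and (2) true; that difference has to come from something beyond the actual truth values.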
Philosophers such as David Lewis and Robert Stalnaker modeled counterfactuals using the possible world semantics of modal logic. The semantics of a conditional A > B are given by some function on the relative closeness of worlds where A is true and B is true, on the one hand, and worlds where A is true but B is not, on the other.
On Lewis's account, A > C is (a) vacuously true if and only if there are no worlds where A is true (for example, if A is logically or metaphysically impossible); (b) non-vacuously true if and only if, among the worlds where A is true, some worlds where C is true are closer to the actual world than any world where C is not true; or (c) false otherwise. Although in Lewis's Counterfactuals it was unclear what he meant by 'closeness', in later writings Lewis made it clear that he did not intend the metric of 'closeness' to be simply our ordinary notion of overall similarity.
Consider an example:
If I had eaten more at breakfast, I would not have been hungry at 11am.
On Lewis's account, the truth of this statement consists in the fact that, among possible worlds where I ate more for breakfast, there is at least one world where I am not hungry at 11am and which is closer to our world than any world where I ate more for breakfast but am still hungry at 11am.
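Over a finite set of worlds, the three clauses of Lewis's account can be sketched directly. This is only an illustration under strong assumptions: the worlds, the numeric `distance` values, and the name `lewis_would` are invented for the example, and a single number per world stands in for Lewis's comparative closeness ordering.

```python
from typing import Callable, Dict, List

World = Dict[str, object]

def lewis_would(worlds: List[World],
                antecedent: Callable[[World], bool],
                consequent: Callable[[World], bool]) -> bool:
    """Evaluate A > C over finitely many worlds with a numeric distance."""
    a_worlds = [w for w in worlds if antecedent(w)]
    if not a_worlds:
        return True  # clause (a): vacuously true, no A-worlds at all
    c_dist = [w["distance"] for w in a_worlds if consequent(w)]
    not_c_dist = [w["distance"] for w in a_worlds if not consequent(w)]
    if not c_dist:
        return False  # clause (c): no A-world where C is true
    if not not_c_dist:
        return True   # every A-world is a C-world
    # clause (b): some A-and-C world is closer than every A-and-not-C world
    return min(c_dist) < min(not_c_dist)

# Hypothetical worlds; distance 0 is the actual world.
worlds = [
    {"ate_more": False, "hungry_at_11": True,  "distance": 0},
    {"ate_more": True,  "hungry_at_11": False, "distance": 1},
    {"ate_more": True,  "hungry_at_11": True,  "distance": 2},
]

result = lewis_would(worlds,
                     lambda w: w["ate_more"],
                     lambda w: not w["hungry_at_11"])
print(result)
```

With these assumed distances, the world where I ate more and am not hungry is closer than the one where I ate more and am still hungry, so the counterfactual comes out true.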
Stalnaker's account differs from Lewis's most notably in his acceptance of the Limit and Uniqueness Assumptions. The Uniqueness Assumption is the thesis that, for any antecedent A, there is a unique closest possible world where A is true, while the Limit Assumption is the thesis that, for a given antecedent A, there is a unique set of worlds where A is true that are closest. (Notice that the Uniqueness Assumption entails the Limit Assumption, but the Limit Assumption does not entail the Uniqueness Assumption.) On Stalnaker's account, A > C is non-vacuously true if and only if, at the closest world where A is true, C is true. So, the above example is true just in case at the single, closest world where I eat more breakfast, I don't feel hungry at 11am. Although it is controversial, Lewis rejected the Limit Assumption (and therefore the Uniqueness Assumption) because it rules out the possibility that there might be worlds that get closer and closer to the actual world without limit. For example, there might be an infinite series of worlds, each with my coffee cup a smaller fraction of an inch to the left of its actual position, but none of which is uniquely the closest. (See Lewis 1973: 20.)
One consequence of Stalnaker's acceptance of the Uniqueness Assumption is that, if the law of excluded middle is true, then all instances of the formula (A > C) ∨ (A > ¬C) are true. The law of excluded middle is the thesis that for all propositions p, p ∨ ¬p is true. If the Uniqueness Assumption is true, then for every antecedent A, there is a uniquely closest world where A is true. If the law of excluded middle is true, any consequent C is either true or false at that world where A is true. So for every counterfactual A > C, either A > C or A > ¬C is true. This is called conditional excluded middle (CEM). Consider the following example:
(1) If the coin had been flipped, it would have landed heads.
(2) If the coin had been flipped, it would have landed tails.
On Stalnaker's analysis, there is a closest world where the coin mentioned in (1) and (2) is flipped, and at that world either it lands heads or it lands tails. So either (1) is true and (2) is false, or (1) is false and (2) is true. On Lewis's analysis, however, both (1) and (2) are false, for the worlds where the coin lands heads are no more or less close than the worlds where it lands tails. For Lewis, 'If the coin had been flipped, it would have landed heads or tails' is true, but this does not entail 'If the coin had been flipped, it would have landed heads, or: If the coin had been flipped, it would have landed tails.'
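The contrast on the coin example can be reproduced with a small model. This is a sketch under stated assumptions: the three worlds and their `distance` values are invented, a heads-world and a tails-world are taken to be equally close, and Stalnaker's selection function is modelled by an arbitrary tie-break (Python's `min` returns the first minimal element), which is a stand-in rather than anything Stalnaker specifies.

```python
def a_worlds_of(worlds, antecedent):
    return [w for w in worlds if antecedent(w)]

def lewis_would(worlds, antecedent, consequent):
    """True iff some A-and-C world is closer than every A-and-not-C world."""
    a_worlds = a_worlds_of(worlds, antecedent)
    if not a_worlds:
        return True
    c = [w["distance"] for w in a_worlds if consequent(w)]
    not_c = [w["distance"] for w in a_worlds if not consequent(w)]
    if not c:
        return False
    return not not_c or min(c) < min(not_c)

def stalnaker_would(worlds, antecedent, consequent):
    """True iff C holds at the single selected closest A-world."""
    a_worlds = a_worlds_of(worlds, antecedent)
    if not a_worlds:
        return True
    # Assumption: ties are broken by list order to force a unique world.
    selected = min(a_worlds, key=lambda w: w["distance"])
    return consequent(selected)

worlds = [
    {"flipped": False, "side": None,    "distance": 0},  # actual world
    {"flipped": True,  "side": "heads", "distance": 1},
    {"flipped": True,  "side": "tails", "distance": 1},
]

flipped = lambda w: w["flipped"]
heads = lambda w: w["side"] == "heads"
tails = lambda w: w["side"] == "tails"
either = lambda w: heads(w) or tails(w)

lewis = (lewis_would(worlds, flipped, heads),
         lewis_would(worlds, flipped, tails),
         lewis_would(worlds, flipped, either))
stalnaker = (stalnaker_would(worlds, flipped, heads),
             stalnaker_would(worlds, flipped, tails))
print(lewis, stalnaker)
```

In this model the Lewis-style evaluation makes (1) and (2) both false while the disjunctive consequent is true, and the Stalnaker-style evaluation makes exactly one of (1) and (2) true, so conditional excluded middle holds only for the latter.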
Counterfactual conditionals may also be evaluated using the so-called Ramsey test: A > B holds if and only if the addition of A to the current body of knowledge has B as a consequence. This condition relates counterfactual conditionals to belief revision, as the evaluation of A > B can be done by first revising the current knowledge with A and then checking whether B is true in what results. Revising is easy when A is consistent with the current beliefs, but can be hard otherwise. Every semantics for belief revision can be used for evaluating conditional statements. Conversely, every method for evaluating conditionals can be seen as a way for performing revision.
Ginsberg (1986) has proposed a semantics for conditionals which assumes that the current beliefs form a set of propositional formulae, considering the maximal sets of these formulae that are consistent with A, and adding A to each. The rationale is that each of these maximal sets represents a possible state of belief in which A is true that is as similar as possible to the original one. The conditional statement A > B therefore holds if and only if B is true in all such sets.
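For a handful of propositional atoms, this construction can be sketched by brute force. The following is an illustrative reading, not Ginsberg's own formulation: formulae are modelled as Python functions over truth assignments, "B is true in all such sets" is read as "each set entails B", and the names `ATOMS`, `ginsberg_would`, and the blackboard beliefs are assumptions made for the example.

```python
from itertools import combinations, product

ATOMS = ["circle", "triangle"]

def models(formulas):
    """All truth assignments over ATOMS that satisfy every formula."""
    assignments = (dict(zip(ATOMS, vals))
                   for vals in product([True, False], repeat=len(ATOMS)))
    return [v for v in assignments if all(f(v) for f in formulas)]

def entails(formulas, goal):
    return all(goal(v) for v in models(formulas))

def maximal_consistent_subsets(beliefs, a):
    """Index sets of beliefs that are maximal among those consistent with a."""
    found = []
    for size in range(len(beliefs), -1, -1):  # largest subsets first
        for idx in combinations(range(len(beliefs)), size):
            s = set(idx)
            if any(s < bigger for bigger in found):
                continue  # strictly inside a larger consistent subset
            if models([beliefs[i] for i in idx] + [a]):
                found.append(s)
    return found

def ginsberg_would(beliefs, a, b):
    """A > B holds iff every maximal subset, with A added, entails B."""
    return all(entails([beliefs[i] for i in s] + [a], b)
               for s in maximal_consistent_subsets(beliefs, a))

no_circle = lambda v: not v["circle"]
rule = lambda v: (not v["circle"]) or v["triangle"]  # circle -> triangle
no_triangle = lambda v: not v["triangle"]
circle = lambda v: v["circle"]
triangle = lambda v: v["triangle"]

holds_without_fact = ginsberg_would([no_circle, rule], circle, triangle)
holds_with_fact = ginsberg_would([no_circle, rule, no_triangle], circle, triangle)
print(holds_without_fact, holds_with_fact)
```

The second call shows how sensitive the approach is to which formulae are listed as beliefs: once "there is no triangle" is an explicit belief, one maximal set keeps it and drops the rule, so the conditional no longer holds in every set.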
The counterfactual conditional is the basis of experimental methods for establishing causality in the natural and social sciences, e.g., whether taking antibiotics helps cure bacterial infection. For every individual, u, there is a function that specifies the state of u's infection under two hypothetical conditions: had u taken antibiotic and had u not taken antibiotic. Only one of these states can be observed, since the other one is literally "counter factual." The overall effect of antibiotic on infection is defined as the difference between these two states, averaged over the entire population. If the treatment and control groups are selected at random, the effect of antibiotic can be estimated by comparing the rates of recovery in the two groups.
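A small simulation can make the averaging argument concrete. This is a sketch with invented numbers: the recovery probabilities (0.8 with the antibiotic, 0.5 without), the population size, and the variable names are all assumptions. The simulation also does something no real study can do, namely record both hypothetical states for every individual, so that the randomized estimate can be compared with the quantity it targets.

```python
import random

rng = random.Random(0)  # fixed seed so the run is repeatable
N = 20_000

# For each individual u, store both hypothetical states:
# (recovers if treated, recovers if untreated). Probabilities are made up.
population = [(rng.random() < 0.8, rng.random() < 0.5) for _ in range(N)]

# The overall effect: difference between the two states, averaged over everyone.
true_effect = sum(y_treated - y_untreated
                  for y_treated, y_untreated in population) / N

# A randomized experiment reveals only one state per individual.
treated, control = [], []
for y_treated, y_untreated in population:
    if rng.random() < 0.5:
        treated.append(y_treated)      # the untreated state stays unobserved
    else:
        control.append(y_untreated)    # the treated state stays unobserved

estimated_effect = sum(treated) / len(treated) - sum(control) / len(control)
print(round(true_effect, 3), round(estimated_effect, 3))
```

Because assignment is random, the two groups are comparable on average, and the difference in observed recovery rates approximates the average of the individual-level differences.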
The tight connection between causal and counterfactual relations has prompted Judea Pearl (2000) to reject both the possible world semantics and those of Ramsey and Ginsberg. The latter was rejected because causal information cannot be encoded as a set of beliefs, and the former because it is difficult to fine-tune Lewis's similarity measure to match causal intuition. Pearl defines counterfactuals directly in terms of a "structural equation model": a set of equations in which each variable is assigned a value that is an explicit function of other variables in the system. Given such a model, the sentence "Y would be y had X been x" (formally, X = x > Y = y) is defined as the assertion: if we replace the equation currently determining X with a constant X = x, and solve the set of equations for variable Y, the solution obtained will be Y = y. This definition has been shown to be compatible with the axioms of possible world semantics and forms the basis for causal inference in the natural and social sciences, since each structural equation in those domains corresponds to a familiar causal mechanism that can be meaningfully reasoned about by investigators.
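The replace-and-solve definition can be sketched for a toy model. Everything specific here is an assumption made for illustration: the two equations, the background variables `u_takes` and `u_immune`, and the helper names `solve` and `intervene`. The sketch also assumes the background variables are known and that the equations are listed in causal order, so solving reduces to evaluating them one after another.

```python
from typing import Callable, Dict

Values = Dict[str, bool]
Equations = Dict[str, Callable[[Values], bool]]

def solve(equations: Equations, background: Values) -> Values:
    """Evaluate each equation in listed order, starting from the background."""
    values = dict(background)
    for name, equation in equations.items():
        values[name] = equation(values)
    return values

def intervene(equations: Equations, name: str, constant: bool) -> Equations:
    """Replace the equation determining `name` with the constant value."""
    modified = dict(equations)  # the original model is left untouched
    modified[name] = lambda _values: constant
    return modified

# Hypothetical model: X = antibiotic, Y = recovered.
equations: Equations = {
    "antibiotic": lambda v: v["u_takes"],
    "recovered": lambda v: v["antibiotic"] or v["u_immune"],
}
background = {"u_takes": False, "u_immune": False}

actual = solve(equations, background)
counterfactual = solve(intervene(equations, "antibiotic", True), background)

print(actual["recovered"], counterfactual["recovered"])
```

In this toy model the individual did not take the antibiotic and did not recover, while solving the modified equations gives recovery, so "the patient would have recovered had they taken the antibiotic" comes out true for this individual.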