(PDF) Human reasoning with imprecise pro

5th International Symposium on Imprecise Probability: Theories and Applications, Prague, Czech Republic, 2007 Human reasoning with imprecise probabilities: Modus ponens and denying the antecedent Niki Pfeifer Gernot D. Kleiter Department of Psychology, Department of Psychology, University of Salzburg, University of Salzburg, Austria Austria

[email protected] [email protected]

Abstract The modus ponens (A → B, A ∴ B) is, along with modus tollens and the two logically not valid terparts denying the antecedent (A → B, ¬A ∴ ¬B) and affirming the consequent, the argument form that was most often investigated in the psychol- ogy of human reasoning. The present contribution reports the results of three experiments on the proba- bilistic versions of modus ponens and denying the an- tecedent. In probability logic these arguments lead to conclusions with imprecise probabilities. In the modus ponens tasks the participants inferred probabilities that agreed much better with the coher- ent normative values than in the denying the an- tecedent tasks, a result that mirrors results found with the classical argument versions. For modus po- nens a surprisingly high number of lower and up- per probabilities agreed perfectly with the conjugacy property (upper probabilities equal one complement of the lower probabilities). When the probabilities the premises are imprecise the participants nore irrelevant (“silent”) boundary probabilities. The results show that human mental probability logic is close to predictions derived from probability logic for the most elementary argument form, but has consid- erable difficulties with the more complex forms involv- ing negations. Keywords. Mental probability logic, modus ponens, coherence, imprecise probabilities 1 Introduction While there is a long tradition of probabilistic proaches in human judgment and decision mak- ing [10], only recently probabilistic approaches are adopted in the psychology of reasoning [16, 17, 7, 19, 12, 14, 18, 27, 26, 29]. Traditionally, classical logic dominated the psychology of human reasoning [6, 28, 2]. Classical logic was the normative standard interpret common sense conditionals, represent the premises of everyday life arguments and draw infer- ences by coherent manipulations of mental represen- tations. Why a competence theory? Many investigations on cognitive processes report errors, fallacies, or biases. Well known are perceptual illusions, biases in judg- ment under uncertainty, or errors in deductive rea- soning. While these phenomena may be startling and stimulating in the scientific process, they do not lead to theories that explain human performance in a sys- tematic way. Collecting slips of the tongue does not lead to a theory of speaking. Psycholinguistics guishes performance and competence. Competence describes what functions a cognitive system can com- pute. Human reasoning can solve complex problems and perform sophisticated inferences. While develop- ing a theory of reasoning one should have the explana- tion of these processes in mind. One should strive for a competence theory. The distinction between com- petence and performance was introduced by Noam Chomsky [3]. The analogy to deductive reasoning is obvious. The emphasis on the function a cognitive system should compute is due to David Marr [15]. Mental probability logic claims that the common sense conditionals are represented as subjective probabilities. Based on the available information, the premises are evaluated and represented by coherent precise (point) probabilities, coherent imprecise prob- abilities, or logical information. Coherent imprecise probabilities can be represented by coherent interval probabilities or second order probability distributions. Human reasoning is a mental process that forms new representations from old ones by using probabilistic versions of formal inference rules. We assume that a certain core set of probabilistic inference hard wired in the human inference engine. The normative standard of mental probability logic is based on coherence. Coherence is the key concept in the tradition of subjective probability theory. originally developed by de Finetti [5]. More recent work includes [31, 13, 4, 8]. A probability assessment is coherent1 if it does not admit one or more bets with sure loss. Coherence provides an adequate normative foundation for the mental probability logic many psychologically plausible advantages compared with classical concepts of probability: • In the framework of coherence a complete Boolean algebra is is not required for probabilistic ence. Full algebras are psychologically unrealis- tic as they can neither be unfolded in working 1 Throughout we use “coherent” as tally coherent” [9]. The imprecise complement denying the antecedent has the form: P (B|A) ∈ [x′ , x′′ ], P (¬A) ∴ P (B) ∈ [x′ (1 − Equations (1)-(4) may be obtained by natural exten- sion [31], likewise, by de Finetti’s Fundamental The- orem [5] or by Lad’s generalized version [13]—or by elementary probability theory (for a demonstration see [23]). In the psychological literature, the non-probabili versions of the modus ponens and the denying the an- tecedent were studied extensively. Meta-analytical results show that the modus ponens is endorsed by 89-100% of human subjects [6]. The denying the antecedent is endorsed by 17-73% of the subjects [6]. We do not judge the 17-73% of subjects as ir- rational. Rather, we propose to reinterpret the data in the light of mental probability logic. The question is not whether the human subjects endorse the non- probabilistic denying the antecedent, but whether they infer coherent probabilities from the premises of the imprecise denying the antecedent. In the next sections we present empirical data on the four impre- cise argument forms (1)-(4). 2 Experiment 1 2.1 Method and Procedure Thirty students of the University of Salzburg partici- pated in Experiment 1. No students with special log- ical or mathematical education were included. Each participant received a booklet containing a gen- eral introduction, one example explaining the re- sponse modality with point percentages, and one ex- ample explaining the response modality with interval percentages. Three target tasks were presented on separate pages. Eight additional target tasks were presented in tabular form. The first three modus po- nens target tasks had the following form: Please imagine the following situation. Several cars are parked on a parking lot. About these cars we know the following: Exactly 80% of the red cars on this parking lot are two-door-cars. Exactly 90% of the cars on this parking lot are red cars. Imagine all the cars that are on this parking lot. How many of these cars are two-door-cars? Then, the participants were informed that P1 P2 clb cub lbr ubr 80 90 72 82 75.80 (5.16) 82.60 (8.33) 20 40 8 68 14.60 (11.15) 55.60 (30.24) 60 90 54 64 51.93 (10.13) 63.87 (14.21) 40 40 16 76 25.13 (12.65) 64.07 (35.43) 80 70 56 86 53.00 (21.91) 71.20 (27.26) 20 60 12 52 18.60 (13.37) 46.27 (23.59) 100 100 100 100 100.00 (0.00) 100.00 (0.00) 60 70 42 72 48.00 (15.08) 67.60 (22.04) 40 60 24 64 33.27 (13.88) 59.47 (23.55) 70 80 56 76 61.33 (13.80) 76.73 (12.40) 30 50 15 65 20.67 (12.80) 51.33 (30.32) Table 2: Mean lower (lbr) and mean upper bound responses (ubr) of the modus ponens tasks of Exper- iment 1 (n1 = 15). The standard deviations are in parenthesis. P1 and P2 denote the percentages in the premises. clb and cub denote the normative/coherent lower and upper bounds, respectively. the participants. All participants were tested individ- ually in a quiet test room in the department. They were told to take as much time as they wanted. In case of questions, the they were asked to reread the instructions carefully. 2.2 Results and Discussion Table 2 lists the probabilities presented in the premises, the normative lower and upper bounds, and the participants’ mean lower and upper bound re- sponses for the modus ponens tasks. In the task with certain premises (i.e., “100%” in both premises) all fif- teen participants responded with point value which is normatively correct. Table 3 lists the respondents’ mean lower and up- per bound responses for the denying the antecedent tasks. In the task with certain premises four of the fifteen participants responded with the unit interval 0-100%. These four participants clearly understood that the denying the antecedent is probabilistically not informative if all premises have probabilities equal to 1. One participant responded with the point value 100 and one with the point value 50%. The majority (nine out of the fifteen participants) responded the point value 0%. In the modus ponens tasks on the average 30% of the responses were point values (the task with the certain premises not included). In the denying the antecedent tasks on the average 23% of the responses were point values (the task with the certain premises was not averaged). In both conditions the standard deviations are high. quencies in the denying the antecedent tasks. The data of Task 1, 3, 5, 6, 7, and 10 were not aver- aged. The normative upper bounds of Task 1, 3, 6, and 7 are ≥ 90%. The normative lower bounds of Task 2, 5, 7 and 10 are ≤ 10%. Figure 2 shows that in the denying the antecedent tasks the par- ticipants responded with more incoherent intervals than in the modus ponens tasks. More coherent in- terval responses were observed in the modus ponens tasks (62.93% of the participants) compared with the denying the antecedent tasks (41.33% of the partic- ipants). In the modus ponens condition, the mean responses agree very well with the normative lower (rLBR,CLB = .99) and upper (r(UBR,CUB) = .92) probabili- ties. The good agreement remains when the lower (r(LBR,CLB).P 1 = .91) and upper (r(UBR,CUB).P 2 = .95) percentages in the premises are partialled out. Partialling out the values contained in the tasks reduces the possible influence of anchoring and/or matching effects. In the denying the antecedent condition we ob- served a different pattern. While the mean re- sponses still agree well with the normative lower (r(LBR,CLB) = .76) probabilities, the correlation is slightly negative for the upper (r(UBR,CUB) = −.20) probabilities. Partialling out Premise 1 and Premise 2 reduces the correlations to r(LBR,CLB).P1 = .25 and r(UBR,CUB).P2 = .03. The results may be explained by assuming that the participants just respond with values close to those contained in the description of the tasks (known as “matching heuristic”). It is well known that logical tasks involving nega- tions are difficult. In probabilistic inference we consider the correlations between the probabil- ities of the premises and the normative lower and upper probabilities of the conclusions. For the set of our modus ponens tasks the four correlations are all positive, r(P1 ,CLB) = .97 and r(P2 ,CLB) for the lower probabilities, and r(P1 ,CUB) = .86 and r(P2 ,CUB) = .51 for the upper ones. For the denying the antecedent tasks (with the identical numerical probabilities of the premises!) the lower bound correlations are highly negative, r(P1 ,CLB) = −.92 and r(P2 ,CLB) = −.93, and for the upper bounds positive, r(P1 ,CUB) = .24 and r(P2 ,CUB) = .62. The weighting and integration of affirmative and non-affirmative information makes tasks like the denying the antecedent especially ficult. We note that linear regression predicts lower probabilities better than upper ones. The overall conclusion of Experiment 1 is that the re- sponses of the participants in the modus ponens con- Schema Task 1 Task 2 UA a b c - 2 0 0 1 0 UW d e - - 13 - 3 UB f - - - - - 5 LB LW LA LB LW LA LB LW LA Task 3 Task 4 Task 5 UA 0 1 0 0 3 0 0 4 0 UW 0 14 - 5 6 - 2 9 - UB 0 - - 1 - - 0 LB LW LA LB LW LA LB LW LA Task 6 Task 7 Task 8 UA 0 2 0 - - - 0 4 0 UW 6 6 - - 15 - 4 7 - UB 1 - - - - - 8 LB LW LA LB LW LA LB LW LA Task 9 Task 10 Task 11 UA 0 2 0 0 2 0 0 2 0 UW 5 7 - 3 10 - 6 5 - UB 1 - - 0 - - 2 LB LW LA LB LW LA LB LW LA Table 5: Frequencies of the interval responses denying the antecedent condition (n2 = 15). For explanation of the schema see Table 4. 0.00% 26.67% 3.73% 3.73% 2.93% 62.93% 0% coherent interval Figure 1: Averaged interval response frequencies nine selected modus ponens tasks (see text, 0.00% 30.67% 16.00% 12.00% 0.00% 41.33% 0% coherent interval Figure 2: Averaged interval response frequencies over five selected denying the antecedent tasks (see text, n2 = 15). P1 P2 clb cub lbr ubr modus ponens 100 100 100 100 100.00 (.00) 100.00 (.00) 70 90 63 73 62.43 (11.77) 69.17 (9.71) 70 50 35 85 42.5 (15.13) 54.83 (21.19) complement modus ponens 100 100 .00 .00 .00 (.00) .00 (.00) 70 90 27 37 35.40 (16.73) 42.03 (17.45) 70 50 15 65 41.00 (18.82) 53.67 (17.71) denying the antecedent 100 100 .00 100 37.37 (47.53) 85.00 (35.11) 70 20 20 44 18.63 (15.25) 41.63 70 50 15 65 25.4 (21.12) 59.23 (20.73) compl. denying the antecedent 100 100 .00 100 0.83 (4.56) 53.33 (49.01) 70 20 56 80 51.9 (19.12) 75.87 (20.19) 70 50 35 85 32.70 (12.92) 65.17 (27.43) Table 6: Mean lower lbr and mean upper bound re- sponses ubr of the modus ponens condition (n1 = 30) and of the denying the antecedent condition (n2 = 30) of Experiment 2. The standard deviations are in parenthesis. P1 and P2 denote the percentages in the premises. clb and cub denote the normative/coherent bounds. tecedent tasks. The order was reversed for the other fifteen participants. The premises and the conclu- sions were adopted accordingly. The only difference in the percentages between the modus ponens condi- tion and the denying the antecedent condition was, that “90%” was replaced by “20%”. The reason for this was to avoid non-informative assessments. 3.2 Results and Discussion The results of t-tests indicate that there were no po- sition effects. We therefore pooled the data in the modus ponens condition (n1 = 30) and in the deny- ing the antecedent condition (n2 = 30). Table 6 lists the probabilities presented in the premises, the normative lower and upper bounds, and the participants’ mean lower and upper bound re- sponses for the modus ponens tasks and the comple- ment modus ponens tasks of Experiment 2. In the modus ponens tasks with certain premises (“100%” in both premises) all thirty participants re- sponded with that point value 100%, which is norma- tively correct. In the according complement modus ponens tasks all thirty participants responded rectly with the point value 0.00%. Thus, in Task 1 all participants inferred (correctly) point values. In the other tasks (both modus ponens and complement modus ponens tasks Task 2 Task 3 Coh. Bounds UA 0 4 1 0 0 0 Task 2: UW 2 22 - 1 29 63–73 UB 1 - - 0 - - Task 3: LB LW LA LB LW LA 35–85 complement modus ponens tasks Task 2 Task 3 Coh. Bounds UA 0 0 6 0 2 1 Task 2: U W 5 19 - 0 27 27–37 UB 0 - - 0 - - Task 3: LB LW LA LB LW LA 15–65 denying the antecedent tasks Task 2 Task 3 Coh. Bounds UA 3 4 0 0 5 0 Task 2: U W 6 16 - 7 18 20–44 UB 1 - - 0 - - Task 3: LB LW LA LB LW LA 15–65 complement denying the antecedent tasks Task 2 Task 3 Coh. Bounds UA 1 8 0 1 6 0 Task 2: U W 6 14 - 4 18 56–80 UB 1 - - 1 - - Task 3: LB LW LA LB LW LA 35–85 Table 7: Frequencies of the interval responses in the modus ponens (n1 = 30) and the denying the an- tecedent condition (n2 = 30) of Experiment 2. For explanation see Table 4. All participants inferred a probability(interval) conclusion C, P (C) ∈ [zC′ , zC′′ ′ ′′ the negated conclusion, P (¬C) ∈ [z¬C the conjugacy principle of the interval responses, we checked for each participant whether (i) zC′ + z¬C ′ ′′ 100%, and whether (ii) z¬C + zC = 100%. In the modus ponens tasks with certain premises, all participants satisfied both equalities, (i) and (ii). In the tasks with 70% and 90% in the premises sixteen of the thirty participants satisfied both, (i) and (ii). In the tasks with 70% and 50% in the premises fif- teen of the thirty participants satisfied (ii). It is surprising that in the modus ponens tasks more than half of the participants gave intervals that with lower/upper probabilities that exactly add up to 1. In the denying the antecedent tasks with cer- tain premises, twenty of the thirty participants satis- fied both, (i) and (ii). In the tasks with 70 and 20% premises nobody satisfied both, (i) and (ii), eleven satisfied (i), and one satisfied (ii). In the tasks with 70 and 50% ten satisfied both, (i) and (ii). In sum, more additive responses and more coherent interval responses were observed in modus ponens tasks than in the denying the antecedent tasks. It is P1 /P2 ci lbr ubr 60/[75,100∗] [45,70] 44.50 (21.57) 71.78 (20.07) 60/75 [45,70] 46.83 (23.76) 60.20 (16.86) [75,100]/60 [45,100] 43.42 (22.00) 72.38 (22.98) 75/60 [45,85] 46.27 (21.73) 59.90 (17.19) Table 8: Mean lower lbr and mean upper bound re- sponses ubr of the modus ponens tasks (n = 40) in [1]. ∗ denotes the silent bound. The standard deviations are in parenthesis. P1 and P2 denote the percentages in the premises. ci denotes the normative/coherent interval. Thus the participants were sensitive to the silent bounds. They did not understand the irrelevance of the silent bound for the probability propagation the premises to the conclusion. [21] report data on a conjunction problem where in one condition interval-values in the premises were pre- sented. All upper bounds were equal to 100%. In the other condition only corresponding point values were presented. The point values were equal to the lower bounds of the interval condition. Higher mean lower bounds were observed in the interval condition than in the point condition. An explanation for this finding is, that the participants reduced the processing load of the interval valued premises by representing only the means of the lower and upper bounds. Then, of course, the coherent lower bound must be higher. This explanation of the [21] data on the conjunc- tion problem is, however, not applicable to the data from the imprecise modus ponens task. If the sec- ond premise (containing silent bounds) is represented as 88%, then the coherent interval of the conclusion is [53%, 65%]. Assuming that the participants repre- sent “88” instead of the interval “[75, participants’ mean upper bounds should the interval value condition than in the point value condition. An alternative explanation is that higher explicit im- precision (by communicating interval-values in the premises) elicits larger interval responses. It could be that conversational implicatures [11] modulate cumulation of imprecision. The participant assumes by conversational implicature that the experimenter communicates only relevant informations. Thus the silent bound is not understood as irrelevant, rather, the silent bound is understood by the participant hint from the experimenter to add imprecision to the conclusion: to infer wider intervals. [5] B. De Finetti. Theory of probability, volume 1, 2. John Wiley & Sons, Chichester, 1974. Original work published 1970. [6] J. S. B. T. Evans, S. E. Newstead, and Byrne. Human Reasoning. Erlbaum, Hove, 1993. [7] J. S. B. T. Evans, S. H. Handley, and D. E. Over. Conditionals and conditional probability. Jour- nal of Experimental Psychology: Learning, Mem- ory, and Cognition, 29:321–355, 2003. [8] A. Gilio. Probabilistic reasoning under coherence in System P. Annals of Mathematics and Artifi- cial Intelligence, 34:5–34, 2002. [9] A. Gilio and S. Ingrassia. Totally coherent set-valued probability assessments. Kybernetika, 34(1):3–15, 1998. [10] T. Gilovich, D. Griffin, and D. Kahneman, ed- itors. Heuristics and biases. The psychology of intuitive judgment. Cambridge University Press, Cambridge, 2002. [11] H. P. Grice. Logic and conversation. In P. Cole and J. L. Morgan, editors, Syntax and semantics, volume 3: Speech acts. Seminar Press, New York, 1975. [12] P. N. Johnson-Laird, V. Girotto, P. M. S. Legrenzi. Naive probability: A mental model theory of extensional reasoning. logical Review, 106(1):62–88, 1999. [13] F. Lad. Operational subjective statistical meth- ods: A mathematical,philosophical, and historical introduction. Wiley, New York, 1996. [14] I.-M. Liu, K.-C. Lo, and J.-T. Wu. A probabilis- tic interpretation of ‘If—Then’. The Quarterly Journal of Experimental Psychology, 844, 1996. [15] D. Marr. Vision. A computational investigation into the human representation and processing visual information. W. H. Freeman, San Fran- cisco, 1982. [16] M. Oaksford and N. Chater. Conditional proba- bility and the cognitive science of conditional rea- soning. Mind & Language, 18(4):359–379, 2003. [17] M. Oaksford and N. Chater. Bayesian rational- ity: The probabilistic approach to human ing. Oxford University Press, Oxford, 2007. [18] K. Oberauer and O. Wilhelm. The meaning(s) of conditionals: Conditional probabilities, tal models and personal utilities. Journal of Ex- perimental Psychology: Learning, Memory, and Cognition, 29:680–693, 2003. mp cmp da cda P1 : A→B A→B A→B A→B coun- P2 : A A ¬A ¬A Concl.: B ¬B ¬B B L-valid: yes no no no Vi (C) t f ? ? Table 1: Non-probabilistic version of the modus po- nens (mp), denying the antecedent (da), and their respective complementary versions (cmp, and cda). A and B denote propositions. → and ¬ denote the material implication and negation, respectively, and are defined as usual. L-valid denotes logical validity, and Vi denotes the logical valuation-function V of the conclusion C under the interpretation i that assigns t (“true”), to all premises (P1 and P2 ). If the an- tecedents, A, of the conditional premise is false, then the truth value of the conclusion is not determined (denoted by the question mark). s of do not ig- of reference and used as a criterion for the rationality of human inferences. See Table 1 for often investi- gated argument forms in psychology. Traditional psychological research on human reason- ing designates human inference as rational/not ra- tional if it corresponds to logically valid/not valid argument forms. Everyday life situations, though, are inherently uncertain. The uncertainty cannot be captured by classical logic. Reasoning about uncer- tainty and uncertain knowledge is a fundamental hu- man competence. Thus, classical logic cannot be an adequate normative standard of reference for the psy- chology of reasoning. ap- We proposed a psychological theory of human reason- ing, called “mental probability logic” [20, 22, 21, 23, 24, 25], which evaluates the rationality of human rea- soning not by means of logical validity but by means of coherence [9, 8, 4]. Mental probability logic is a psychological competence theory about how humans memory nor be stored in the long term memory. Mental probability logic suggests that humans try to keep the memory load as small as possible and process only relevant informations (see also [30]). • Conditional probability, P (B|A), is a primitive notion. The probability values are assigned di- rectly. Conditional probability is not “defined”— as in probability textbooks—via the fraction of the “joint”, P (A∧B), and the “marginal”, P (A), probabilities.2 Conditional probabilities are di- rectly encoded or just directly connected to the arguments of the if–then relation. distin- • Because lack of knowledge (time, effort) it may be impossible for a person to assign precise probabil- ities to an event. If a person is uncertain about probabilities, then mental probability logic sup- poses that human subjects make coherent impre- cise probabilistic assessments (by interval-valued probabilities or by second order probability dis- tributions). • Coherence is in the tradition of subjective proba- bility theory in which probabilities are conceived as degrees of belief. Degrees of belief are naturally affine to psychology. conditional Imprecise versions of the argument forms presented in Table 1 are formalized by interval probabilities [23] or by second order probability distributions [25]. In the present study we focus on interval probabilities only. We now present imprecise versions of the four argu- ment forms of Table 1. While only the modus ponens is logically valid, all four argument forms admit to infer coherent probability intervals for the conclusion. The imprecise version of the modus ponens has the rules are form: P (B|A) ∈ [x′ , x′′ ] , P (A) ∈ [y ′ , y ′′ ] ∴ P (B) ∈ [x′ y ′ , 1 − y ′ + x′′ y ′ ] . (1) It was Since P (¬C) = 1−P (C) (conjugacy principle [31]) the complement of an interval [l, u] is [1 − u, 1 − l], it triv- ially follows that the imprecise complement modus ponens has the form: and has P (B|A) ∈ [x′ , x′′ ] , P (A) ∈ [y ′ , y ′′ ] ∴ P (¬B) ∈ [y ′ − x′′ y ′ , 1 − x′ y ′ ] . (2) The imprecise denying the antecedent has the form: infer- P (B|A) ∈ [x′ , x′′ ], P (¬A) ∈ [y ′ , y ′′ ] ∴ P (¬B) ∈ [(1−x′′ )(1−y ′′ ), 1−x′ (1−y ′′ )] (3) synonymous with “to- 2 The definition P (B|A) = df. P (A ∧ B)/P (A) is problematic if P (A) = 0. tion is either a point percentage or a percentage be- tween two boundaries (from at least . . . to at most . . .). The booklet offered two response modalities where the ∈ [y ′ , y ′′ ] participants had to choose one deliberately. y ′′ ), x′′ + y ′′ − x′′y ′′ ] (4) Response Modality 1: If you think that the correct answer is a point per- centage, please fill in your answer here: Exactly . . . . . .% of the cars on this parking lot are two-door-cars. Point percentage: stic |——————————————| 0 25 50 75 100 % Response Modality 2: If you think that the correct answer lies within two boundaries (from at least . . . to at most . . .), please mark the two values here: At least . . . . . .% and at most . . . . . .% of the cars on this parking lot are two-door-cars. Within the bounds of: |——————————————| 0 25 50 75 100 % The subsequent two tasks were formulated accord- ingly. In the second task the numerical values in two premises were 20 and 40%, and in the third task 60 and 90%, respectively. Each task was on a separate page. After the third task the participants answered eight analogous tasks presented on one page in tab- ular form. Again, the cover story was kept constant, only the numerical values contained in the premises varied (see Table 2). The thirty participants were divided into two groups, fifteen participants received the modus ponens tasks, as just described, and fifteen participants received denying the antecedent tasks. The denying the an- tecedent tasks were formulated exactly as the modus ponens tasks, with two differences. First, a negation was added in the second premise: “Exactly 90% of the cars on this parking lot are not red cars”. Sec- ond, a negation was added to the question: “How many of these cars are not two-door-cars?”. Both re- sponse modalities were adopted accordingly. Adding the negations to the second premise and to the conclu- sion as just described clearly reflects the form of the corresponding imprecise denying the antecedent. In both conditions, we presented the same percentage numbers to the participants. The booklets were mixed and assigned arbitrarily to the solu- P1 P2 clb cub lbr ubr 80 90 2 92 27.13 (33.58) 64.13 (37.94) 20 40 48 88 36.07 (27.71) 62.80 (31.97) 60 90 4 94 22.47 (22.17) 73.00 (29.30) 40 40 36 76 33.87 (20.87) 62.67 (23.25) 80 70 6 76 22.93 (23.20) 55.27 (34.74) 20 60 32 92 29.07 (17.90) 66.53 (24.65) 100 100 0.00 100 10.00 (28.03) 36.67 (48.06) 60 70 12 82 28.40 (26.26) 60.93 (30.39) 40 60 24 84 22.53 (17.01) 59.07 (23.71) 70 80 6 86 19.93 (23.91) 54.93 (31.25) 30 50 35 85 27.87 (20.76) 62.67 (25.13) Table 3: Mean lower lbr and mean upper bound re- sponses ubr of the denying the antecedent tasks of Experiment 1 (n2 = 15). The standard deviations are in parenthesis. P1 and P2 denote the percent- ages in the premises. clb and cub denote the norma- tive/coherent lower and upper bounds, respectively. This may be a consequence of the explicit presentation of the point value response modality. The participants could actually have some imprecise value in mind, but nevertheless respond with a representative point value just to reduce the complexity of the task. Such point value responses bias the mean lower and upper bound responses. To avoid constantly pointing explicitly to the possibility to give an interval value response we dropped the response point response modality in Ex- periment 2. Dropping the point response modality forces the participants to respond by intervals while still allowing point value responses by equating the lower and the upper bound responses. of 100%, Table 4 reports the frequencies of interval response categories of the modus ponens condition in 3 × 3 tables. Each table contains the six possible inter- val responses together with the according empirical frequencies of the interval responses. The columns designate whether the participants’ lower bounds are below (LB), within (LW ), or above (LA) the norma- tive intervals. The rows designate whether the upper bounds are above (U A), within (U W ) or below (U B) the normative intervals. Table 5 reports the frequencies of interval response with categories of the denying the antecedent condition in 3×3 tables. Figure 1 presents the averaged interval response frequencies in the modus ponens tasks. The data of Task 2 and of Task 7 were not averaged. The normative lower bound of Task 2 is ≤ 10%. Both normative bounds of task 7 are equal to 100%. Figure 1 shows that in the modus ponens tasks the majority of the participants gave coherent interval responses. Figure 2 presents the averaged interval response fre- Schema Task 1 Task 2 UA a b c 0 4 1 0 4 0 UW d e - 0 10 - 0 11 - UB f - - 0 - - 0 - - LB LW LA LB LW LA LB LW LA Task 3 Task 4 Task 5 UA 0 5 0 0 5 0 0 2 0 UW 2 7 - 0 9 - 1 10 - UB 1 - - 1 - - 2 - - LB LW LA LB LW LA LB LW LA Task 6 Task 7 Task 8 UA 0 4 1 - - - 0 4 1 UW 1 9 - 0 15 - 0 9 - UB 0 - - 0 - - 1 - - LB LW LA LB LW LA LB LW LA Task 9 Task 10 Task 11 UA 0 5 0 0 3 1 0 4 0 UW 0 10 - 1 10 - 0 11 - UB 0 - - 0 - - 0 - - LB LW LA LB LW LA LB LW LA Table 4: Frequencies of the interval responses in the modus ponens condition of Experiment 1 (n1 = 15). U A: the participants’ upper bound response is above the normative upper bound, U W : upper bound re- sponse is within the normative interval, U B: upper bound response is below the normative lower bound; LA, LW , and LB: same for the participants’ lower bound responses. a: too wide interval responses, b: lower bound responses coherent, c: both bound re- sponses above, d : upper bound responses coherent, e: both bound responses coherently within ±5% (bold), f : both bound responses below the normative lower bounds. tasks dition are very close to the normative values while in the denying the antecedent condition the responses might be explained by matching based guessing. = .92 The presence of the negations in the denying the an- tecedent is a possible explanation, why there were less coherent interval responses compared with the modus ponens tasks. It is easier to cognitively rep- resent an affirmed than a negated proposition. An affirmed proposition can be visualized, for example, more directly than a negated one. Classical modus ponens was proposed as a basic and “hard wired” in- ference rule [28, 2]. Probabilistic modus ponens is a similar candidate. dif- In addition to the modus ponens and the denying the antecedent, the respective complementary versions are investigated in Experiment 2. By investigating the complementary versions as well, the presence of the negation is more balanced. 3 Experiment 2 3.1 Method and Procedure 6 - - - Method and procedure of Experiment 2 are analog to Experiment 1. Sixty students of the University of Salzburg participated in Experiment 2. No students with special logical or mathematical education were included. Thirty participants were assigned to the - - modus ponens condition and thirty participants were assigned to the denying the antecedent condition. In the modus ponens condition, each participant worked out three modus ponens tasks and three com- plement modus ponens tasks. To counterbalance - - position position effects, fifteen participants got the modus ponens tasks at the beginning, and fifteen par- ticipants got the modus ponens tasks at the end of the session. The modus ponens tasks had the following form: - - Please imagine the following situation. Around Christmas time a certain ski-resort is very busy. This region is very popular among sportsmen, like in the skiers, snow-boarders, and sledge-rider. Every hour a cable-car brings the sportsmen to the top. About this cable-car we know: Exactly 100% of the skiers wear red caps. Exactly 100% of the sportsmen are skiers. Imagine all the sportsmen in this cable car. How many of these sportsmen wear a red cap? Speaking about a closed room (cable-car) instead of an unspecified parking lot (Experiment 1) should help 100% to represent and visualize the problems. As in Ex- periment 1, participants were free to respond either in terms of point percentages or in terms of interval percentages. In Experiment 2, however, the response over modality 1 (point response) was dropped. The partic- n1 = 15). ipants were informed by two examples at the begin- ning that point values can be given by equating the lower and the upper bounds. All three modus ponens tasks had the same structure. The percentages of the two premises in the first task were 100 and 100%, in the second task were 70 and 90%, and in the third task the percentages were 70 and 50%, respectively. The three complement modus 100% ponens tasks contained the same percentages and dif- fered from the modus ponens task only in one respect: a negation was added to the conclusion (“How many of these sportsman do not wear a red cap?”). The denying the antecedent condition was ana- logue. Fifteen participants first received the three denying the antecedent tasks and then the three complementary versions of the denying the an- modus ponens), between 50 and 60% of the responses were point value responses, which is about double compared with Experiment 1. This result is surprising, since dropping the explicit point value response modality should decrease and not increase the number of point value responses. A pos- sible explanation is that, as for all participants the first task contained certain premises and as all par- ticipants responded by point values in the first task, they simply continued to give point value responses later on. (15.97) In the denying the antecedent tasks with certain premises (i.e., “100%” in both premises) fourteen of the thirty participants inferred a unit interval, [≤ 1, 100]%. Four participants inferred a point value equal to zero, and ten inferred a point value equal to 100%. One participant inferred a point value of 50% and one an interval between 70 and 100%. In the ac- cording complement denying the antecedent tasks fifteen of the thirty participants inferred a unit inter- val, [≤ 1, 100]%. Thirteen responded a point value equal to zero. One participant inferred a [25, 50]% in- terval and one inferred a [0, 50]% interval. Practically half of the participants understood that only a non- informative interval can be inferred if each premise is certain. In the two denying the antecedent tasks with 70 and 20%, and 70 and 50%, in the premises, 0 and 16.67% point value responses were observed, respec- tively. This amount of point value responses is smaller than in Experiment 1. In both according complement denying the antecedent tasks 26.67% point value responses were observed, which is comparable to the results of Experiment 1. Table 7 reports the frequencies of interval response categories of the modus ponens condition and of the denying the antecedent condition in 3 × 3 tables. 80.22% of the participants inferred coherent inter- vals in the modus ponens condition on the average (the tasks with certain premises were not averaged). 55.00% of the participants inferred coherent intervals in the denying the antecedent condition on the av- erage (the tasks with certain premises were not aver- aged). In the modus ponens tasks 85.00% (Experiment 1: 62.93%) of the interval responses were coherent on the average. In the denying the antecedent tasks 56.66% (Experiment 1: 41.33%) of the interval re- sponses were coherent on the average. The improved cor- cover-story explains why more coherent interval re- sponses in Experiment 2 than in Experiment 1 were observed. reasonable that humans are better in argument forms that guarantee high probabilities of the conclusion if each premise is highly probable. If the premises of the modus ponens are certain, then the conclusion is certain. However, if the premises of the denying the antecedent are certain, then the probability of the conclusion is in the unit interval [0, 1]. 4 Experiment 3 This section reports data of an experiment with im- precise probabilities in the premises conducted by Flo- rian Bauerecker [1]. Specifically, we focus on human understanding of what we call “silent bounds”. We call a probability bound b of a premise silent if, and only if, b is irrelevant for the probability propagation from the premise(s) to the conclusion. E.g., in the probabilistic modus ponens y ′′ is silent (y ′′ doesn’t occur in the lower or upper probabilities of the con- clusion, see (1)). Experiment 3 introduces an espe- cially critical test of the claim that human subjects are capable to make coherent probabilistic inferences. Method and procedure of Experiment 3 are analog to Experiment 1. Eighty participants were recruited for investigating questions going beyond the scope of the present study. Therefore, we report selected data on the imprecise modus ponens only (n = 40). The modus ponens tasks were formulated as follows: Please imagine the following situation. Claudia works at blood donation services. She investigates of a to which blood group the donated blood belongs and ], and the probability of whether the donated blood is Rhesus-positive. , z¬C ]. To test Claudia is 60% certain: If the donated ′′ = blood belongs to the blood group 0, then the donated blood is Rhesus-positive. Claudia knows as well that donated blood belongs with more than 75% certainty to the blood group 0. How certain should Claudia be that a recent donated blood is Rhesus-positive? both, (i) and Contrary to Experiment 1 and Experiment 2, the con- ditional premise is here formulated in a if–then form. The cover-story remained constant, only the numbers in the premises varied. Table 8 lists the probabilities presented in the premises, the normative lower and upper bounds, and the participants’ mean lower and upper bound responses for the modus ponens tasks with and without silent bounds. The participants inferred higher upper bounds in the modus ponens task containing silent bounds (M = 71.79) compared with the according task not contain- ing silent bounds (M = 60.20; t(39)=3.53, p=.001). Task 1 Task 2 Coh. Bounds (60/[75,100∗]) (60/75) UA 1 14 0 0 2 4 Task 1: UW 8 16 - 7 25 45–70 UB 1 - - 2 - - Task 2: LB LW LA LB LW LA 45–70 Task 3 Task 4 Coh. Bounds ([75,100]/60) (75/60) UA - - - 0 2 0 Task 3: UW 7 32 - 8 29 45–100 UB 1 - - 1 - - Task 4: LB LW LA LB LW LA 45–85 Table 9: Frequencies of the interval responses in the modus ponens (n = 40) tasks of Experiment 3. Co- herent interval responses are bold (±5% tolerance in- terval). Further explanation in Table 4. 5 Concluding Remarks from We reported three psychological experiments on the probabilistic versions of two prominent argument forms in the framework of mental probability logic. Clearly more coherent responses were observed in modus ponens than in denying the antecedent tasks. Human subjects employ inference rules that guarantee high probability conclusions if the premises are highly probable. Practically no participant in- ferred “too wide” intervals such that the coherent in- tervals are subintervals. While most participants did not understand the irrelevance of the silent bounds in the modus ponens task of Experiment 3 they are not completely “blind” for them. The close agreement of the mean responses and the normative values of the lower probabilities in Experiment 3 is stunning. One may speculate that human subjects are doing better in processing lower than upper probabilities. More than half of the participants responded with lower/upper probabilities that agreed perfectly with the conjugacy principle. 100]”, then the be lower in References [1] F. Bauerecker. Konditionales Schließen mit In- tervallen und Verst¨ andnis f¨ ur das Komplement einer Schlussform. Master’s thesis, Universit¨ at Salzburg, 2006. the ac- [2] M. D. S. Braine and D. P. O’Brien, editors. Men- tal logic. Erlbaum, Mahwah, 1998. [3] N. Chomsky. Aspects of the theory of syntax. MIT Press, Cambridge, 1965. as a [4] G. Coletti and R. Scozzafava. Probabilistic logic in a coherent setting. Kluwer, Dordrecht, 2002. [19] D. E. Over, C. Hadjichristidis, J. S. Evans, S. J. Handley, and S. A. Sloman. The probability of causal conditionals. Cognitive Psychology, 54:62– 97, 2007. R. M. J. [20] N. Pfeifer and G. D. Kleiter. Nonmonotonicity and human probabilistic reasoning. In Proceed- ings of the 6th Workshop on Uncertainty Pro- cessing, pages 221–234, Hejnice, 2003. September 24–27th, 2003. [21] N. Pfeifer and G. D. Kleiter. Coherence and nonmonotonicity in human reasoning. Synthese, 146(1-2):93–109, 2005. [22] N. Pfeifer and G. D. Kleiter. Towards a mental probability logic. Psychologica Belgica, 45(1):71– 99, 2005. [23] N. Pfeifer and G. D. Kleiter. Inference in condi- tional probability logic. Kybernetika, 42:391–404, 2006. [24] N. Pfeifer and G. D. Kleiter. Is human reason- ing about nonmonotonic conditionals probabilis- tically coherent? In Proceedings of the 7th Work- shop on Uncertainty Processing, pages 138–150, Mikulov, 2006. September 16–20th, 2006. Legrenzi, and [25] N. Pfeifer and G. D. Kleiter. Towards a probabil- ity logic based on statistical reasoning. In Pro- Psycho- ceedings of the 11th IPMU Conference (Informa- tion Processing and Management of Uncertainty in Knowledge-Based Systems), pages 2308–2315, Paris, 2006. Edition E.D.K. [26] G. Politzer. Uncertainty and the suppression of inferences. Thinking & Reasoning, 11(1):5–33, 2005. 49(A):828– [27] G. Politzer and G. Bourmaud. Deductive reason- ing from uncertain conditionals. British Journal of Psychology, 93:345–381, 2002. of [28] L. J. Rips. The psychology of proof: Deductive reasoning in human thinking. MIT Press, Cam- bridge, 1994. [29] G. Schurz. Non-monotonic reasoning from an evolution-theoretic perspective: Ontic, logical and cognitive foundations. Synthese, 1-2:37–51, 2005. reason- [30] M. J. Smithson. Human judgment and imprecise probabilities. www.sipta.org/documentation, 1997-2000. men- [31] P. Walley. Statistical Reasoning with Imprecise Probabilities. Chapman and Hall, London, 1991.

(PDF) Human reasoning with imprecise probabilities: Modus ponens and Denying the antecedent