<<

1

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

A GENERAL APPROACH TO TEACHING RANDOM

VARIABLES

Farida Kachapova

School of Computing and Mathematical Sciences Auckland University of Technology Private Bag 92006 Auckland 1142 New Zealand

Email: [email protected]

Abstract

The paper describes a teaching approach in post- courses when initial topics on random variables are taught in general form instead of doing it separately for discrete and continuous cases. Some common student misconceptions about random variables are analyzed and accompanied with counter-examples.

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

2

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

INTRODUCTION

Most research on teaching probability focus on basic probability and statistical applications. The teaching of probability concepts, including random variables, is studied by Larsen (2006), Miller (1998), Leviatan (2003), Ballman (1997), and Hernandez, Albert Huerta and Batanero (2006). Much research has been done on students’ thinking about probability and related misconceptions (see, for example, Ancker, 2006; Rossman, 1995; San Martin, 2006; Warner, Pendergraft, and Webb, 1998). is “... a fundamental stochastic idea” and “... is the support of many probability and subjects” (Hernandez, Albert Huerta and Batanero, 2006). So it is important that the students in probability courses gain a deep understanding of the concept of random variable, which starts with a proper definition. In post-calculus university courses the students have sufficient mathematical maturity to learn correct definitions and derive mathematical theorems from them. As Leviatan (2003) wrote, “The formal axiomatic approach to probability is important as they are the basis for all other probabilistic rules.” The axiomatic foundation developed by Kolmogorov in 1931 (see the English translation: Kolmogorov, 1950). In Section 2 we look at some common students’ misconceptions about events and random variables. In Section 3 we make suggestions about teaching random variables. In most probability courses random variables are taught in two separate chapters containing the material on discrete variables and continuous ones, respectively. So many topics are taught twice. The expectations of discrete and continuous random variables are defined separately, so the students perceive them as two different concepts and miss the general mathematical meaning of the expected value. Besides some theorems that are true for arbitrary random variables, are stated only for the discrete and continuous ones. So instead of losing generality we suggest to use Riemann-Stieltjes integral to introduce the

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

3

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

expectation of a random variable in general form and to develop the theory for arbitrary random variables as long as possible before teaching particular properties of the discrete and continuous variables. This makes the course slightly more advanced, so our suggestions are more appropriate for the university courses in quantitative areas. Our suggestions do not target introductory service courses in statistics, where students have a limited mathematical background and common teaching methods with minimum mathematics are appropriate.

2. COMMON MISCONCEPTIONS

2.1. EVENTS

One of the common misconceptions in probability is the belief that an event is an arbitrary subset of a , which is usually correct in the case of a finite sample space but not in the general case; sometimes Venn diagrams can strengthen this misconception. The misconception can be avoided if we use the axiomatic approach to probability where a subset A of a sample space  is considered an event if its probability P(A) exists (that is an event is a measurable subset of ). All events on the sample space make a -field. Thus, in every is assigned a probability space (, A, P), where  is a sample space, A is a -field and P is a ; then an event is defined as any element of A. Non-measurable sets are not events, since they cannot be assigned . Whether non-measurable sets exist, appears to be a complicated question, and the probability students can be given a brief overview of the topic: 1) an example of non-Borel set on  was constructed by Luzin (1953); 2) Vitali (1905) proved the existence of a set of real numbers that is not Lebesgue measurable, with the assumption of the axiom of choice;

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

4

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

3) Solovay (1970) created a model, where all subsets of  are Lebesgue measurable and the axiom of choice is false; this demonstrates that the existence of non-measurable subsets of  cannot be proven without the axiom of choice. The standard approach in mathematics is to accept the existence of non- measurable sets; therefore in probability theory some sample spaces can have subsets, which are not events.

2.2. RANDOM VARIABLE

Random variable is a fundamental concept in probability theory that is not intuitive, so some students develop misconceptions about it. One common misconception is that a random variable X is any real-valued on the sample space. Students can overcome this misconception by learning the most important characteristic of the random

variable – its distribution function FX :

FX (x) = P(X  x). For the distribution function to exist the probability P(X  x) should be defined for any real number x. Therefore X is a valid random variable only if for any xR the probability measure P(X  x) is defined, that is {X  x} is an event. Mathematically a random variable on the probability space (, A, P) is defined as a function X:    with the property: (x) ({ X  x }A). (1) The condition (1) is essential because the existence of non-measurable sets is accepted in mathematics, as discussed before.

2.1 CONTINUOUS VARIABLE

Some students’ misconceptions in probability relate to continuous random variables, which make a harder topic than discrete random variables. One of the misconceptions is defining a continuous random variable as a variable with a non-countable set of values

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

5

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

(sometimes this set is expected to be an interval or a combination of intervals). This leads to the misconception that any random variable is either discrete or continuous. This misconception might be related to statistics courses where numerical are categorized into two types: discrete and continuous. To help students to overcome these misconceptions we can emphasize the following standard definition. A random variable X is continuous if its distribution function F can be

x expressed as F (x) =  f tdt for some integrable non-negative function f , which is  called the density function of X. A simple example of a mixed random variable can clarify the matter further. Example 1. Mixed random variable. Consider the following function:

 0 if x  0,   0.2x if 0  x 1, Fx    0.2x  0.6 if 1 x  2,   1 if x  2.

u

1 F(x)

0.8

0.2

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. 0 www.hostos.cuny.edu/departments/math/mtrj1 2 x

6

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

Clearly F is non-decreasing, right-continuous and has limits 0 and 1 at  and , respectively. So F is the distribution function of some random variable X. Here P (X = 1) = 0.6. If we assume that X has a density function f (x), then f (x) = F (x) for any real x except points 0, 1 and 2; so 0.6 = P (X = 1)  F(1) =

1 1   f xdx  0.2dx  0.2, which is a contradiction. Therefore X is not continuous  0 though the set of its values is the interval (0, 2) and it is not countable. This random variable is an example of a mixed variable: it is discrete at point 1 (since the probability of 1 is 0.6) and is continuous on intervals (0, 1) and (1, 2). Another common students’ misconception is thinking of a continuous random variable as the variable with a continuous distribution function; the condition of continuity of the distribution function is necessary but not sufficient. Any singular random variable can be used as a counter-example here, since the distribution function of

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

7

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

such a variable is continuous and its equals 0 almost surely; therefore the singular random variable does not have a density function and is not continuous. The following is a well-known example of a singular variable (see, for example, Billingsley, 1995); the students need only basic knowledge of calculus to understand the example. Example 2. Singular random variable. We will define a distribution function F as the

Cantor function. For any x[0, 1], denote (x1 , x2 , x3 ,...) the expression of x in base 3,

x1 x2 x3 that is x    ... . Denote k = min {r: xr =1}; if there is no r with xr = 1, 3 3 2 3 3 k signx  1 if a  0, then k = . Define r , where sign a  Fx   r    r1 2  0 if a  0. This is the analytical definition of the . Geometrically it can be described as follows. Divide [0, 1] into three equal parts and define F on the middle part 1 1 2 by F (x)  for x , . Next, divide each of the remaining two intervals into three 2 3 3 1 1 2 equal parts and define F in the middle parts by: F (x)  for x , and F (x) 2 2 9 9

3 7 8  for x , . This process is repeated infinite number of times. For other 2 2 9 9 definitions of the Cantor function see Royden (1988). Define F (x) = 0 for x < 0, and F (x) = 1 for x > 1.

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

8

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

u

1

0.5

0 1/3 2/3 1 x

Clearly F (0) < F (1), the function F is continuous, non-decreasing, and its derivative equals zero almost surely. Hence F is a singular function and is the distribution function of some singular random variable Y. The following theorem also helps the students to understand that discrete and continuous are not the only types of random variables (for details see Feller, 1971). Lebesgue decomposition: any random variable X can be uniquely represented in

the form: X = 1 Xd +2 Xc +3 Xs , where numbers  i  0 (i =1, 2, 3), 1 + 2 + 3 = 1,

and Xd , Xc , Xs are discrete, continuous and singular random variables, respectively.

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

9

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

3. TEACHING SUGGESTIONS

Many textbooks define a random variable and its distribution function in general form, then give most definitions and statements about random variables twice: first in discrete case, then in continuous case. Sometimes even the distribution function and independence of random variables are defined separately for the two cases. This is good for a basic probability course on the secondary level. For a university course we suggest a more holistic teaching approach with emphasis on the general theory for all random variables and with less attention to particular details of the discrete and continuous cases. After defining a random variable X, we recommend to introduce its distribution

1 function FX , SX by SX (x) = P(X > x), and function F X by:

1 F X (u) = inf {x: FX (x)  u}, u[0, 1], with inf  = + by convention. The quantile function is used for simulating continuous distributions and is usually introduced later with other simulation methods but it is more logical to introduce this function and its properties at the beginning of the course. Next, we suggest to introduce Riemann-Stieltjes integral with respect to a distribution function F. In a post-calculus course the students can master this concept without difficulty, because there is a natural analogy with the Riemann integral, where increments of x are replaced with increments of F. For a random variable X with the distribution function F, the expectation is defined by the following Riemann-Stieltjes integral:

 E(X) =  x dFx.  Thus, the students see that a random variable does not have to be discrete or continuous to have an expectation. This is illustrated by calculating the expectation for the aforenamed mixed and singular random variables.

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

10

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

Example 3. Find the expectation E(X) for the mixed random variable X from Example 1. Solution: For the continuous part of the variable the density equals f (x) = F (x) =

1 = 0.2, x (0, 1) (1, 2). So the expectation equals E(X) = 1P(X =1) +  x f x dx + 0

2 2 +  x f x dx = 1 (0.8  0.2) + 0.2x dx = 1. 1 0 The following is a more rigorous proof that demonstrates how the answer can be derived from the general definition of expectation; this can be used in more advanced probability courses. Since the random variable X is bounded, the expectation E(X) exists and

2 E(X) =  x dFx. This integral is a limit of integral sums. 0 For an odd integer n = 2m +1 (m > 0) divide [0, 2] into n equal sub-intervals:

0 = x0 < x1 < ... < xm < 1 < xm+1 < ... < xn = 2.

For i = 0, 1, ..., n 1, denote i = x i+1 and Fi = F(x i+1)  F(xi ). Then for i  m:

Fi = 0.2 xi . (2)

n1 1 2 3 The integral sum equals Sn   i Fi  Sn  Sn  Sn , where i0

m1 n1 1 2 3 Sn  m Fm , Sn   i Fi and Sn   i Fi . i0 im1

1   Here Sn  xm1 F xm1 F xm   1 lim F x lim F x  0.8 – 0.2 = n x1 x1  = 0.6.

m1 m1 1 2 By (2), Sn  i 0.2xi  0.2 i xi  0.2 xdx , the Riemann integral.   n  i0 i0 0

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

11

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

n1 2 3 Similarly, Sn  0.2 i xi  0.2 xdx .  n  im1 1

1 2 2

So E(X)  lim Sn  0.6  0.2 xdx  0.2 xdx  0.6  0.2 xdx  1. n    0 1 0 Similarly it can be shown that for the singular random variable Y from Example 1 2, the expectation E(Y)  . This also follows from the symmetry of the distribution 2  1 1  function with respect to point  ,  .  2 2  The singular random variable seems to be an artificial mathematical construction. Mixed random variables, on the other hand, have natural applications, in in particular. ‘A mixed type rv frequently encountered in actuarial science is an insurance risk for which there is a probability mass in zero (the probability of non-occurrence of claims), while the claim amount given that a claim occurs is a continuous rv.’ (Denuit, Dhaene, Goovaerts, & Kaas, 2005, p. 17). We illustrate this with the following two examples (for more examples see Kaas, Goovaerts, Dhaene, & Denuit, 2009). Example 4. Consider an insurance payment X against accidental loss with excess of $200 and maximum payment of $2000. Assume that the loss is below $200 with probability 0.08 and above $2200 with probability 0.12, with the uniform distribution in between. Find the distribution and expectation of X. Solution: Clearly P(X =0) = 0.08, P(X =2000) = 0.12, P(0 < X < 2000) = 0.8, and X is uniformly distributed between 0 and 2000. The distribution function of X is given by:  0 if x  0,  Fx   0.0004x  0.08 if 0  x  2000,   1 if x  2000. For the continuous part of the variable the density equals f (x) = F (x) = = 0.0004, x (0, 2000). So the expectation (expected payment) equals E(X) =

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

12

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

2000 2000 =0P(X =0) + 2000P(X =2000) +  x f x dx = 20000.12 + 0.0004x dx = $1040. 0 0

Example 5. Consider an insurance claim of size X, where X equals 0 with probability 0.2 and the distribution of its positive values is proportional to with 0.005. Find the distribution and expectation of X. Solution: Clearly P(X = 0) = 0.2 and P(X > 0) = 0.8, so the distribution function of X is

 0 if x  0, given by: Fx   0.005x 1 0.8e if x  0. For the continuous part of the variable the density equals f (x) = F (x) = = 0.004 e0.005x, x > 0. So the expectation (expected claim size) equals E(X) =

  = 0P(X =0) +  x f x dx = 0.004xe 0.005x dx = $160. 0 0 The properties of expectation can be stated and proved in general form using the properties of Riemann-Stieltjes integral. Next it is logical to introduce in general form through expectation other numerical characteristics of random variables, such as moments, , , covariance, and Pearson correlation. Besides general properties of these characteristics, it is useful to consider Chebyshev’s inequality and Hoeffding equality (Hoeffding, 1940):

  Cov X ,Y  F x, y  F x F y dxdy ,      X ,Y   X   Y    

where FX,Y is the joint distribution function of X and Y, and Cov(X, Y) is their covariance. The Hoeffding equality is not sufficiently known, though it has useful applications, and it is easily stated and proved (see Lehmann, 1966, p. 1139). Finally, formulas can be derived for the expectation of a discrete random variable and a continuous one. In addition to the standard formulas, it is suggested to include these interesting alternative formulas:

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

13

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

 0 E X  S x dx  F x dx for a continuous X and    X    X   0 

 0 E X   PX  k PX  k for a discrete, integer-valued X. k0 k They have quite simple proofs.

4. DISCUSSION

The described approach is not suggested for introductory statistics courses, where students’ mathematical background is limited. Our suggestions are intended for post- calculus probability courses, which are part of university courses in quantitative areas. The students in these courses enjoy the mathematical side of probability and are not entirely focused on statistical applications. A standard one-semester course in calculus provides the students with necessary differentiation and integration skills; these students are familiar with Riemann integral and therefore they can understand Riemann-Stieltjes integral, which is defined similarly, only increments of x are replaced with increments of y. So the expectation of a random variable can be defined in general form through a Riemann-Stieltjes integral. The described general approach to teaching random variables covers all types of variables, not only discrete and continuous ones. This is the way random variables are taught in actuarial science where mixed variables have important applications and the expected value has to be introduced through Riemann-Stieltjes integral. This approach was used by the authors for several years in post-calculus probability courses. This approach helps to avoid tedious repetition of definitions and proofs, and to produce more interesting proofs in the general case. Our case studies show that the described examples of unusual random variables stimulate students’ interest in the subject and their critical thinking. The students get interested what other types of

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

14

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

random variables exist, and the Lebesgue decomposition theorem helps to answer such questions. Many university students have got a basic knowledge of discrete and continuous variables from high school. In university courses we can build on this knowledge, generalize and develop it to a more advanced level. The universal approach to random variables can help students to distinguish between general and type-specific properties of random variables. Together with the aforementioned counter-examples the described teaching approach can also help students to avoid misconceptions and gain a deeper understanding of the basic probability concepts. More advanced counter-examples in probability can be found in the book by Stoyanov (1997). The general teaching approach provides students with an abstract knowledge of random variables, which is important in a variety of applications. In particular, this knowledge can be applied in actuarial science (as shown in examples 4 and 5), so there is no need to study mixed random variables separately as a third type of variables.

References

Ancker, J. (2006). The language of conditional probability. Journal of Statistics

Education, 14(2).

Ballman, K. (1997). Greater emphasis on variation in an introductory statistics course.

Journal of Statistics Education, 5(2).

Billingsley, P. (1995). Probability and Measure (3 ed.). New York: Willey.

Denuit, M., Dhaene, J., Goovaerts, M. J., & Kaas, R. (2005). Actuarial Theory for

Dependent Risks. New York: Willey.

Feller, W. (1971). An introduction to probability theory and its applications (Vol. 2).

New York: Willey.

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

15

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

Hernandez, B. R., Albert Huerta, J., & Batanero, C. (2006). An exploratory study of

students’ difficulties with random variables, International Conference on

Teaching Statistics (ICOTS-7), Brazil.

Hoeffding, W. (1940). Masstabinvariante Korrelationstheorie. Schriften Math. Inst. Inst.

Angew. Math. Univ., 5, 179-233.

Kaas, R., Goovaerts, M. J., Dhaene, J., & Denuit, M. (2009). Modern Actuarial Risk

Theory: Using R (2 ed.). New York: Willey.

Kolmogorov, A. N. (1950). Foundations of the Theory of Probability. New York:

Chelsea.

Larsen, M. D. (2006). Advice for new and student lecturers on probability and statistics.

Journal of Statistics Education, 14(1).

Lehmann, E. L. (1966). Some concepts of dependence. Annals of Mathematical Statistics,

34, 1315-1321.

Leviatan, T. (2003). Conceptual, computational and didactic aspects of teachers’

training in probability and statistics, 54th session of International Statistical

Institute, Berlin, Germany.

Luzin, N. N. (1953). Lectures on analytic sets and their applications. Moscow:

Gosudarstv. Izdat. Tehn.-Teor. Lit.

Miller, T. K. (1998). The random variable concept in introductory statistics, International

Conference on Teaching Statistics (ICOTS-5), Singapore.

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj

16

MATHEMATICS TEACHING-RESEARCH JOURNAL ONLINE VOL 5, N 2 May 2012

Rossman, A. (1995). Conditional probability and education reform: are they compatible?

Journal of Statistics Education, 3(2).

Royden, H. L. (1988). (3 ed.): Prentice-Hall.

San Martin, E. (2006). Piaget’s viewpoint on the teaching of probability: a breaking-off

with the traditional notion of chance?, International Conference on Teaching

Statistics (ICOTS-7), Brazil.

Solovay, R. M. (1970). A model of set-theory in which every set of reals is Lebesgue

measurable. Annals of Mathematics, Second Series, 9, 1-56.

Stoyanov, J. M. (1997). Counterexamples in Probability (2 ed.). England: Wiley.

Vitali, G. (1905). Sul problema della misura dei gruppi di punti di una retta. Bologna:

Tip, Gamberini e Parmeggiani.

Warner, B. A., Pendergraft, D., & Webb, T. (1998). That was Venn, this is now. Journal

of Statistics Education, 6(1).

Readers are free to copy, display, and distribute this article, as long as the work is attributed to the author(s) and Mathematics Teaching- Research Journal On-Line, it is distributed for non-commercial purposes only, and no alteration or transformation is made in the work. All other uses must be approved by the author(s) or MT-RJoL. MT-RJoL is published jointly by the Bronx Colleges of the City University of New York. www.hostos.cuny.edu/departments/math/mtrj