# 11. Probability Distributions - Concepts

## Concept of Random Variable

The term "statistical experiment" is used to describe any process by which several chance observations are obtained.

All possible outcomes of an experiment comprise a set that is called the sample space. We are interested in some numerical description of the outcome.

For example, when we toss a coin 3 times, and we are interested in the number of heads that fall, then a numerical value of 0, 1, 2, 3 will be assigned to each sample point.

The numbers 0, 1, 2, and 3 are random quantities determined by the outcome of an experiment.

They may be thought of as the values assumed by some random variable x, which in this case represents the number of heads when a coin is tossed 3 times.

So we could write x1 = 0, x2 = 1, x3 = 2 and x4 = 3.

## Definitions

1. A random variable is a variable whose value is determined by the outcome of a random experiment.

2. A discrete random variable is one whose set of assumed values is countable (arises from counting).

3. A continuous random variable is one whose set of assumed values is uncountable (arises from measurement.).

We shall use:

A capital X for the random variable and

Lower case x1, x2, x3... for the values of the random variable in an experiment. These xi then represent an event that is a subset of the sample space.

The probabilities of the events are given by: P(x1), P(x2), P(x3), ...

We also use the notation P(X). For example, we may need to find some of the probabilities involved when we throw a die. We would write for the probability of obtaining a "5" when we roll a die as:

P(X=5)=1/6

Continues below

### Example 1 - Discrete Random Variable

Two balls are drawn at random in succession without replacement from an urn containing 4 red balls and 6 black balls.

Find the probabilities of all the possible outcomes.

### Example 2 - Continuous Random Variable

A jar of coffee is picked at random from a filling process in which an automatic machine is filling coffee jars each with 1\ "kg" of coffee. Due to some faults in the automatic process, the weight of a jar could vary from jar to jar in the range 0.9\ "kg" to 1.05\ "kg", excluding the latter.

Let X denote the weight of a jar of coffee selected. What is the range of X?

## Distribution Function

### Definitions

1. A discrete probability distribution is a table (or a formula) listing all possible values that a discrete variable can take on, together with the associated probabilities.

2. The function f(x) is called a probability density function for the continuous random variable X where the total area under the curve bounded by the x-axis is equal to 1. i.e.

int_(-oo)^oo f(x)dx=1

The area under the curve between any two ordinates x = a and x = b is the probability that X lies between a and b.

int_a^bf(x)dx=P(a<=X<=b)

See area under a curve in the integration section for some background on this.

## Probabilities As Relative Frequency

If an experiment is performed a sufficient number of times, then in the long run, the relative frequency of an event is called the probability of that event occurring.

### Example 3

Refer to the previous example. The weight of a jar of coffee selected is a continuous random variable. The following table gives the weight in kg of 100 jars recently filled by the machine. It lists the observed values of the continuous random variable and their corresponding frequencies.

Find the probabilities for each weight category.

Weight X Number
of Jars
0.900 - 0.925 1
0.925 - 0.950 7
0.950 - 0.975 25
0.975 - 1.000 32
1.000 - 1.025 30
1.025 - 1.050 5
Total 100

## Expected Value of a Random Variable

Let X represent a discrete random variable with the probability distribution function P(X). Then the expected value of X denoted by E(X), or μ, is defined as:

E(X) = μ = Σ (xi × P(xi))

To calculate this, we multiply each possible value of the variable by its probability, then add the results.

Σ (xi × P(xi)) = { x1 × P(x1)} + { x2 × P(x2)} + { x3 × P(x3)} + ...

E(X) is also called the mean of the probability distribution.

### Example 4

In Example 1 above, we had an experiment where we drew 2 balls from an urn containing 4 red and 6 black balls. What is the expected number of red balls?

## Variance of a Random Variable

Let X represent a discrete random variable with probability distribution function P(X). The variance of X denoted by V(X) or σ2 is defined as:

V(X) = σ2 = Σ[{X E(X)}2 × P(X) ]

Since μ = E(X), (or the average value), we could also write this as:

V(X) = σ2 = Σ[{X μ}2 × P(X) ]

Another way of calculating the variance is:

V(X) = σ2 = E(X2) − [E(X)]2

## Standard Deviation of the Probability Distribution

sigma=sqrt(V(X) is called the standard deviation of the probability distribution. The standard deviation is a number which describes the spread of the distribution. Small standard deviation means small spread, large standard deviation means large spread.

In the following 3 distributions, we have the same mean (μ = 4), but the standard deviation becomes bigger, meaning the spread of scores is greater.

Normal Curve

μ = 4, σ = 0.5

Normal Curve

μ = 4, σ = 1

Normal Curve

μ = 4, σ = 2

The area under each curve is 1.

### Example 8

Find V(X) for the following probability distribution:

 X P(X) 8 12 16 20 24 1/8 1/6 3/8 1/4 1/12

top

### Online Algebra Solver

This algebra solver can solve a wide range of math problems.

### Math Lessons on DVD

Easy to understand math lessons on DVD. See samples before you commit.