Essential Statistics

Chapter 6: Categorical Data

245

Definition:

A conditional distribution for a (first) variable given another (second)

variable is the percentage of items for the first variable that is contained in

the second variable.

Notes:



The conditional distribution of the row variable, given the column

variable, is obtained by dividing the frequency values in the original

table by the column totals.



This is usually expressed as a percentage.

Notes:



The conditional distribution of the column variable, given the raw

variable, is obtained by dividing the frequency values in the original

table by the row totals.



This is usually expressed as a percentage.

Note:

These conditional distributions will be referred to as

conditional

probabilities

in a later chapter.

Example 6-2:

Compute the conditional distributions for the age

classifications (row) given the degree classifications (column) for the data

given in

Example 6-1

. Use two decimal places.

Solution:

From the original two-way distribution, we need to compute each

frequency entry as a percentage of the respective column totals.

For the class of 29 and younger, the conditional distribution for the entry

4286, given the BS degree classification, will be (4286/21527)



100 % =

19.91%. For the class of 60 and older, the conditional distribution for the

entry 261, given the Ph.D degree classification, will be (261/1026)



100 % =

25.44%. One can continue in this manner to compute the remaining