7.4.6. Do all the processes have the same<br>proportion of defectives?

7. Product and Process Comparisons
7.4. Comparisons based on data from more than two processes

7.4.6. Do all the processes have the same proportion of defectives?

The contingency table approach

Testing for homogeneity of proportions using the chi-square distribution via contingency tables

When we have samples from $n$ populations (i.e., lots, vendors, production runs, etc.), we can test whether there are significant differences in the proportion defectives for these populations using a contingency table approach. The contingency table we construct has two rows and $n$ columns.

To test the null hypothesis of no difference in the proportions among the $n$ populations

$ \mbox{H}_0: \,\,\, p_1 = p_2 = \cdots = p_n $ against the alternative that not all $n$ population proportions are equal

$ \mbox{H}_a: \,\,\, \mbox{Not all } p_i \mbox{ are equal } (i = 1, \, 2, \, \ldots, \, n) \, , $

The chi-square test statistic

we use the following test statistic: $$ \chi^2 = \sum_{\mbox{all cells}} \frac{(f_o - f_c)^2}{f_c} \, , $$ where $f_o$ is the observed frequency in a given cell of a $2 \times n$ contingency table, and $f_c$ is the theoretical count or expected frequency in a given cell if the null hypothesis were true.

The critical value

The critical value is obtained from the $\chi^2$ distribution table with degrees of freedom $(2-1)(n-1) = n-1$, at a given level of significance.

An illustrative example

Data for the example

Diodes used on a printed circuit board are produced in lots of size 4000. To study the homogeneity of lots with respect to a demanding specification, we take random samples of size 300 from 5 consecutive lots and test the diodes. The results are:

	Lot
Results	1	2	3	4	5	Totals

Nonconforming	36	46	42	63	38	225
Conforming	264	254	258	237	262	1275

Totals	300	300	300	300	300	1500

Computation of the overall proportion of nonconforming units

Assuming the null hypothesis is true, we can estimate the single overall proportion of nonconforming diodes by pooling the results of all the samples as $$ \bar{p} = \frac{36 + 46 + 42 + 63 + 38}{5(300)} = \frac{225}{1500} = 0.15 \, . $$

Computation of the overall proportion of conforming units

We estimate the proportion of conforming ("good") diodes by the complement 1 - 0.15 = 0.85. Multiplying these two proportions by the sample sizes used for each lot results in the expected frequencies of nonconforming and conforming diodes. These are presented below:

Table of expected frequencies

	Lot
Results	1	2	3	4	5	Totals

Nonconforming	45	45	45	45	45	225
Conforming	255	255	255	255	255	1275

Totals	300	300	300	300	300	1500

Null and alternate hypotheses

To test the null hypothesis of homogeneity or equality of proportions

$ \mbox{H}_0: \,\,\, p_1 = p_2 = \cdots = p_5 $ against the alternative that not all 5 population proportions are equal

$ \mbox{H}_a: \,\,\, \mbox{Not all } p_i \mbox{ are equal } (i = 1, \, 2, \, \ldots, \, 5) \, , $

Table for computing the test statistic

we use the observed and expected values from the tables above to compute the $\chi^2$ test statistic. The calculations are presented below:

$f_o$	$f_c$	$(f_o - f_c)$	$(f_o - f_c)^2$	$(f_o - f_c)^2 / f_c$

36	45	-9	81	1.800
46	45	1	1	0.022
42	45	-3	9	0.200
63	45	18	324	7.200
38	45	-7	49	1.089
264	225	9	81	0.318
254	255	-1	1	0.004
258	255	3	9	0.035
237	255	-18	324	1.271
262	255	7	49	0.192

				12.131

Conclusions

If we choose a 0.05 level of significance, the critical value of $\chi^2$ with 4 degrees of freedom is 9.488 (see the chi square distribution table in Chapter 1). Since the test statistic (12.131) exceeds this critical value, we reject the null hypothesis.