1. Consider once again the coffee-tea example, presented in Example 10.9. The following two tables are the same as the one presented in Example 10.9 except that each entry has been divided by 10 (left table) or multiplied by 10 (right table).


Table 10.7. Beverage preferences among a group of 100 people (left) and 10,000 people (right).



a. Compute the p-value of the observed support count for each table, i.e., for 15 and 1500. What pattern do you observe as the sample size increases?


P value for table 1=0.5319

P value for table 2=4.104E-10

We observe that as sample size increases p value decreases!

  Coffee No Coffee     Coffee No Coffee  
Tea 15 5 20 Tea 1500 500 2000
No Tea 65 15 80 No Tea 6500 1500 8000
  80 20 100   8000 2000 10000
Expected Expected
  Coffee No Coffee     Coffee No Coffee  
Tea 16 4   Tea 1600 400  
No Tea 64 14   No Tea 6400 1600  
p value 0.531971 p value 4.10453E-10


In excel, we will calculate the expected table for finding out the p value.

Expected table




p value =chitest(observed,expected)

p value for table 1=0.5319

p value for table 2=4.104E-10

we observe that as sample size increases p value decreases

b. Compute the odds ratio and interest factor for the two contingency tables presented in this problem and the original table of Example 10.9. (See Section 5.7.1 for definitions of these two measures.) What pattern do you observe?




c. The odds ratio and interest factor are measures of effect size. Are these two effect sizes significant from a practical point of view?




d. What would you conclude about the relationship between p-values and effect size for this situation?




2. Consider the different combinations of effect size and p-value applied to an experiment where we want to determine the efficacy of a new drug.


(i) effect size small, p-value small

(ii) effect size small, p-value large

(iii) effect size large, p-value small

(iv) effect size large, p-value large


Whether effect size is small or large depends on the domain, which in this case is medical. For this problem consider a small p-value to be less than 0.001, while a large p-value is above 0.05. Assume that the sample size is relatively large, e.g., thousands of patients with the condition that the drug hopes to treat.


a. Which combination(s) would very likely be of interest?



b. Which combinations(s) would very likely not be of interest?



c. If the sample size were small, would that change your answers?


Leave a Reply

Your email address will not be published. Required fields are marked *