Exercises page for Biostats Course VHM 801 at AVC - Fall Semester 2024

Follow this link to datasets (in Minitab (.mtw) and comma-separated (.csv) file formats).
Follow this link to extra exercises (labeled as x:number) and this link to AI discussion exercises.

This page contains links to solutions for selected exercises of VHM 801. Solutions are provided either as text files (to be opened directly in a web browser or in a text editor such as Notepad) or as .pdf files (to be opened in a suitable reader, such as Adobe Acrobat). The solutions have been compiled by the Biostats 801 course instructor, Henrik Stryhn.


Solutions to exercises

(from Supplementary Exercises for IPS7e)
  1. 1, 10, 16, 22, 42, 51, 65, 72, 77, 110, 111, 113, 117, 121, 123, 127, 144, 145,
  2. 2, 7, 11, 12, 27, 28, 48, 57, 59, 60, 67, 69,
  3. 4, 10, 14, 18, 19, 40, 52, 74, 77, 79, 94, 95,
  4. 9, 10, 14, 26, 48, 52, 56, 60, 71, 73, 75, 76, 78, 92, 107, 108, 115, 122, 123,
  5. 7, 9, 14, 33, 40, 47, 49, 51, 53, 54,
  6. 7, 11, 12, 13, 14, 33, 38, 39, 45, 46, 55, 68, 70, 85, 87, 95, 96, 99, 103, 107, 108, 111, 115, 140, 142,
  7. 1, 4, 40, 58, 59, 64, 66, 68, 73, 74, 91, 93, 102, 103, 104, 127, 129, 132, 143, 145,
  8. 1, 21, 33, 62, 84, 85, 98, 103,
  9. 20, 36, 38, 39, 40, 44, 48, 50, 52, 62,
  10. 7, 12, 17, 18, 26, 27, 33, 38, 39, 40,
  11. 6, 15, 16, 17, 45,
  12. 1, 9, 25, 27, 35, 36, 40, 43, 54, 55,
  13. 3, 4, 15, 16, 19, 31,
  14. 7, 8, 17, 18, 19, 33, 37,
  15. 8, 19, 22, 42,
  16. 4, 14,


Stata do-files for exercises

  1. 10, 42, 51, 72, 77, 110, 111, 127, 145,
  2. 2, 7, 11, 12, 27, 28, 48, 57, 59, 60,
  3. 14, 40,
  4. 10, 71,
  5. 7, 40, 49, 53,
  6. 13, 14, 38, 87, 95, 115, 140,
  7. 4, 4, 58, 59, 64, 68, 73, 74, 91, 93, 102, 103, 127, 143, 145,
  8. 62, 84, 85,
  9. 20, 36, 38, 44, 48, 50, 52,
  10. 7, 12, 17, 33, 38, 39,
  12. 35, 43, 54, 55,
  13. 15, 31,
  14. 33, 37,
  15. 8, 19, 22,


R program files for exercises

  1. 10, 42, 51, 72, 77, 110, 111, 127, 145,
  2. 2, 11, 12, 48,
  3. 14, 40,
  4. 10, 71,
  5. 7, 40, 49, 53,
  6. 13, 14, 87, 95, 115, 140,
  7. 4, 4, 58, 59, 64, 68, 73, 74, 102, 103, 127, 143, 145,
  8. 62, 84, 85,
  9. 20, 36, 38, 48, 50,
  10. 7, 17, 38, 39,
  12. 35, 43, 54, 55,
  13. 31,
  14. 33,
  15. 8, 19, 22,


Data files

(.zip archive of all current data files)


Data files (.mtw)


Data files (.csv)


Extra exercises


Solutions to extra exercises:

x:1, x:2, x:3, x:4, x:5, x:6, x:7, x:8 (do-file; R-program), x:9, x:10, x:11, x:12, x:13, x:14, x:15, x:16, x:17, x:18, x:19 (do-file), x:20, x:21 (do-file), x:22 (do-file; R-program), x:23, home assignment 2001:1, final exam 2006:3,


Topics for discussion with Generative AI bot

(use, for example, the OpenAI interface to access ChatGPT and engage in discussion; feel free to post parts of a discussion to the Discussion forum)

# | Topic | Suggested/Template questions
1 | Choice of descriptive statistic | when to use a mean and when to use a median? (follow-up: if computing the median for ordinal categorical data, it may produce a nonsensical value?); what is a good descriptive statistic for spread of a distribution?; is the standard deviation appropriate for skewed distributions?
2 | Outliers | what is an outlier?; are observations indicated by asterisks in a boxplot outliers? (follow-up: what do you mean, are they potential or actual outliers?); should outliers be removed from the data?
3 | Association vs causation | can I conclude a causal effect from an observed effect?; can an experiment prove causation?; does association in a randomized controlled trial imply causation?
4 | Probability (conditional) | A father of two children tells you one of his children is a boy. What is the probability the other child is a girl?; Suppose he then tells you the oldest child is a boy. What is the probability the younger child is a girl? (should these probabilities not be the same?)
5 | Assessing normality | how to decide whether my data are normally distributed?; why not just use a normality test and go by p<0.05?; how to determine that a normal plot is straight enough?; which is the best normality test?
6 | Versions of two-sample t-tests | is it better to use the t-test for paired or independent samples?; is it better to use the two-sample t-test with pooled or separate variances?; does the t-test with separate variances assume the variances to be unequal?
7 | CIs for a proportion | how to compute a confidence interval for a proportion?; does this method always work, or are there conditions for its use?; in which sense is an exact binomial confidence interval exact?; is it not true that an exact binomial confidence interval is conservative, i.e. has theoretically too large coverage?
8 | Chi-square tests | in statistics, what is a chi-square test?; is there not also a chi-square test for homogeneity?; does the chi-square test have assumptions?; must all expected frequencies be larger than 5?
9 | anova (one-way) | in statistics, what is anova?; what assumptions are needed for anova?; what robustness to assumptions does anova have?; is it necessary to test for equal variances?
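
When checking a chatbot's answer to topic 4 (conditional probability), a brute-force enumeration can serve as an independent reference. The following is only a minimal sketch in Python, not part of the course material. It assumes the usual textbook idealization: each child is independently a boy or a girl with probability 1/2, and "one of his children is a boy" is read as "at least one child is a boy". The function name prob_other_is_girl is hypothetical and used for illustration only.

```python
from fractions import Fraction
from itertools import product


def prob_other_is_girl(condition):
    """P(family includes a girl | condition), over equally likely (older, younger) pairs.

    Assumption: the four families BB, BG, GB, GG are equally likely.
    Both conditions used below imply at least one boy, so "the family
    includes a girl" is the same event as "the other child is a girl".
    """
    families = list(product("BG", repeat=2))           # (older, younger)
    kept = [f for f in families if condition(f)]       # families consistent with what we were told
    with_girl = [f for f in kept if "G" in f]
    return Fraction(len(with_girl), len(kept))


# "One of his children is a boy", read as "at least one is a boy"
at_least_one_boy = prob_other_is_girl(lambda f: "B" in f)

# "The oldest child is a boy"
older_is_boy = prob_other_is_girl(lambda f: f[0] == "B")

print(at_least_one_boy)  # 2/3
print(older_is_boy)      # 1/2
```

Under these assumptions the two probabilities differ, because the first statement leaves three equally likely families (BB, BG, GB) while the second leaves only two (BB, BG). A different reading of how the father chose what to tell you can change the first answer, which is a useful point to raise in the discussion.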



Henrik Stryhn (hstryhn@upei.ca) 2024-12-02