Assignment IV for Biostats Course VHM 801 at AVC - Fall Semester 2024

The assignment is worth 15% of the final course mark. Please be aware that by handing in the home assignment you implicitly acknowledge to have read and accepted the instructions for home assignments as described on the VHM 801 homepage.

This assignment focuses on the reporting of statistical analysis in published articles. You are required to select one or several published articles, and answer the questions listed below about the statistical methods reported in the article(s). Your assignment must include a paper (electronic) copy of the article(s) used, including the full reference(s). You are free to select the article(s) as you like, with the following restrictions:

A few obvious journals in which you can search for useful articles are: Canadian Veterinary Journal, American/Canadian Journal of Veterinary Research, American/Canadian Journal of Public Health, Journal of Experimental Biology, Journal of Fish Diseases, Journal of Internal Veterinary Medicine, Animal Welfare; however, you are free to select a journal within your research area.

For Questions 1-4 you should focus on one statistical analysis reported in your selected article. The analysis must be relevant for VHM 801 and must be reported in the article with statistical inference (e.g. by a confidence interval or an indication of statistical significance, such as a P-value). By an analysis "relevant for VHM 801" we understand that it has been covered in Sessions 1-10 of the VHM 801 course; note that this e.g. implies that the only types of ANOVA and regression included are the one-way ANOVA and simple linear regression covered in Sessions 9 and 10 of VHM 801, respectively. (Note: You are not required to explain multiple or all analyses in the article, just the one analysis you are focusing on. It is allowed (though not necessary or recommended) to include two separate analyses for each question, but if your answer includes more than two analyses your mark will be based on your answers for the first two analyses discussed.)

  1. Identify clearly (e.g. by a specific reference to a section or a table in the article) the statistical analysis you are focusing on, and explain its research question/objective/purpose (as well as you can from the information given in the article). Describe the study type and the statistical design for your chosen analysis; make sure to include in your description: the observational/experimental unit, the number of observations, and the variable(s) considered in the analysis.

  2. First, describe the population the statistical inference is intended to be valid for. The second part of this question depends on the study type you have. For a randomized experimental study, describe how the randomization could have been carried out (as recommended in the Bailar & Mosteller (1988) paper); give enough detail to enable another person to reproduce your method. For a non-randomized experimental study that in your view could have been randomized, describe how a randomization could have been carried out (as above). For an observational study, outline instead how a randomized experiment, if feasible, could be carried out to investigate the same research question (assuming there were no restrictions on your access to experimental units etc.). For all other study types and situations, describe how a representative sample from the population was obtained, or how it could be obtained if the actual sample in your view was not representative.

  3. State the statistical model and procedure used to analyze the data, as well as the conclusions drawn from the analysis. Explain any parts of the statistical procedure not covered by VHM 801 (Sessions 1-10); if needed, ask the instructor for guidance. If major parts of the statistical procedure are not relevant for VHM 801 (as defined above), you should choose another analysis to focus on. Review briefly the assumptions behind the analysis.

  4. Give one (critical) question you could ask about the adequacy or validity of the statistical analysis, either as a reviewer of the manuscript or as a member of the audience of an oral presentation of the study. For example, your question could be related to whether one of the key assumptions for the analysis was met. Describe your rationale behind the question. It is important that your question is "reasonable" (and relevant) so you should try to base your question on the information given in the article. Note that if your question is not about the statistical analysis (specifically, its adequacy or validity, as stated above) it will most likely not count. If the answer to your question is given directly in the article or can be obtained from the information provided, you should choose another question. If a partial answer to the question is given or can be obtained, you should explain the partial answer and what additional information you are requesting.

For Questions 5-6 you need to focus on at least one other statistical analysis than the one(s) you used for Questions 1-4, also with relevance for VHM 801, as defined above. It is allowed (and may be preferable) to use the same article (provided it contains multiple analyses), but you may also select another article subject to the same restrictions as described above. Also for each of Questions 5-6 it is allowed to include two analyses, but if your answer includes more than two analyses your mark will be based on your answers for the first two analyses discussed.
  1. Give an example of a statistical analysis that you think is not reported according to the guidelines outlined in Session 8 of VHM 801. You may base your example on either one of the items 1-15 of the Bailar and Mosteller (1988) paper or one of the items/statements in the SAMPL guidelines by Lang and Altman (2013). Describe carefully what the problem is, and how it could be addressed by the authors. Explain to the authors why you think the issue you're bringing up is important (and avoid being overly fussy or petty in your critique, e.g. by focusing on a very minor issue or an issue where the requested information is implicit or understood in the presentation).

  2. Give an example of a statistical analysis that you think most likely is inadequate for the data, or where the conclusion given in the article is inadequate for the actual analysis carried out. Describe carefully what the problem is, and how it could be addressed by the authors.

    As an alternative for this last question, discuss a "conversation" you have had with ChatGPT about a statistical topic relevant for VHM 801 (as defined above) where the advice or views provided by ChatGPT conflict with what you have been taught in the course. You need to document the full conversation with ChatGPT by including a transcript of all messages involved (both from ChatGPT and yourself). Describe carefully what you think the issue is with the message(s) received from ChatGPT, and how it conflicts with what has been taught in VHM 801. Make sure, e.g. by including relevant reference(s) to the course material, that the conflict you describe is not a result of a misunderstanding of the messages from VHM 801.

Henrik Stryhn (hstryhn@upei.ca) 2024-11-18