BCB744 Task E

NoteSelf-Assessment Task E.1
  1. Explain the output of dimnames() when applied to the penguins dataset. (/2)
  2. Explain the output of str() when applied to the penguins dataset. (/3)
NoteSelf-Assessment Task E.2

How would you manually calculate the mean value for the normal_data we generated in the lecture? (/3)

NoteSelf-Assessment Task E.3

Find the faithful dataset and describe both variables in terms of their measures of central tendency. Include graphs in support of your answers (use ggplot()), and conclude with a brief statement about the data distribution. (/10)

NoteSelf-Assessment Task E.4

Manually calculate the variance and SD for the normal_data we generated in the lecture. Make sure your answer is the same as those reported there. (/5)

NoteSelf-Assessment Task E.5

Write a few lines of code to demonstrate that the \((0-0.25]\), \((0.25-0.5]\), \((0.5-0.75]\),\((0.75-1]\) quantiles of the normal_data we generated in the lecture indeed conform to the formal definition for what quantiles are. I.e., show manually how you can determine that 25% of the observations indeed fall below -0.66 for the normal_data. Explain the rationale to your approach. (/10)

NoteSelf-Assessment Task E.6

Why is it important to consider the grouping structures that might be present within our datasets? (/2)

NoteSelf-Assessment Task E.7

Explain the output of summary() when applied to the penguins dataset. (/3)

Reuse

Citation

BibTeX citation:
@online{smit,
  author = {Smit, A. J.},
  title = {BCB744 {Task} {E}},
  url = {https://tangledbank.netlify.app/BCB744/tasks/task_e_blocks.html},
  langid = {en}
}
For attribution, please cite this work as:
Smit AJ BCB744 Task E. https://tangledbank.netlify.app/BCB744/tasks/task_e_blocks.html.