A farmer wonders if his crops grow better in sun or in shade. He measures the amount of fruit gathered from a sample of
Amount of fruit gathered from sunny trees: (
Amount of fruit gathered from shady trees: (
On the same set of axes on grid paper, create a boxplot for each type of tree. Compare the center, shape, spread, and outliers of the amount of fruit from sunny trees with those of the amount from shady trees.
Example boxplots have been provided below.
The blue one is for sunny trees, the red one is for shady trees.
Both data sets have a similar spread, but Q1, the median, and Q3 are all higher for the shady trees. The maximum for the red boxplot is probably an outlier.
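The boxplot comparison above rests on each data set's five-number summary, and "probably an outlier" can be checked with the common 1.5 × IQR rule. The following is a minimal sketch in Python, not the worksheet's own method. The function names and the `example` values are hypothetical (they are not the farmer's measurements), and the quartiles use the "inclusive" interpolation from Python's `statistics` module, which may differ slightly from the quartile convention used in class.

```python
import statistics


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) for a list of numbers."""
    data = sorted(values)
    # quantiles(n=4) returns the three cut points Q1, median, Q3.
    q1, med, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return data[0], q1, med, q3, data[-1]


def outliers_by_iqr(values):
    """Return the values lying more than 1.5 * IQR below Q1 or above Q3."""
    _, q1, _, q3, _ = five_number_summary(values)
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return [v for v in values if v < low_fence or v > high_fence]


# Hypothetical amounts for illustration only, NOT the worksheet's data.
example = [4, 5, 5, 6, 7, 8, 9, 20]

print(five_number_summary(example))
print(outliers_by_iqr(example))
```

Running the same two functions on the sunny and shady lists gives the numbers needed to draw both boxplots and to decide whether the red maximum really falls outside the upper fence.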
The farmer wants to summarize the amount of fruit from each type of orchard with a mean and standard deviation. Is that appropriate? Explain.
The mean and standard deviation are most appropriate when the data are not skewed in one direction and there are no outliers. Is this data skewed? Are there outliers?
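To see why outliers matter for this choice, the sketch below compares the two kinds of summary on one list with and without a single extreme value. It is an illustration under stated assumptions: `summarize` is a hypothetical helper, both lists are made-up values rather than the farmer's data, and the quartiles again use the "inclusive" method from Python's `statistics` module.

```python
import statistics


def summarize(values):
    """Return the mean, sample standard deviation, median, and IQR."""
    q1, med, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "median": med,
        "iqr": q3 - q1,
    }


# Hypothetical amounts for illustration only, NOT the worksheet's data.
without_outlier = [4, 5, 5, 6, 7, 8, 9]
with_outlier = without_outlier + [20]  # one unusually large value added

for label, data in [("without", without_outlier), ("with", with_outlier)]:
    print(label, summarize(data))
```

In this made-up example the single large value pulls the mean and standard deviation much further than it moves the median and IQR, which is the reason the median and IQR are usually preferred for skewed data or data with outliers.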