A farmer wonders if his crops grow better in sun or in shade. He measures the amount of fruit gathered from a sample of
Amount of fruit gathered from sunny trees:
Amount of fruit gathered from shady trees:
On the same set of axes on grid paper, create a boxplot for each type of tree. Compare the center, shape, spread, and outliers of amount of fruit from sunny trees to the amount from shady trees.
Both data sets have similar spread, but Q1, the median, and Q3 are all greater in the shade.
The maximum for the red boxplot is probably an outlier.
The farmer wants to summarize the amount of fruit from each type of orchard with a mean and standard deviation. Is that appropriate? Explain.
Standard deviation is best when the data is not skewed in one direction, and when there are not outliers.
Is this data skewed? Are there outliers?