Graphical Summary

Boxplot

  

Boxplots summarize information about the shape, dispersion, and center of your data. They can also help you spot outliers.

·    The left edge of the box represents the first quartile (Q1), while the right edge represents the third quartile (Q3). Thus the box portion of the plot represents the interquartile range (IQR), or the middle 50% of the observations.

·    The line drawn through the box represents the median of the data.

·    The lines extending from the box are called whiskers. The whiskers extend outward to indicate the lowest and highest values in the data set (excluding outliers).

·    Extreme values, or outliers, are represented by dots. A value is considered an outlier if it is outside of the box (greater than Q3 or less than Q1) by more than 1.5 times the IQR.

Use the boxplot to assess the symmetry of the data:

·    If the data are fairly symmetric, the median line will be roughly in the middle of the IQR box and the whiskers will be similar in length.

·    If the data are skewed, the median may not fall in the middle of the IQR box, and one whisker will likely be noticeably longer than the other.

Example Output

image\grph_3n.gif

Interpretation

In the boxplot of the precipitation data the median is centered in the IQR box, and the whiskers are the same length. This indicates that except for the outlier (asterisk), the data are symmetric. This is a good indication that the outlier may not be from the same population as the rest of the sample data.