Ronald Fisher, an English statistician (1890–1962), collected measurements for a sample of 150 irises. Of concern were five variables: species and four measurements, all in mm: petal width (PW), petal length (PL), sepal width (SW), and sepal length (SL). Sepals are the outermost leaves that encase the flower before it opens. The goal of Fisher’s experiment was to produce a simple function that could be used to classify flowers correctly. A random sample of 30 flowers from his complete dataset is given in the accompanying table.
a. Construct a scatter diagram of petal length, x, and petal width, y. Use different symbols to represent the three species.
b. Construct a scatter diagram of sepal length, x, and sepal width, y. Use different symbols to represent the three species.
c. Explain what the scatter diagrams in parts a and b portray.
Next, let’s see how well a random sample represents the data from which it was selected.
d. Repeat parts a and b using the complete dataset of all 150 of Fisher's irises, given in [EX03-025].
e. Aside from the fact that the scatter diagrams in parts a and b have fewer data points, comment on the similarities and differences between the distributions shown for all 150 irises and for the 30 randomly selected irises.
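
If you are working with software rather than by hand, the diagrams in parts a, b, and d could be produced along the lines of the following Python sketch. It is only one possible approach and makes several assumptions not stated in the exercise: the data have already been entered as a list of dictionaries with the keys "species", "PW", "PL", "SW", and "SL"; the helper names (group_by_species, draw_sample, scatter_by_species) and the marker choices are hypothetical; and the matplotlib library is installed for the plotting step. No data values are included here, since they come from the accompanying table or from [EX03-025].

```python
import random
from collections import defaultdict

# Assumed marker choice: one plotting symbol for each of the three species.
MARKERS = ["o", "s", "^"]


def group_by_species(rows, x_key, y_key):
    """Return {species: ([x values], [y values])} for the chosen pair of variables."""
    groups = defaultdict(lambda: ([], []))
    for row in rows:
        xs, ys = groups[row["species"]]
        xs.append(row[x_key])
        ys.append(row[y_key])
    return dict(groups)


def draw_sample(rows, n=30, seed=None):
    """Draw a simple random sample of n rows without replacement."""
    return random.Random(seed).sample(rows, n)


def scatter_by_species(rows, x_key, y_key, title):
    """Draw one scatter diagram, using a different symbol for each species."""
    # Imported here so the two helpers above can be used without matplotlib.
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    groups = group_by_species(rows, x_key, y_key)
    # zip pairs each species with one marker; only the first three species are drawn.
    for marker, (species, (xs, ys)) in zip(MARKERS, sorted(groups.items())):
        ax.scatter(xs, ys, marker=marker, label=str(species))
    ax.set_xlabel(f"{x_key} (mm)")
    ax.set_ylabel(f"{y_key} (mm)")
    ax.set_title(title)
    ax.legend(title="Species")
    return fig
```

With the sample in a list named rows, part a would correspond to scatter_by_species(rows, "PL", "PW", "Petal length vs. petal width") and part b to scatter_by_species(rows, "SL", "SW", "Sepal length vs. sepal width"). For part d, the same two calls can be made on the full 150-row list. Calling draw_sample on that list shows how a different random sample of 30 might look, which is useful when thinking about part e.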

