Discussing Relationships between Numerical and Categorical Variables
- Begin with context: what does the data represent?
- Compare frequencies between the categories of the categorical dataset.
- Compare the numerical data corresponding to each category on the basis of shape, spread, centre and presence of outliers.
Note: if you cannot remember how to choose appropriate measures for centre and spread, revise the notes for 1.6 Describing Numerical Distributions.