FM Numerical Data

2.1 Response and Explanatory Variables

Explanatory Variable

  • The explanatory variable (EV) is the variable used to explain or predict another variable (the response variable).
  • By convention, the explanatory variable is plotted along the x-axis of a graph, if it is numerical.

Response Variable

  • The response variable (RV) is the variable which is explained or predicted by the explanatory variable.
  • By convention, the response variable is plotted along the y-axis of a graph, if it is numerical.

Note: both explanatory and response variables can be either categorical or numerical variables.

1.5 Basic Statistical Concepts

Mean


  • The mean of a numerical distribution is found by summing up the values of all individual data points, then dividing by the number of data points.
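  • It is represented by either a letter with a bar drawn above it (e.g. \bar{X}) or the Greek letter mu (µ). Strictly, the bar notation denotes the mean of a sample and µ the mean of a whole population: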

\bar{X}=\frac{\sum_{i=1}^{N} x_{i}}{N}

Where N is the total number of data points, and x_{i} represents the i-th data point.

Note: the symbol \Sigma is short for “sum of”, so \sum_{i=1}^{N} x_{i} represents the sum of all individual data points (from data point 1 to data point N).
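
The formula for the mean can be sketched in a few lines of Python. This is only an illustration: the function name `mean` and the sample values in `data` are made up for the example and do not come from these notes.

```python
def mean(values):
    """Return the arithmetic mean: the sum of all values divided by N."""
    n = len(values)  # N, the total number of data points
    if n == 0:
        raise ValueError("the mean of an empty dataset is undefined")
    return sum(values) / n  # sum of x_i for i = 1..N, divided by N


data = [2, 4, 6, 8]  # hypothetical data points
result = mean(data)
print(result)  # 5.0, since (2 + 4 + 6 + 8) / 4 = 20 / 4
```

Python's standard library also provides `statistics.mean`, which gives the same result for this data.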

1.1 Overview of Data Types

Categorical Data

  • Data which is sorted into groups is considered categorical data.

Nominal Data

      • Categorical data with no hierarchy (i.e. no category is “greater than” another) is considered nominal data.


Example: eye colour is a nominal data type, because the data (each person’s eye colour) can be placed into groups and there is no hierarchy between those groups.
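
As a rough sketch of how nominal data is handled in practice, the values can be sorted into groups and counted, but not ranked. The list `eye_colours` below is invented for illustration.

```python
from collections import Counter

# Hypothetical survey responses: each value is a category, not a number.
eye_colours = ["brown", "blue", "green", "brown", "blue", "brown"]

# Counting how many data points fall into each group is meaningful...
counts = Counter(eye_colours)
print(counts["brown"])  # 3
print(counts["blue"])   # 2
print(counts["green"])  # 1

# ...but there is no hierarchy between the groups, so asking whether
# "brown" is greater than "blue" has no statistical meaning.
```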

Ordinal Data

