Explorations in statistics: the log transformation

Adv Physiol Educ. 2018 Jun 1;42(2):343-347. doi: 10.1152/advan.00018.2018.

Abstract

Learning about statistics is a lot like learning about science: the learning is more meaningful if you can actively explore. This thirteenth installment of Explorations in Statistics explores the log transformation, an established technique that rescales the actual observations from an experiment so that the assumptions of some statistical analysis are better met. A general assumption in statistics is that the variability of some response Y is homogeneous across groups or across some predictor variable X. If the variability-the standard deviation-varies in rough proportion to the mean value of Y, a log transformation can equalize the standard deviations. Moreover, if the actual observations from an experiment conform to a skewed distribution, then a log transformation can make the theoretical distribution of the sample mean more consistent with a normal distribution. This is important: the results of a one-sample t test are meaningful only if the theoretical distribution of the sample mean is roughly normal. If we log-transform our observations, then we want to confirm the transformation was useful. We can do this if we use the Box-Cox method, if we bootstrap the sample mean and the statistic t itself, and if we assess the residual plots from the statistical model of the actual and transformed sample observations.

Keywords: Central Limit Theorem; bootstrap; normal quantile plot; residual plots.

MeSH terms

  • Data Collection / methods*
  • Data Collection / statistics & numerical data
  • Humans
  • Linear Models*
  • Models, Statistical
  • Physiology / education*
  • Physiology / statistics & numerical data