Processing data

Author

Andreas Handel

Modified

2024-03-20

Overview

While statistical analysis is important and cool, you will spend most of your time with the data in the pre-analysis stages.

By common estimates, up to 80% of the time in any real-world analysis goes into getting the data ready for the actual statistical model fitting. This module focuses on that 80%.

Learning Objectives

  • Be familiar with the concept of data wrangling.
  • Know what to do with missing data.
  • Know how to handle outliers.
  • Be aware of feature engineering.

Content

The module consists of the following units: