Data Processing Intro

Data pre-processing techniques generally refer to the addition, deletion, or transformation of training set data. Different models have different sensitivities to the type of predictors in the model; how the predictors enter the model is also important. Transformations of the data to reduce the impact of data skewness or outliers can lead to significant improvements in performance.

In general, the work required to make your dataset easy to analyze.

  • Easy to use column names
  • Fixing inconsistent variables
  • Merging with other datasets
  • Reshaping (melt, pivot, ect..)
  • Dealing with missing values
Greydon Gilmore
Greydon Gilmore
Ph.D. Candidate in Biomedical Engineering

My research interests include deep brain stimulation, machine learning and signal processing.

Next