downsampling-and-upweighting

One way to handle an [[Imbalanced Datasets]] is to downsample and upweight the majority class. Here are the definitions of those two new terms:

  • Downsampling (in this context) means training on a disproportionately low subset of the majority class examples.
  • Upweighting means adding an example weight to the downsampled class equal to the factor by which you downsampled.