Spotlights Session 2
Workshop: S2D-OLAD: From shallow to deep, overcoming limited and adverse data

Continuous Weight Balancing

Daniel J Wu

[ Abstract ]
Fri 7 May 11:36 a.m. PDT — 11:40 a.m. PDT


We propose a simple method by which to choose sample weights for problems with highly imbalanced or skewed traits. Rather than naively discretizing regression labels to find binned weights, we take a more principled approach - we derive sample weights from the transfer function between an estimated source and specified target distributions. Our method outperforms both unweighted and discretely-weighted models on both regression and classification tasks. We also open-source our implementation of this method, providing a modular and robust software package to the scientific community.