MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space

Tsz Him Cheung · Dit-Yan Yeung

Keywords: [ automated data augmentation ] [ data augmentation ] [ latent space ] [ deep learning ]

Mon 3 May 1 a.m. PDT — 3 a.m. PDT


Data augmentation is an efficient way to expand a training dataset by creating additional artificial data. Although data augmentation has proven effective in improving the generalization of models across various machine learning tasks, the underlying augmentation methods are usually designed by hand and carefully evaluated for each data modality separately, such as image processing functions for image data and word-replacement rules for text data. In this work, we propose an automated data augmentation approach called MODALS (Modality-agnostic Automated Data Augmentation in the Latent Space) that augments data of any modality in a generic way. MODALS uses automated data augmentation to tune four universal data transformation operations in the latent space, adapting the transformations to data of different modalities. Through comprehensive experiments, we demonstrate the effectiveness of MODALS on multiple datasets covering the text, tabular, time-series and image modalities.
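
To make the idea of modality-agnostic transformations in the latent space more concrete, here is a minimal sketch in plain Python. The abstract does not name the four operations, so the sketch assumes interpolation, extrapolation, additive Gaussian noise, and a difference transform as illustrative choices. The function names, the policy format of (operation, probability, magnitude) triples, and the `augment` helper are all hypothetical and are not taken from the authors' code. The point is only that each operation acts on latent vectors, so nothing in it depends on whether the encoder consumed text, tables, time series, or images.

```python
import random


def interpolate(z, z_other, lam):
    """Move z a fraction lam of the way toward another latent vector."""
    return [a + lam * (b - a) for a, b in zip(z, z_other)]


def extrapolate(z, center, lam):
    """Push z away from a reference point (here: the class mean)."""
    return [a + lam * (a - c) for a, c in zip(z, center)]


def gaussian_noise(z, sigma, lam, rng):
    """Add zero-mean Gaussian noise scaled per dimension by sigma and lam."""
    return [a + lam * rng.gauss(0.0, s) for a, s in zip(z, sigma)]


def difference(z, z_a, z_b, lam):
    """Shift z by a scaled difference between two other latent vectors."""
    return [a + lam * (p - q) for a, p, q in zip(z, z_a, z_b)]


def augment(z, same_class, policy, rng):
    """Apply a policy to one latent vector.

    policy is a list of (op_name, probability, magnitude) triples. In an
    automated setup these values would be tuned by a search procedure;
    here they are simply supplied by the caller (an assumption).
    """
    columns = list(zip(*same_class))
    center = [sum(col) / len(col) for col in columns]
    sigma = [
        (sum((v - c) ** 2 for v in col) / len(col)) ** 0.5
        for col, c in zip(columns, center)
    ]
    for name, prob, lam in policy:
        if rng.random() >= prob:
            continue  # skip this operation for this sample
        if name == "interpolate":
            z = interpolate(z, rng.choice(same_class), lam)
        elif name == "extrapolate":
            z = extrapolate(z, center, lam)
        elif name == "noise":
            z = gaussian_noise(z, sigma, lam, rng)
        elif name == "difference":
            z_a, z_b = rng.sample(same_class, 2)
            z = difference(z, z_a, z_b, lam)
        else:
            raise ValueError(f"unknown operation: {name}")
    return z


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy 2-D latent vectors that are assumed to share one class label.
    latents = [[0.0, 0.0], [2.0, 4.0], [1.0, 1.0]]
    policy = [("interpolate", 0.5, 0.3), ("noise", 0.5, 0.1)]
    print(augment(latents[0], latents, policy, rng))
```

In this sketch the only quantities that would change between modalities are the encoder that produces the latent vectors and the policy values, which is where an automated search would do its work.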
