Skip to yearly menu bar Skip to main content


Poster

GenDataAgent: On-the-fly Dataset Augmentation with Synthetic Data

Zhiteng Li · Lele Chen · Jerone Andrews · Yunhao Ba · Yulun Zhang · Alice Xiang

Hall 3 + Hall 2B #93
[ ] [ Project Page ]
Thu 24 Apr midnight PDT — 2:30 a.m. PDT

Abstract:

We propose a generative agent that augments training datasets with synthetic data for model fine-tuning. Unlike prior work, which uniformly samples synthetic data, our agent iteratively generates relevant samples on-the-fly, aligning with the target distribution. It prioritizes synthetic data that complements difficult training samples, focusing on those with high variance in gradient updates. Experiments across several image classification tasks demonstrate the effectiveness of our approach.

Live content is unavailable. Log in and register to view live content