Expert-based reward function training: the novel method to train sequence generators

Workshop

Expert-based reward function training: the novel method to train sequence generators

Joji Toyama · Yusuke Iwasawa · Kotaro Nakayama · Yutaka Matsuo

East Meeting Level 8 + 15 #21

Tue 1 May, 4:30 p.m. PDT

[ Abstract ]

[ PDF]

The training methods of sequence generator with a combination of GAN and policy gradient has shown good performance. In this paper, we propose expert-based reward function training: the novel method to train sequence generator. Different from previous studies of sequence generation, expert-based reward function training does not utilize GAN's framework. Still, our model outperforms SeqGAN and a strong baseline, RankGAN.

Live content is unavailable. Log in and register to view live content