Workshop
|
Fri 14:10
|
Improving Exploration in Policy Gradient Search: Application to Symbolic Optimization
|
|
Poster
|
Tue 17:00
|
DOP: Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang · Beining Han · Tonghan Wang · Heng Dong · Chongjie Zhang
|
|
Poster
|
Tue 9:00
|
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics
Yanchao Sun · Da Huo · Furong Huang
|
|
Poster
|
Tue 9:00
|
Learning Value Functions in Deep Policy Gradients using Residual Variance
Yannis Flet-Berliac · reda ouhamma · odalric-ambrym maillard · philippe preux
|
|
Poster
|
Tue 9:00
|
Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime
Andrea Agazzi · Jianfeng Lu
|
|
Poster
|
Wed 9:00
|
Differentiable Trust Region Layers for Deep Reinforcement Learning
Fabian Otto · Philipp Becker · Vien A Ngo · Hanna Ziesche · Gerhard Neumann
|
|
Poster
|
Wed 9:00
|
Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients
Brenden Petersen · Mikel Landajuela Larma · Terrell N Mundhenk · Claudio Santiago · Soo Kim · Joanne Kim
|
|
Oral
|
Tue 19:00
|
Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients
Brenden Petersen · Mikel Landajuela Larma · Terrell N Mundhenk · Claudio Santiago · Soo Kim · Joanne Kim
|
|