Skip to yearly menu bar Skip to main content


Poster

A new dog learns old tricks: RL finds classic optimization algorithms

Weiwei Kong · Christopher Liaw · Aranyak Mehta · D. Sivakumar

Great Hall BC #26

Keywords: [ reinforcement learning ] [ algorithms ] [ adwords ] [ knapsack ] [ secretary ]


Abstract:

This paper introduces a novel framework for learning algorithms to solve online combinatorial optimization problems. Towards this goal, we introduce a number of key ideas from traditional algorithms and complexity theory. First, we draw a new connection between primal-dual methods and reinforcement learning. Next, we introduce the concept of adversarial distributions (universal and high-entropy training sets), which are distributions that encourage the learner to find algorithms that work well in the worst case. We test our new ideas on a number of optimization problem such as the AdWords problem, the online knapsack problem, and the secretary problem. Our results indicate that the models have learned behaviours that are consistent with the traditional optimal algorithms for these problems.

Live content is unavailable. Log in and register to view live content