[Ai] AI Seminar: Brendan O'Donoghue

10 Jan 2024

Hi all,

The first AI Seminar this term is by Brendan O'Donoghue of DeepMind on Friday, 1/12, 2-3 pm (new time) in KEC 1001. The talk will be followed by 30 minutes of Q&A.

Zoom link: https://oregonstate.zoom.us/j/98684050301?pwd=ZzhianQxUFBPUmdYVWJKOFhaVURCQT...

Title: Reinforcement Learning from a Bayesian Perspective

Abstract
Reinforcement learning (RL) involves an agent interacting with an environment over time attempting to maximize its total return. Initially the agent does not know about the environment and must learn about it from experience. As the agent navigates the environment it receives noisy observations which it can use to update its (posterior) beliefs about the environment. Therefore, the RL problem is a statistical inference problem wrapped in a control problem, and the two problems must be tackled simultaneously for good data efficiency. This is because the policy of the agent affects the data it will collect, which in turn affects the policy, and so on. This is in contrast to supervised learning, where the performance of a classifier (for instance) does not influence the data it will later observe. Failure to properly consider the statistical aspect of the RL problem will result in agents that require exponential amounts of experience for good performance. On the other hand, correctly considering the statistical inference problem and the control problem together has the potential to dramatically reduce the compute requirements to solve problems and potentially unlock new domains and capabilities far outside of the range of current agents. In this talk I will introduce these concepts and discuss how Bayesian techniques can provide principled solutions to the problem.
Speaker Biography
Brendan O'Donoghue earned his PhD in 2013 from Stanford, working with Stephen Boyd on optimization and control theory. Since then he has been a research scientist at DeepMind, working on deep reinforcement learning, optimization, and (more recently) large language models.
Please check the seminar page https://engineering.oregonstate.edu/EECS/research/AI-seminars for future talks.

Prasad Tadepalli
