From Computerphile.
The problem with reinforcement learning in Generative AI is that it’s difficult to turn the real-world into a graph. Sydney Von Arx of METR talks about an approach to solve this.
More about Sydney: https://bit.ly/4aqm1mG
Computerphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: https://jane-st.co/computerphile
This video was filmed and edited by Sean Riley.
Computerphile is a sister project to Brady Haran’s Numberphile. More at https://www.bradyharanblog.com


