The current alignment plan, and how we might improve it | Buck Shlegeris | EAG Bay Area 23
May 26, 2023•50 min
Episode description
In this session, Buck is discussing how he thinks we should try to align artificial general intelligence (AGI) if we made no more fundamental progress on alignment, and then talks about how he thinks alignment researchers should try to improve this plan and ensure that whatever plans are available are executed competently.
Buck is the CTO at Redwood Research, a nonprofit based in Berkeley which does technical alignment research. He spent most of the last year researching mechanistic interpretability and related alignment techniques. He previously worked at MIRI and was a fund manager for the EA Infrastructure Fund.
For the best experience, listen in Metacast app for iOS or Android
