On Adversarial Training & Robustness with Bhavna Gopal - podcast episode cover

On Adversarial Training & Robustness with Bhavna Gopal

May 08, 202444 min
--:--
--:--
Listen in podcast apps:

Episode description

"Understanding what's going on in a model is important to fine-tune it for specific tasks and to build trust."

Bhavna Gopal is a PhD candidate at Duke, research intern at Slingshot with experience at Apple, Amazon and Vellum.

We discuss

  • How adversarial robustness research impacts the field of AI explainability.
  • How do you evaluate a model's ability to generalize?
  • What adversarial attacks should we be concerned about with LLMs?
On Adversarial Training & Robustness with Bhavna Gopal | Thinking Machines: AI & Philosophy podcast - Listen or read transcript on Metacast