StrategyQA and Big Bench
Nov 18, 2022•42 min
Episode description
Did Aristotle Use a Laptop? That's a question from the StrategyQA benchmark which highlights the stretch goals for current artificial intelligence systems. Answering a question like that requires several cognitive steps and reasoning. Constructing a dataset of similarly challenging questions is a major undertaking. On today's episode, Mor Geva returns to share details about the creation of StrategyQA and the larger Big Bench dataset it has been included in.
For the best experience, listen in Metacast app for iOS or Android
