Kyle and Linhda discuss attention and the transformer - an encoder/decoder architecture that extends the basic ideas of vector embeddings like word2vec into a more contextual use case.
The Transformer | Data Skeptic podcast - Listen or read transcript on Metacast