The foundational intuitions behind scaled dot product attention, the core operation of the transformer architecture.