In this video, we present the encoder layer in the transformer. Important components of this presentation is that we introduce multi-head attention, positional encodings and the architecture of the encoder blocks that appear inside the encoder.

The video is part of a series of videos on the transformer architecture, https://arxiv.org/abs/1706.03762. You can find the complete series and a longer motivation here:
https://www.youtube.com/playlist?list=PLDw5cZwIToCvXLVY2bSqt7F2gu8y-Rqje

Slides are available here:
https://chalmersuniversity.box.com/s/c2a64rz0hlp44pdouq9mc24msbz60xf2

Add comment

Your email address will not be published. Required fields are marked *

Categories

All Topics