OverviewInput Encoding ComponentTransformer BlocksLanuage Modeling HeadDecodingExternal Links Lets Rock

Overview

The transformer is a standard architecture for building large language models. A transformer has three major components:

  1. Input Encoding Component
  2. Transformer Blocks
  3. Language Modeling Head

The following diagram shows the block diagram of a transformer architecture for language modeling:

transformer