Recomendation: Please visit reading section first to understand the basic theories Visit Now
Tokens
GPT-2 Small
Embedding

Token Embedding


Positional Embedding

Transformer Blocks x12
Language Modeling Head

LayerNorm


Unembedding

Control Parameters
-2 2 0

1 10 0

Output Distribution

Enter Something.
Next Token: