Glossary

Self-attention:

$\mathbf{Q}$, $\mathbf{K}$, $\mathbf{V}$: all derived from the *same* input sequence, typically through learned linear projections:

Learn More

Related Terms