Glossary

Bahdanau/Luong cross-attention:

$\mathbf{Q}$: decoder hidden states - $\mathbf{K}$: encoder hidden states - $\mathbf{V}$: encoder hidden states (K = V in most formulations)

Learn More

Related Terms