INDEX
Explanations
references to the concept of attention in various contexts
New Auto-Interp
Negative Logits
précédents
-0.67
maestra
-0.67
helst
-0.66
rabh
-0.65
Füße
-0.64
perfección
-0.63
difesa
-0.62
Feinde
-0.61
maux
-0.61
iegler
-0.60
POSITIVE LOGITS
attention
2.80
Attention
2.56
attention
2.35
Attention
2.29
ATTENTION
2.24
attentions
1.96
ATTENTION
1.88
aten
1.54
atención
1.52
atenção
1.46
Activations Density 0.039%