INDEX
Explanations
keywords related to attention, focus, and detail
references to attention and its variations in various contexts
Negative Logits
halves        -0.71
Tale          -0.65
Yugoslavia    -0.64
Pist          -0.64
Recon         -0.64
tto           -0.63
Rouge         -0.61
tre           -0.61
Dani          -0.60
Mehran        -0.60
Positive Logits
estinal          0.93
orial            0.89
attention        0.84
ively            0.82
spans            0.78
Attention        0.76
largeDownload    0.75
arios            0.75
seeker           0.75
span             0.71
Activations Density 0.023%