INDEX
Explanations
keywords related to titles or designations
mentions of the term "Attention."
New Auto-Interp
Negative Logits
floating
-0.80
frozen
-0.68
crack
-0.68
pale
-0.68
Swiss
-0.67
savings
-0.67
stew
-0.66
conscience
-0.66
beans
-0.65
slice
-0.64
POSITIVE LOGITS
Att
3.75
Att
1.94
att
1.80
ATT
1.71
Attach
1.52
ATT
1.44
Attention
1.33
att
1.29
Attribute
1.27
Attack
1.27
Activations Density 0.017%