INDEX
Explanations
personal names
mentions of names and proper nouns
New Auto-Interp
Negative Logits
bleach
-0.85
Bleach
-0.73
¡
-0.73
207
-0.72
idth
-0.67
Ble
-0.64
Beckham
-0.64
Chevron
-0.64
ected
-0.64
OUP
-0.63
POSITIVE LOGITS
m
1.41
M
1.25
mic
1.13
MI
1.10
mt
1.10
mill
1.08
mob
1.07
MN
1.06
mo
1.06
Ms
1.05
Activations Density 0.276%