INDEX
Explanations
proper nouns related to a specific figure or name, namely "Bin Laden."
mentions of "Bin Laden."
New Auto-Interp
Negative Logits
anwhile
-0.81
sburgh
-0.79
ITH
-0.74
mble
-0.72
Premium
-0.68
pter
-0.67
Raise
-0.66
Decay
-0.65
ESA
-0.64
ULT
-0.63
POSITIVE LOGITS
ocular
1.41
Bin
1.26
Laden
1.24
omial
1.24
nington
1.03
bin
1.00
thood
0.87
aries
0.86
jamin
0.83
oha
0.81
Activations Density 0.003%