INDEX
Explanations
mentions of the name "Osama bin Laden"
references to Osama bin Laden
New Auto-Interp
Negative Logits
Amazing
-0.74
terday
-0.74
anwhile
-0.71
theless
-0.64
LU
-0.63
IRO
-0.63
Premium
-0.62
pter
-0.62
Parish
-0.62
BLE
-0.62
POSITIVE LOGITS
ocular
1.58
omial
1.48
Laden
1.26
bin
1.17
bins
0.99
mates
0.89
utils
0.88
Bin
0.84
ning
0.84
thood
0.83
Activations Density 0.005%