INDEX
Explanations
references to Osama bin Laden
references and mentions of Osama bin Laden
New Auto-Interp
Negative Logits
anwhile
-0.78
ktop
-0.73
hips
-0.69
theless
-0.68
mble
-0.66
laus
-0.66
Jagu
-0.66
terday
-0.65
ITH
-0.65
Parish
-0.65
POSITIVE LOGITS
omial
1.73
ocular
1.71
Laden
1.55
nington
1.15
ning
1.04
jamin
1.00
ned
1.00
thood
0.97
utils
0.93
ational
0.92
Activations Density 0.042%