INDEX
Explanations
references to explosive devices or weapons
references to pagans and grenades
New Auto-Interp
Negative Logits
sburgh
-0.92
balls
-0.83
yll
-0.83
icle
-0.82
esley
-0.82
angelo
-0.79
cream
-0.76
ening
-0.76
yl
-0.74
y
-0.74
POSITIVE LOGITS
vernment
0.93
adier
0.77
atos
0.72
vier
0.70
merce
0.66
issors
0.65
ACTED
0.63
aceae
0.63
âĸ¬âĸ¬
0.62
ptions
0.59
Activations Density 0.090%