INDEX
Explanations
mentions of the terrorist group Boko Haram
New Auto-Interp
Negative Logits
ner
-0.78
Crow
-0.72
logs
-0.70
Driver
-0.66
priv
-0.66
Sto
-0.64
Wing
-0.64
sv
-0.63
Panzer
-0.62
throttle
-0.62
POSITIVE LOGITS
Haram
3.77
ETH
1.95
vernment
1.63
Mons
1.00
OUS
0.94
arij
0.87
ESA
0.87
icol
0.86
aghd
0.85
merce
0.84
Activations Density 0.055%