INDEX
Explanations
references to military or police operations
references to "raids" or related concepts
New Auto-Interp
Negative Logits
warmed
-0.68
milo
-0.64
chell
-0.63
DonaldTrump
-0.63
Leilan
-0.62
assetsadobe
-0.60
ogyn
-0.60
Ian
-0.59
Hurricanes
-0.58
issues
-0.56
POSITIVE LOGITS
raid
0.95
raids
0.90
raided
0.84
ers
0.84
ishly
0.82
nard
0.81
raiding
0.76
iversary
0.75
artment
0.75
netted
0.73
Activations Density 0.038%