INDEX
Explanations
references to various crimes being committed, especially crimes against humanity and war crimes
references to crimes, particularly war crimes and crimes against humanity
Negative Logits
quickShipAvailable    -0.88
alg                   -0.79
pole                  -0.75
achev                 -0.70
agles                 -0.69
ulous                 -0.69
flush                 -0.64
agin                  -0.63
ersed                 -0.62
snipp                 -0.62
Positive Logits
committed             1.09
perpetrated           1.04
punishable            1.04
spree                 0.92
crimes                0.87
involving             0.84
against               0.84
offences              0.81
Crimes                0.77
offenses              0.75
Activations Density: 0.032%