INDEX
Explanations
mentions of violations or adherence to international laws
references to international law and its violations
Negative Logits
Tate        -0.76
Volunte     -0.67
Bomber      -0.63
ï¸ı         -0.62
Ò           -0.61
CV          -0.61
flyer       -0.60
throats     -0.58
Sold        -0.58
Horde       -0.58
Positive Logits
enforcement  1.26
enforcement  1.01
suit         0.96
suits        0.92
Enforcement  0.91
abiding      0.88
lessness     0.85
governing    0.83
makers       0.81
forb         0.80
Activations Density: 0.056%