INDEX
Explanations
words related to diplomatic actions and international sanctions
Negative Logits
swick       -0.79
uscript     -0.78
ITNESS      -0.74
omorph      -0.69
NAS         -0.67
igel        -0.66
Bucc        -0.65
liter       -0.64
omorphic    -0.64
ocent       -0.63
Positive Logits
imposed      1.29
levied       1.18
punitive     1.10
targeting    1.09
regimes      1.08
enforced     1.05
crackdown    1.05
regime       1.04
prohibiting  1.00
enforcement  0.99
Activations Density: 1.855%