INDEX
Explanations
terms related to political parties and their strategies
negative consequences
negative outcomes or flaws
New Auto-Interp
Negative Logits
+:+
-0.74
chyma
-0.71
matorium
-0.68
WireFormatLite
-0.67
autorytatywna
-0.65
öhnt
-0.61
Vanjske
-0.61
protoimpl
-0.60
שוליים
-0.58
Демографія
-0.57
POSITIVE LOGITS
Worse
1.17
Worse
1.07
worse
0.96
akibat
0.91
caused
0.89
causing
0.86
導致
0.85
due
0.83
:(
0.81
导致
0.80
Activations Density 0.750%