INDEX
Explanations
terms related to political affiliation and positions
New Auto-Interp
Negative Logits
posizioni
-0.41
Koc
-0.40
uebe
-0.38
exasperated
-0.37
がか
-0.36
McBride
-0.36
kungen
-0.36
setPosition
-0.36
decken
-0.36
imdi
-0.36
POSITIVE LOGITS
roek
0.83
ieteur
0.82
Kaieteur
0.74
चीज़ों
0.69
Guyana
0.69
Guyana
0.67
disambiguazione
0.66
Suriname
0.57
Искәрмәләр
0.57
Guiana
0.52
Activations Density 0.196%