INDEX
Explanations
terms related to support or opposition in political contexts
New Auto-Interp
Negative Logits
itte
-0.17
akes
-0.15
laz
-0.14
ocha
-0.14
asia
-0.14
akis
-0.14
нод
-0.13
олÑı
-0.13
PGA
-0.13
ental
-0.13
POSITIVE LOGITS
taÅŁ
0.14
filled
0.14
Cong
0.14
ChildIndex
0.14
libft
0.14
daÅŁ
0.14
λια
0.13
.sun
0.13
lish
0.13
479
0.13
Activations Density 0.066%