INDEX
Explanations
terms related to public affairs and policies
New Auto-Interp
Negative Logits
Scroll
-0.79
Scroll
-0.76
uana
-0.69
RANT
-0.69
xual
-0.69
wered
-0.69
nesota
-0.68
kson
-0.67
vell
-0.67
nian
-0.66
POSITIVE LOGITS
relations
1.09
servants
1.07
izing
0.99
sector
0.97
ised
0.93
servant
0.90
outcry
0.89
opinion
0.89
izes
0.85
relations
0.84
Activations Density 2.708%