INDEX
Negative Logits
ãĤ´ãĥ³
-0.78
Scroll
-0.67
bart
-0.66
extraord
-0.65
AMY
-0.65
tl
-0.65
cffffcc
-0.64
ãĤ¶
-0.63
patch
-0.63
alsa
-0.62
POSITIVE LOGITS
respondents
0.91
unfavorable
0.86
either
0.82
they
0.81
majorities
0.76
owning
0.74
lifetime
0.74
negatively
0.73
favorably
0.73
themselves
0.72
Activations Density 0.094%