INDEX
Explanations
percentages and survey responses related to public opinion
New Auto-Interp
Negative Logits
eya
-0.17
Ã¥n
-0.16
925
-0.15
овÑĸд
-0.15
PTY
-0.15
orc
-0.15
dge
-0.15
ubby
-0.14
oldt
-0.14
ìĭ¬
-0.14
POSITIVE LOGITS
increased
0.14
unchanged
0.14
Lies
0.14
Bru
0.14
outright
0.14
Cousins
0.14
Conc
0.13
ozo
0.13
yes
0.13
themselves
0.13
Activations Density 0.020%