INDEX
Negative Logits
immigration
-0.07
restriction
-0.06
investors
-0.06
elimination
-0.06
NotNull
-0.06
defeat
-0.06
Gaines
-0.06
Commands
-0.06
Immigration
-0.06
increasingly
-0.06
POSITIVE LOGITS
(outputs
0.07
CP
0.07
bote
0.06
NDAR
0.06
quirky
0.06
념
0.06
ヾ
0.06
cp
0.06
都市
0.06
кот
0.06
Activations Density 0.062%