NEGATIVE LOGITS
yaw             -0.09
Ý               -0.09
Deutsch         -0.08
controversial   -0.08
phạm            -0.08
ém              -0.08
.documentation  -0.08
interviewed     -0.07
Elsa            -0.07
Opinions        -0.07
POSITIVE LOGITS
satisfactor     0.08
勤務            0.08
wholes          0.07
accommodating   0.07
istance         0.07
ence            0.07
mott            0.07
nic             0.07
處              0.07
inches          0.07
ACTIVATIONS DENSITY: 0.003%