INDEX
Negative Logits
lady
-0.81
what
-0.79
tending
-0.79
負担
-0.79
предме
-0.79
pseud
-0.78
Drogen
-0.77
capability
-0.76
loading
-0.75
coolest
-0.75
POSITIVE LOGITS
Ẻ
0.95
ϰ
0.79
為に
0.77
̀ng
0.76
questões
0.75
depor
0.75
sí
0.74
Helens
0.74
änd
0.73
wohner
0.73
Activations Density 0.006%