INDEX
Negative Logits
肯
-0.09
actively
-0.08
abre
-0.07
รับ
-0.07
către
-0.07
uciones
-0.07
Nih
-0.07
ulet
-0.07
/u
-0.07
CAA
-0.07
POSITIVE LOGITS
Ay
0.10
osi
0.08
Ay
0.08
AY
0.08
ay
0.08
rhy
0.08
/change
0.07
nhiệm
0.07
ibi
0.07
mits
0.07
Activations Density 0.003%