INDEX
Negative Logits
sınır
-0.07
ければ
-0.06
homo
-0.06
经
-0.06
agog
-0.06
todd
-0.06
disproportionate
-0.06
pierws
-0.06
dropout
-0.06
since
-0.06
POSITIVE LOGITS
Ekim
0.07
Anim
0.06
gums
0.06
Salmon
0.06
imaginative
0.06
sống
0.06
captcha
0.06
ствен
0.06
đàn
0.06
<W
0.06
Activations Density 0.078%