INDEX
Negative Logits
an
-0.07
失败
-0.07
jee
-0.07
ан
-0.07
adians
-0.06
stimuli
-0.06
ahan
-0.06
anomalies
-0.06
večer
-0.06
dalla
-0.06
POSITIVE LOGITS
extingu
0.13
shredded
0.06
Gloss
0.06
çeşit
0.06
TMP
0.06
yapılır
0.06
influenza
0.06
ch
0.06
mitted
0.06
uveden
0.06
Activations Density 0.001%