INDEX
Negative Logits
our
-1.38
the
-1.27
℡
-1.27
πουργ
-1.23
wielu
-1.22
harán
-1.22
'
-1.20
ayudan
-1.20
bearded
-1.17
畦
-1.16
POSITIVE LOGITS
镲
1.83
犼
1.46
fbox
1.44
1.42
about
1.41
涠
1.36
μα
1.34
følge
1.34
posso
1.32
一闪
1.32
Activations Density 0.040%