INDEX
Negative Logits
imos
-0.09
LATIN
-0.09
konkrét
-0.09
ÑģÑĮ
-0.09
spreads
-0.09
rippling
-0.08
heim
-0.08
teh
-0.08
imitation
-0.08
criptor
-0.08
POSITIVE LOGITS
different
0.16
ä¸įåIJĮ
0.16
khác
0.14
ä¸įåIJĮçļĦ
0.13
farklı
0.12
different
0.12
same
0.11
_different
0.11
ÑĢазнÑĭÑħ
0.11
responses
0.11
Activations Density 0.044%