INDEX
Negative Logits
ighting
-0.28
derby
-0.26
Derby
-0.26
ÑĢÑĥÑĤ
-0.25
Weaver
-0.25
å·¥å§Ķ
-0.24
Alt
-0.24
æ¬²æľĽ
-0.23
æIJĵ
-0.23
æ§İ
-0.23
POSITIVE LOGITS
ext
0.27
afx
0.27
åĩłå¼ł
0.26
PÃ¥
0.26
åľ¨åħ¨çIJĥ
0.25
aller
0.25
stret
0.25
çļĦæĹ¶ä»£
0.24
ança
0.24
åĩºå¤Ħ
0.24
Activations Density 0.002%