INDEX
Negative Logits
Fram
-0.32
Gui
-0.26
Corn
-0.26
_gui
-0.25
imper
-0.25
пÑĢинÑı
-0.25
gui
-0.24
implication
-0.24
Ton
-0.24
æľīèī²
-0.24
POSITIVE LOGITS
sez
0.29
stands
0.26
éĵ±
0.26
UNUSED
0.25
standing
0.25
kaldır
0.25
otron
0.25
stand
0.24
åĽ¾çīĩæĿ¥æºIJ
0.24
iid
0.23
Activations Density 0.022%