INDEX
Negative Logits
Dest
-0.07
sym
-0.06
priv
-0.06
.enemy
-0.06
Convention
-0.06
oriously
-0.06
_lm
-0.06
trúc
-0.06
_SSL
-0.06
înt
-0.06
POSITIVE LOGITS
gấp
0.07
conna
0.06
-covered
0.06
盤
0.06
tabl
0.06
%</
0.06
.fin
0.06
rror
0.06
"_
0.06
drFc
0.06
Activations Density 0.002%