INDEX
Negative Logits
-human
-0.06
%/
-0.06
slog
-0.06
ตะ
-0.06
��
-0.06
と思う
-0.06
کنار
-0.06
grac
-0.06
вида
-0.06
_soc
-0.06
POSITIVE LOGITS
);↵↵
0.08
conquest
0.07
Withdraw
0.07
atif
0.07
Episodes
0.06
başlat
0.06
(prob
0.06
Withdraw
0.06
FP
0.06
.cf
0.06
Activations Density 0.026%