INDEX
Negative Logits
springs     -0.07
(IR         -0.07
NFL         -0.06
pun         -0.06
in          -0.06
tabs        -0.06
圧          -0.06
//!         -0.06
_damage     -0.06
=read       -0.06
Positive Logits
ientos      0.08
incurred    0.07
&view       0.06
Nicola      0.06
+%          0.06
shameful    0.06
tyto        0.06
purported   0.06
biểu        0.06
_DISABLE    0.06
Activations Density: 0.041%