INDEX
Negative Logits
answer
-0.07
receivers
-0.07
accelerator
-0.07
simplicity
-0.06
mixing
-0.06
ubiqu
-0.06
Slate
-0.06
NTN
-0.06
Mixing
-0.06
言った
-0.06
POSITIVE LOGITS
ate
0.07
inhabit
0.07
box
0.07
patible
0.07
ồn
0.06
hoe
0.06
疲
0.06
매
0.06
!!.
0.06
Sao
0.06
Activations Density 0.008%