NEGATIVE LOGITS
Token          Logit
lur            -0.08
σα             -0.08
_Static        -0.08
overlapping    -0.08
uniformly      -0.08
necess         -0.08
aville         -0.08
zuk            -0.07
가기           -0.07
oxide          -0.07

POSITIVE LOGITS
Token          Logit
�              0.09
�              0.09
<|message|>    0.09
�              0.09
conclusão      0.09
�              0.08
안내           0.08
<|end|>        0.08
GPT            0.08
�              0.08
Activations Density: 0.075%