INDEX
Negative Logits
<|endoftext|>
-0.13
<|reserved_200016|>
-0.11
mé
-0.08
<Product
-0.08
tiêu
-0.07
---------------↵
-0.07
driven
-0.07
576
-0.07
drivetrain
-0.07
-&
-0.07
POSITIVE LOGITS
haha
0.09
Hitler
0.08
�이
0.08
XOR
0.08
Ya
0.08
intim
0.07
Actually
0.07
Etern
0.07
ราช
0.07
gap
0.07
Activations Density 0.015%