NEGATIVE LOGITS
เก    -0.08
TR    -0.07
')↵↵    -0.06
,self    -0.06
mingle    -0.06
freshly    -0.06
Keeping    -0.06
:])↵    -0.06
"↵↵↵↵    -0.06
(unique    -0.06
POSITIVE LOGITS
exclusively    0.07
ゃ    0.07
anyways    0.07
nts    0.06
bob    0.06
.bill    0.06
菲    0.06
eleg    0.06
ects    0.06
прави    0.06
Activations Density 0.002%