INDEX
Negative Logits
#!
-0.07
807
-0.07
Translation
-0.07
disp
-0.06
polite
-0.06
flags
-0.06
itizer
-0.06
_flag
-0.06
-0.06
984
-0.06
POSITIVE LOGITS
meme
0.07
(eq
0.06
<
0.06
cepts
0.06
(UnmanagedType
0.06
conception
0.06
DED
0.06
=[
0.06
-To
0.06
여성
0.06
Activations Density 0.054%