INDEX
Explanations
expressions of physical sensations and body language
New Auto-Interp
Negative Logits
cps
-0.07
yat
-0.07
ÙĪÙĤ
-0.07
koup
-0.07
hoe
-0.06
entina
-0.06
cmc
-0.06
PLAIN
-0.06
aines
-0.06
bel
-0.06
POSITIVE LOGITS
åIJĪåIJĮ
0.06
erot
0.06
Sag
0.06
.Release
0.06
usters
0.06
against
0.06
eyebrows
0.06
Buster
0.06
contro
0.05
±
0.05
Activations Density 0.012%