INDEX
Explanations
emotional expressions of affection and physical intimacy
New Auto-Interp
Negative Logits
NCY
-0.15
agal
-0.15
mbH
-0.14
ulta
-0.14
gangbang
-0.14
richt
-0.14
обÑĢаз
-0.14
åĬª
-0.14
prak
-0.14
uler
-0.14
POSITIVE LOGITS
kiss
0.35
hug
0.35
embrace
0.33
kisses
0.33
kissing
0.33
embraces
0.31
Kiss
0.30
æĭ¥
0.30
hugs
0.30
touch
0.30
Activations Density 0.152%