Explanations
situations involving physical intimacy and playful interactions between characters
Negative Logits
Walking      -0.54
walk         -0.54
walking      -0.53
Walking      -0.53
rushed       -0.49
rushing      -0.49
walks        -0.48
walked       -0.47
Walk         -0.47
caminando    -0.46
Positive Logits
munch           0.70
contemplating   0.65
contempl        0.65
rumin           0.64
fidd            0.64
contemplated    0.62
contemplates    0.62
kasarigan       0.62
chatted         0.59
chatting        0.58
Activations Density 0.303%