INDEX
Explanations
expressions of affection and intimate connections among individuals
Negative Logits
Token           Logit
Camp            -0.30
potensi         -0.29
interests       -0.28
interessiert    -0.28
ugyan           -0.28
Camp            -0.28
campagna        -0.28
camp            -0.28
生              -0.27
swf             -0.26
Positive Logits
Token           Logit
cuddle          1.01
cuddling        0.98
hug             0.98
hugging         0.97
hugged          0.94
hugs            0.93
cudd            0.87
cuddly          0.81
hugs            0.80
Hug             0.79
Activations Density 0.172%
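
As a rough illustration of what an activation density figure like the one above might mean, the sketch below computes the fraction of token positions on which a feature fires. The dashboard does not say how its density is computed, so treating it as "share of tokens with activation above zero" is an assumption, and the helper name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature is
    active. The source dashboard does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 172 of which activate the feature.
# The counts are made up so that the result lines up with the reported figure.
sample = [0.0] * 99_828 + [2.0] * 172
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that reading, a density of 0.172% would correspond to the feature firing on roughly 17 of every 10,000 tokens, which fits a feature tied to a narrow vocabulary such as "hug" and "cuddle".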