Explanations
physical closeness and warmth
Negative Logits
Token        Logit
Buckets      -0.81
Bypass       -0.78
specular     -0.75
गां          -0.75
ಗೆ           -0.75
ervis        -0.73
temporada    -0.72
Imagery      -0.71
ailles       -0.71
mercy        -0.71
Positive Logits
Token        Logit
nu           1.48
snug         1.41
bury         1.31
burying      1.29
nest         1.19
cling        1.13
buried       1.09
closeness    1.06
against      1.05
warm         1.02
Activations Density 0.007%