INDEX
Explanations
concepts related to feelings of shame and embarrassment
expressions of shame and embarrassment
New Auto-Interp
Negative Logits
agnetic
-0.80
ondo
-0.78
yip
-0.76
irements
-0.75
ello
-0.74
enegger
-0.72
ingham
-0.72
foreseen
-0.71
arms
-0.69
skill
-0.69
POSITIVE LOGITS
ashamed
1.03
faced
0.83
Shame
0.78
Zucker
0.78
embarrassed
0.72
embarrassment
0.71
shame
0.70
ness
0.70
rets
0.69
NESS
0.67
Activations Density 0.010%