INDEX
Explanations
emotions and emotional responses related to personal experiences and relationships
New Auto-Interp
Negative Logits
iesel
-0.16
foil
-0.14
_ARGUMENT
-0.14
abase
-0.13
uder
-0.13
orre
-0.13
.updateDynamic
-0.13
Ñĩила
-0.13
785
-0.13
пÑĸон
-0.13
POSITIVE LOGITS
cath
0.33
vulnerability
0.31
vent
0.29
outlet
0.29
processing
0.28
vulner
0.27
vulnerable
0.26
sharing
0.26
Processing
0.25
honesty
0.25
Activations Density 0.208%