INDEX
Explanations
phrases related to confidence and mental states
expressions of confidence and emotional resilience
Negative Logits
ulence         -0.90
translation    -0.74
alogy          -0.69
validity       -0.69
Mutual         -0.68
nce            -0.68
tesy           -0.68
Translation    -0.68
nuance         -0.67
omy            -0.67
Positive Logits
addicted       1.40
aware          1.35
able           1.26
afraid         1.24
confident      1.22
unable         1.21
asleep         1.20
involved       1.19
awake          1.19
intoxicated    1.18
Activations Density 0.833%
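
The density figure above is most plausibly the fraction of token positions on which the feature fires. The page does not define it, so this reading is an assumption. A minimal Python sketch under that assumption follows. The function name `activation_density`, the `threshold` parameter, and the toy activation list are all hypothetical and are not taken from the dashboard's data.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means (number of active positions) / (total positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 5 active positions out of 600 in total.
toy = [0.0] * 595 + [0.7, 1.2, 0.3, 2.1, 0.9]
density = activation_density(toy)
print(f"{density:.3%}")
```

At a density this low the feature would be active on fewer than one token in a hundred, which fits a feature tied to a narrow family of words such as the predicate adjectives in the positive-logit list.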