Explanations
emotional experiences and the complex nature of feelings
Negative Logits
Token     Logit
ÑĪка      -0.16
azy       -0.15
ICTURE    -0.15
utow      -0.14
lack      -0.14
pery      -0.14
onus      -0.14
ynes      -0.14
.va       -0.14
å¿ĥçIJĨ    -0.14
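Two entries in the negative-logit list (`ÑĪка` and `å¿ĥçIJĨ`) look like raw byte-level BPE token strings rather than readable text. The sketch below shows one way to recover the underlying characters, assuming the tokens use the GPT-2 style byte-to-unicode mapping; the page itself does not name the tokenizer, so that is an assumption. The helper names `byte_decoder` and `decode_token` are hypothetical, and characters outside the mapping (such as the already readable `ка`) are passed through unchanged.

```python
def byte_decoder():
    """Invert the GPT-2 style byte-to-unicode table (assumed tokenizer)."""
    # Printable bytes are displayed as themselves.
    keep = list(range(0x21, 0x7F)) + list(range(0xA1, 0xAD)) + list(range(0xAE, 0x100))
    mapping = {chr(b): b for b in keep}
    # Every other byte is displayed as code point 256 + n, in byte order.
    n = 0
    for b in range(256):
        if b not in keep:
            mapping[chr(256 + n)] = b
            n += 1
    return mapping


def decode_token(token):
    """Turn a displayed token string back into readable text."""
    table = byte_decoder()
    out, buf = [], bytearray()
    for ch in token:
        if ch in table:
            # Character stands for a single raw byte.
            buf.append(table[ch])
        else:
            # Character is already decoded text: flush pending bytes, keep it.
            out.append(buf.decode("utf-8", errors="replace"))
            buf.clear()
            out.append(ch)
    out.append(buf.decode("utf-8", errors="replace"))
    return "".join(out)


if __name__ == "__main__":
    for tok in ["ÑĪка", "å¿ĥçIJĨ", "azy"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, plain ASCII tokens such as `azy` come back unchanged, while the two unusual entries decode to a Cyrillic fragment and a two-character CJK string.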
Positive Logits
Token     Logit
sad       0.30
Sad       0.28
sadness   0.24
Sad       0.23
anger     0.22
pain      0.22
sorrow    0.21
laugh     0.20
hope      0.20
laughter  0.20
Activations Density 0.182%