INDEX
Explanations
expressions of awe and emotional engagement in experiences
Negative Logits
oplast    -0.15
retrie    -0.14
aÄį       -0.14
loy       -0.14
oku       -0.13
enda      -0.13
aru       -0.13
LOY       -0.13
ibi       -0.13
406       -0.13
Positive Logits
adm           0.35
marvel        0.33
awe           0.32
aw            0.31
wonder        0.31
gas           0.30
admiration    0.29
adm           0.29
mar           0.28
admire        0.27
Activations Density 0.391%
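
The page does not define the density figure. A common reading, assumed here, is the share of sampled token positions on which the feature's activation is above zero. The sketch below shows that calculation on made-up data; the function name `activation_density`, the `threshold` parameter, and the toy activations are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical toy data: 1000 token positions, 4 of which activate.
toy_activations = [0.0] * 996 + [0.8, 1.2, 0.3, 2.1]
toy_density = activation_density(toy_activations)
print(f"{toy_density:.3%}")
```

Under this reading, a density of 0.391% would mean the feature is active on roughly 4 of every 1,000 sampled tokens.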