INDEX
Explanations
words related to strong negative emotions like horror, shock, and being appalled
expressions of strong negative emotions, particularly related to shock or horror
New Auto-Interp
Negative Logits
amins
-0.71
impro
-0.71
estate
-0.71
cially
-0.70
aho
-0.69
eworks
-0.69
ramid
-0.68
icipated
-0.66
ello
-0.65
enture
-0.65
POSITIVE LOGITS
aback
0.82
ĸļ
0.80
ingly
0.74
silence
0.72
ãĤ¦ãĤ¹
0.71
disbelief
0.70
onlook
0.68
horrified
0.68
urous
0.66
bystanders
0.65
Activations Density 0.065%