INDEX
Explanations
emotional attributes and moral judgments related to characters and actions in narratives
New Auto-Interp
Negative Logits
rego
-0.18
kara
-0.16
oro
-0.15
lobs
-0.14
Welch
-0.14
.bid
-0.14
aday
-0.14
stag
-0.14
echn
-0.14
.DataType
-0.14
POSITIVE LOGITS
apus
0.17
æĥħ
0.14
tees
0.14
TOOLS
0.14
иж
0.14
/MPL
0.14
-kit
0.14
dal
0.14
McN
0.14
Dream
0.13
Activations Density 1.010%