Explanations
evaluative sentiments towards characters and their actions in narratives
Negative Logits
/by       -0.16
AGO       -0.14
Anyway    -0.14
actly     -0.14
訳        -0.14
sic       -0.13
届        -0.13
座        -0.13
Anyway    -0.13
işte      -0.13
Positive Logits
granted     0.28
maybe       0.26
overall     0.24
Overall     0.22
Granted     0.22
Overall     0.21
maybe       0.21
personally  0.20
Maybe       0.20
sure        0.19
Activations Density 0.330%
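
The density figure above can be read as the share of sampled tokens on which the feature fires. The page does not state how it is computed, so the sketch below assumes the common definition: the fraction of tokens whose activation exceeds a threshold. The function name `activation_density`, the `threshold` parameter and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density means "share of tokens on which the feature is
    active"; the source page does not define it.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1,000 tokens, of which 3 activate the feature.
sample = [0.0] * 997 + [0.8, 1.2, 0.4]
density = activation_density(sample)

# Format as a percentage with three decimals, matching the style of the
# figure reported above.
print(f"{density:.3%}")
```

Under this reading, a density of 0.330% would mean the feature is active on roughly 33 of every 10,000 tokens.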