INDEX
Explanations
emotional responses and relationships in narratives
Negative Logits
642      -0.19
oller    -0.15
ault     -0.15
abee     -0.15
vd       -0.14
aul      -0.14
acho     -0.14
iform    -0.14
kar      -0.14
ild      -0.13
Positive Logits
such     0.26
这样的   0.26
这种     0.25
these    0.23
Such     0.23
this     0.22
这样     0.22
چنین     0.22
这一     0.22
Such     0.22
Activations Density 0.403%