INDEX
Explanations
expressions of positive or negative emotional reactions
New Auto-Interp
Negative Logits
aver
-0.16
ordes
-0.15
u
-0.14
/from
-0.14
acades
-0.14
iry
-0.14
NR
-0.14
adge
-0.13
edException
-0.13
afort
-0.13
POSITIVE LOGITS
ness
0.20
NESS
0.19
HeaderCode
0.16
GuidId
0.16
.nih
0.16
Král
0.15
ENTRY
0.14
-looking
0.14
iye
0.14
capt
0.14
Activations Density 0.156%