INDEX
Explanations
emotional responses and reactions
Negative Logits
Token       Logit
Hers        -0.15
/if         -0.15
dle         -0.15
/Private    -0.15
JKLM        -0.14
iner        -0.14
kers        -0.14
ysis        -0.14
tx          -0.14
mie         -0.14
Positive Logits
Token       Logit
ingly       0.24
everyone    0.22
everybody   0.21
us          0.21
both        0.19
audiences   0.19
many        0.18
/conf       0.18
imag        0.17
crowds      0.17
Activations Density: 0.118%