INDEX
Explanations
expressions of disappointment and frustration in interpersonal situations
Negative Logits
oops    -0.16
ká      -0.15
Dank    -0.15
Hmm     -0.15
_VO     -0.15
drv     -0.14
Oops    -0.14
åĵ      -0.14
Yup     -0.14
æĮĤ     -0.14
Positive Logits
seriously    0.37
come         0.31
Come         0.28
Seriously    0.27
Seriously    0.27
come         0.26
ser          0.25
why          0.25
c            0.24
serious      0.24
Activations Density 0.298%
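
A density figure like the one above is commonly read as the share of sampled tokens on which the feature has a nonzero activation. The sketch below illustrates that reading only. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard that produced this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    fires at all; the dashboard may define it differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 tokens, 3 of which activate the feature.
sample = [0.0] * 997 + [1.2, 0.4, 2.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.298% would mean the feature fires on roughly 3 of every 1000 tokens.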