INDEX
Explanations
expressions of eagerness or willingness to engage and communicate
New Auto-Interp
Negative Logits
ickey
-0.15
Tong
-0.15
nic
-0.15
fak
-0.15
_DDR
-0.14
c
-0.14
felt
-0.14
663
-0.14
åĮĸ
-0.14
elling
-0.13
POSITIVE LOGITS
hearing
0.32
hear
0.29
Hearing
0.26
hears
0.24
Hear
0.22
hear
0.19
heard
0.19
chat
0.18
chat
0.18
ear
0.18
Activations Density 0.058%