INDEX
Explanations
the phrases related to listening and responding to someone's thoughts or opinions
expressions of communication and requests for attention or feedback
Negative Logits
ucket        -0.83
entin        -0.78
\/\/         -0.75
iant         -0.75
ccording     -0.71
thumbnails   -0.71
abase        -0.68
teness       -0.67
igsaw        -0.66
ritional     -0.66
Positive Logits
voices       1.17
loud         1.14
roar         1.12
firsthand    1.05
voice        1.03
noises       1.00
cries        0.99
footsteps    0.96
screams      0.94
whispers     0.94
Activations Density: 0.110%