INDEX
Explanations
words related to listening or attention
Negative Logits
Janeiro       -0.65
Palm          -0.63
eros          -0.63
mber          -0.63
rament        -0.62
INAL          -0.61
\/\/          -0.60
Leap          -0.59
esan          -0.58
redemption    -0.58
Positive Logits
attent        1.34
listened      0.94
aloud         0.92
closely       0.92
listens       0.91
passively     0.88
listen        0.85
listening     0.85
loudly        0.83
patiently     0.83
Activations Density: 0.017%