INDEX
Explanations
mentions of the name "Billy."
New Auto-Interp
Negative Logits
551
-0.15
alar
-0.14
elling
-0.14
chal
-0.14
571
-0.14
lassian
-0.14
KeyPress
-0.14
:"-
-0.14
itas
-0.14
391
-0.14
POSITIVE LOGITS
Joe
0.17
boy
0.17
Bob
0.16
à¸Ĺะ
0.16
Boy
0.16
Boy
0.15
Goat
0.15
inging
0.15
odom
0.15
joe
0.15
Activations Density 0.006%