Explanations
mentions of the name "Stephen" and variations of it
Negative Logits
lef        -0.16
nell       -0.16
_detach    -0.14
erval      -0.14
lain       -0.14
orta       -0.14
ickle      -0.14
aits       -0.14
oline      -0.14
置         -0.14
Positive Logits
ie         0.26
Colbert    0.24
Haw        0.20
Spielberg  0.19
Cove       0.18
Merchant   0.16
stown      0.16
Curry      0.16
Minister   0.15
Singular   0.15
Activations Density: 0.008%