INDEX
Explanations
that the neuron is looking for phrases indicating someone is about to share information or opinion
instances of the phrase "let me tell you."
New Auto-Interp
Negative Logits
hift
-0.67
BuyableInstoreAndOnline
-0.66
Joined
-0.60
untarily
-0.60
©¶æ¥µ
-0.58
iband
-0.57
calling
-0.55
ioned
-0.54
=~
-0.54
yip
-0.53
POSITIVE LOGITS
guys
1.14
somet
0.90
why
0.74
something
0.74
anecd
0.72
tub
0.71
tonight
0.70
Majesty
0.70
what
0.70
're
0.70
Activations Density 0.044%