INDEX
Explanations
Similar to Neuron 3, this neuron looks for phrases related to well-wishing and greetings
phrases indicating related content or references
Negative Logits
arrang          -0.88
bably           -0.82
thereafter      -0.82
itionally       -0.81
carbohyd        -0.80
eatures         -0.80
tremend         -0.80
idine           -0.79
subsequently    -0.77
compositions    -0.76
Positive Logits
Why             1.38
How             1.34
Latest          1.31
Ranking         1.21
Meet            1.20
What            1.17
Where           1.16
Exclusive       1.14
Inside          1.14
Should          1.13
Activations Density: 0.087%