INDEX
Explanations
The neuron is looking for instances of the word "going" followed by a verb
instances of the word "going" in various contexts
New Auto-Interp
Negative Logits
Horus
-0.79
ullah
-0.69
unc
-0.69
fold
-0.65
chery
-0.63
ividually
-0.62
rehens
-0.61
cipl
-0.61
boxing
-0.60
cia
-0.60
POSITIVE LOGITS
Ń·
0.98
rall
0.78
ãĥ£
0.77
bankrupt
0.69
GREEN
0.67
externalActionCode
0.66
lems
0.66
ãĤĭ
0.66
estone
0.66
sour
0.65
Activations Density 0.054%