INDEX
Explanations
The neuron seems to be searching for occurrences of words related to the concept of time, specifically emphasizing promptness or immediacy
the temporal adverb "soon."
New Auto-Interp
Negative Logits
atively
-0.63
UTF
-0.62
rament
-0.62
olding
-0.62
ocker
-0.60
Keeping
-0.59
Saharan
-0.59
cheon
-0.59
ribe
-0.58
iscal
-0.58
POSITIVE LOGITS
thereafter
0.93
Soon
0.79
Soon
0.78
ĪĴ
0.77
enough
0.70
succumb
0.70
aneously
0.69
puberty
0.67
afterward
0.67
Ukrain
0.67
Activations Density 0.030%