INDEX
Explanations
days of the week - specifically, the neuron is likely finding mentions of specific days in news articles or documents
days of the week or specific dates mentioned in the text
New Auto-Interp
Negative Logits
atorial
-0.71
ced
-0.67
tumblr
-0.66
cius
-0.66
Vin
-0.65
gy
-0.64
76561
-0.63
tnc
-0.62
wb
-0.61
mental
-0.61
POSITIVE LOGITS
afternoon
1.05
morning
1.00
evening
0.94
night
0.83
redund
0.79
mornings
0.73
revealing
0.71
emphatically
0.67
»Ĵ
0.66
Tonight
0.65
Activations Density 0.112%