INDEX
Explanations
It appears that Neuron 4 does not respond to any specific content in the provided excerpts, as indicated by the absence of any non-zero activation values in the activations given. Without any non-zero activations to analyze, it is not possible to determine what Neuron 4 is looking for
New Auto-Interp
Negative Logits
pire
-0.72
largeDownload
-0.69
ihilation
-0.68
ption
-0.66
conservancy
-0.65
wic
-0.64
merce
-0.64
ument
-0.63
IPP
-0.62
Eternity
-0.62
POSITIVE LOGITS
referen
0.75
Lank
0.75
acus
0.67
Leban
0.63
QUI
0.62
questioning
0.62
boa
0.61
compar
0.60
bal
0.60
Vall
0.59
Activations Density 0.000%
No Known Activations
This feature has no known activations.