INDEX
Explanations
I'm sorry, it seems like Neuron 4 did not show consistent activations to provide a clear summary. Let me know if you would like more assistance with this neuron or any other aspect
the phrase "Continue reading" in various contexts
New Auto-Interp
Negative Logits
ranch
-0.83
arse
-0.70
»Ĵ
-0.70
MpServer
-0.68
bid
-0.66
arious
-0.66
ĪĴ
-0.66
opic
-0.66
rador
-0.63
EStreamFrame
-0.63
POSITIVE LOGITS
Continue
0.81
Continued
0.80
Loading
0.73
...]
0.71
chu
0.68
Reading
0.68
taboola
0.67
Transcript
0.66
Advertisement
0.64
CLASSIFIED
0.64
Activations Density 0.012%