INDEX
Explanations
The neuron looks for text prompting the reader to read more information
phrases indicating a desire for additional information or to read more
New Auto-Interp
Negative Logits
cffffcc
-0.75
©¶æ¥µ
-0.63
alties
-0.62
cknowled
-0.61
orously
-0.60
gest
-0.60
ochem
-0.59
ħĭ
-0.58
neither
-0.58
seams
-0.58
POSITIVE LOGITS
VIDEOS
0.99
âĨĴ
0.81
âĢº
0.79
âĸº
0.79
>>
0.78
>>
0.77
»
0.77
>>>
0.76
<|endoftext|>
0.71
»
0.71
Activations Density 0.033%