Explanations
Neuron 4 appears to respond to data encoded in a different format, possibly tied to a specific language or system; the sample text itself shows no clear pattern.
Special characters or symbols with specific patterns.
Negative Logits
theless        -0.71
aturdays       -0.70
ographically   -0.67
ulators        -0.67
Kens           -0.66
ithing         -0.64
Ness           -0.63
Redd           -0.63
anwhile        -0.61
essa           -0.60
Positive Logits
¹              1.95
²              1.88
¨              1.86
°              1.85
Ń              1.84
¶              1.83
¦              1.79
´              1.76
¸              1.76
¯              1.76
Activations Density 0.007%
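
The density figure can be read as how rarely the neuron fires. The page does not define the metric, so the following is only a minimal sketch, assuming that density means the percentage of sampled tokens whose activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" is the share of tokens on which the neuron is active.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: the neuron is active on 7 of 100,000 sampled tokens.
sample = [0.0] * 99_993 + [1.2] * 7
density = activation_density(sample)
print(f"{density:.3f}%")  # prints "0.007%"
```

Under this reading, a density of 0.007% corresponds to roughly 7 active tokens per 100,000, which would make this a very sparse neuron.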