Explanations
Nothing — this neuron remains inactive and does not detect any specific tokens or patterns.
Negative Logits
Token        Logit
<eos>        -1.23
↵↵           -1.11
↵            -0.88
(            -0.86
<strong>     -0.85
(not shown)  -0.84
i            -0.83
<em>         -0.82
(not shown)  -0.82
.            -0.81
Positive Logits
Token          Logit
ValueStyle     2.17
itſelf         2.16
myſelf         2.14
^(@)           2.09
ſelves         2.08
Roskov         1.97
Personendaten  1.95
ſelf           1.94
doubtnut       1.94
ſind           1.92
Activations Density 0.000%
No Known Activations
This feature has no known activations.
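
The page does not define how the density figure is computed. As a minimal sketch, assuming it is the fraction of sampled tokens on which the feature's activation exceeds zero, a feature with no known activations would report 0.000%. The function name `activation_density`, the `threshold` parameter, and the sample data below are hypothetical and used only for illustration.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of sampled tokens whose activation exceeds `threshold`.

    Assumption: density means "share of tokens where the feature fires";
    the source page does not state its exact definition.
    """
    if not activations:
        # No samples at all: treat the feature as never active.
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: a feature that never fires on any sampled token.
dead_feature = [0.0] * 1000
print(f"{activation_density(dead_feature):.3%}")
```

Under this assumption, a density of zero says only that the feature did not fire on the sampled text; it does not by itself establish that the feature can never activate.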