INDEX
Explanations
This neuron never fires—it does not detect or respond to any tokens.
New Auto-Interp
Negative Logits
imized
-0.06
/>
-0.06
Psy
-0.06
izzly
-0.06
Butterfly
-0.06
_preferences
-0.06
eren
-0.06
kiện
-0.06
़ो
-0.06
महत
-0.06
POSITIVE LOGITS
Establishment
0.07
_rem
0.07
-Line
0.07
_prev
0.07
acquired
0.07
!',
0.06
_REST
0.06
provisions
0.06
emerges
0.06
Proc
0.06
Activations Density 0.003%