INDEX
Explanations
Explanation of neuron 4 behavior: this neuron does not respond to any token — it remains inactive and does not detect any pattern.
New Auto-Interp
Negative Logits
ویزی
-0.07
DCHECK
-0.06
홈
-0.06
misplaced
-0.06
ousted
-0.06
planners
-0.06
.cells
-0.06
Ди
-0.06
PAIR
-0.06
emain
-0.06
POSITIVE LOGITS
HomeComponent
0.06
simplify
0.06
(op
0.06
mystical
0.06
_USERS
0.06
_movement
0.06
-all
0.06
ص
0.06
OSC
0.06
раб
0.06
Activations Density 0.009%