Explanations
The neuron responds to standalone numeric literals (tokens composed of digits or simple numbers).
Negative Logits
Token                Logit
ूह                   -0.07
daughter             -0.07
_State               -0.07
sermon               -0.07
(token not shown)    -0.06
characters           -0.06
(ht                  -0.06
ứt                   -0.06
(token not shown)    -0.06
다면                 -0.06
Positive Logits
Token                Logit
[start               0.06
locate               0.06
роботи               0.06
.work                0.06
раза                 0.06
interiors            0.06
onstage              0.06
fisse                0.06
.sum                 0.06
atte                 0.06
Activations Density 0.007%