INDEX
Explanations
The neuron activates on numerical tokens formatted as decimal (floating-point) numbers.
New Auto-Interp
Negative Logits
nge
-0.06
۱۱
-0.06
fairy
-0.06
veral
-0.06
ображ
-0.06
eleven
-0.06
Experimental
-0.06
xe
-0.06
_disk
-0.06
math
-0.06
POSITIVE LOGITS
kosher
0.07
caption
0.06
modulo
0.06
mens
0.06
assignable
0.06
transformer
0.06
named
0.06
是一个
0.06
nacional
0.06
شر
0.06
Activations Density 0.034%