INDEX
Explanations
This neuron activates on numeric tokens—especially non‐integer or decimal number values.
New Auto-Interp
Negative Logits
espect
-0.07
marital
-0.07
xFF
-0.07
ристи
-0.07
dips
-0.07
_preview
-0.06
fort
-0.06
неправиль
-0.06
.reject
-0.06
_rom
-0.06
POSITIVE LOGITS
WebHost
0.06
both
0.06
StartTime
0.06
the
0.06
evento
0.06
‐
0.06
426
0.06
hold
0.06
COD
0.06
gun
0.06
Activations Density 0.036%