INDEX
Explanations
This neuron activates on numeric tokens (digits and numbers, including decimal‐formatted values).
New Auto-Interp
Negative Logits
henüz
-0.07
gz
-0.07
↵
-0.07
лю
-0.07
Qu
-0.07
desperately
-0.07
.eng
-0.06
Qu
-0.06
.lon
-0.06
_TIM
-0.06
POSITIVE LOGITS
Doctrine
0.07
Transfer
0.06
Characters
0.06
_advance
0.06
enza
0.06
_invite
0.06
prejudice
0.06
zz
0.06
atican
0.06
ymoon
0.06
Activations Density 0.245%