INDEX
Explanations
scientific research
terms related to biological interactions and environmental influences.
This neuron activates on numeric tokens—especially decimal measurements and statistical values (e.g., floating‐point numbers) in the text.
New Auto-Interp
Negative Logits
WHAT
-0.06
ent
-0.06
tmpl
-0.06
Algeria
-0.06
продолж
-0.06
Tpl
-0.06
_body
-0.06
Insp
-0.06
输入
-0.06
Subway
-0.06
POSITIVE LOGITS
internationally
0.07
elli
0.06
postup
0.06
Poly
0.06
snd
0.06
issement
0.06
empower
0.06
činnosti
0.06
0.06
OrDefault
0.06
Activations Density 0.162%