INDEX
Explanations
This neuron activates on tokens that occur in Markdown-style image/link URLs—specifically the journal‐citation strings (abbreviations like “med,” “phys,” “lond,” and numeric identifiers) inside the parentheses of ``.
New Auto-Interp
Negative Logits
international
-0.07
uxe
-0.06
ucwords
-0.06
aprox
-0.06
.radius
-0.06
Naz
-0.06
Classifier
-0.06
Hawth
-0.06
synchronize
-0.06
Europeans
-0.06
POSITIVE LOGITS
#__
0.06
ска
0.06
amespace
0.06
unge
0.06
мали
0.06
AUSE
0.06
illum
0.06
ensible
0.06
incompet
0.06
fö
0.06
Activations Density 0.006%