INDEX
Explanations
science papers
The neuron fires on specialized scientific acronyms, model names, and proper‐noun labels (e.g. “FR I,” “BL Lac,” “SED,” etc.) in astrophysics texts.
New Auto-Interp
Negative Logits
NAND
-0.07
Shift
-0.06
intervals
-0.06
Vis
-0.06
catastrophic
-0.06
/tos
-0.06
Nacht
-0.06
िस
-0.06
採
-0.06
dro
-0.06
POSITIVE LOGITS
slova
0.07
신청
0.07
getPosition
0.06
ción
0.06
alarak
0.06
lang
0.06
třeba
0.06
lanma
0.06
RTVF
0.06
있어서
0.06
Activations Density 0.018%