Explanations
Code/errors
The neuron activates on numeric tokens containing a decimal point.
Technical acronyms and CamelCase-style identifiers (all-caps abbreviations, versioned tokens, and code-like terms) in scientific or programming contexts.
Negative Logits
Token           Logit
.Dialog         -0.07
ηγ              -0.07
Achievement     -0.06
Chad            -0.06
.linear         -0.06
Dy              -0.06
자동            -0.06
_skip           -0.06
requirements    -0.06
Добав           -0.06
Positive Logits
Token           Logit
cff             0.06
炸              0.06
cpp             0.06
engraved        0.06
impactful       0.06
ensch           0.06
ecut            0.06
ुं              0.06
代              0.06
licting         0.06
Activations Density: 0.868%
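For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. It assumes that density means the fraction of token positions on which the neuron's activation is above zero; that definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all illustrative assumptions and do not come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of tokens on which the neuron fires at all.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical activations for 1000 token positions: 9 fire, 991 stay at zero.
sample = [1.5] * 9 + [0.0] * 991
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.900%
```

Under this reading, a density of 0.868% would mean the neuron is active on roughly 9 of every 1000 tokens in the sampled text.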