INDEX
Explanations
The neuron selectively activates on domain-specific jargon or technical nouns (e.g. “growth,” “mechanism”) rather than common function words.
New Auto-Interp
Negative Logits
_matching
-0.07
.Custom
-0.07
climbs
-0.06
NAV
-0.06
при
-0.06
devices
-0.06
_best
-0.06
=com
-0.06
tat
-0.06
LOCITY
-0.06
POSITIVE LOGITS
využí
0.07
BCHP
0.06
bf
0.06
页面存档备份
0.06
chvíli
0.06
删除成功
0.06
(-
0.06
文章
0.06
uma
0.06
688
0.06
Activations Density 0.031%