INDEX
Explanations
This neuron is essentially inactive—it does not consistently respond to any meaningful words or patterns.
New Auto-Interp
Negative Logits
LOC
-0.08
IDS
-0.08
filing
-0.08
DP
-0.07
_DSP
-0.07
QUAL
-0.07
economists
-0.07
ENT
-0.07
biodiversity
-0.06
fuse
-0.06
POSITIVE LOGITS
Sistem
0.07
رای
0.07
غر
0.06
heals
0.06
licken
0.06
payments
0.06
xong
0.06
pla
0.06
_wrap
0.06
ーバ
0.06
Activations Density 0.434%