INDEX
Explanations
This neuron responds to domain‐specific nouns describing a company’s core operations or commitments (e.g., “production,” “environment,” etc.).
New Auto-Interp
Negative Logits
nasty
-0.08
ruku
-0.07
,把
-0.06
holding
-0.06
Sour
-0.06
Tür
-0.06
.col
-0.06
شب
-0.06
peeled
-0.06
найкра
-0.06
POSITIVE LOGITS
tunnels
0.06
Come
0.06
.sam
0.06
MERCHANTABILITY
0.06
!↵↵↵↵↵↵
0.06
Howard
0.06
Geneva
0.06
prematurely
0.06
(土
0.06
phenomena
0.06
Activations Density 0.038%