Explanations
outreach
The neuron specifically detects occurrences of the word “outreach,” especially in the phrase “community outreach.”
Negative Logits
Token        Logit
ilen         -0.07
Held         -0.07
状態         -0.07
Forbidden    -0.06
FUN          -0.06
风险         -0.06
jehož        -0.06
dividend     -0.06
homes        -0.06
��드         -0.06
Positive Logits
Token        Logit
outreach     0.09
Outreach     0.07
($__         0.07
предназнач   0.07
.ct          0.07
NB           0.07
Schwe        0.06
Spiele       0.06
_ml          0.06
Sentinel     0.06
Activations Density 0.010%
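For readers unfamiliar with the density figure, the following is a minimal sketch of how such a number could be computed. It assumes that density means the fraction of sampled tokens on which the neuron's activation exceeds a threshold. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled tokens on which the
    neuron fires at all; the dashboard's exact definition may differ.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the neuron fires on 1 token out of 10,000.
sample = [0.0] * 9999 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")  # 0.010%
```

Under this reading, a density of 0.010% means the neuron is active on roughly one token in ten thousand, which fits a feature tied to a single uncommon word such as “outreach.”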