INDEX
Explanations
This neuron detects mentions of the concept "focus" — appearing as that token (including in brand/product names) or in phrases about focusing, feedback, or focus groups.
New Auto-Interp
Negative Logits
snake
-0.08
CAT
-0.07
scraped
-0.07
implant
-0.07
imposs
-0.06
Tam
-0.06
translator
-0.06
ван
-0.06
Meat
-0.06
_equal
-0.06
POSITIVE LOGITS
Focus
0.14
focus
0.14
Focus
0.11
focus
0.10
focused
0.10
focusing
0.09
focuses
0.09
Focused
0.09
-focused
0.08
focal
0.08
Activations Density 0.032%