INDEX
Explanations
The neuron fires on descriptive words that emphasize physical speed or nimbleness (e.g. “quick,” “agile,” “dodging”), which suggests it detects adjectives, adverbs, and participles that highlight rapid, graceful movement.
Negative Logits
י�    -0.06
7    -0.06
629    -0.06
iterals    -0.06
movies    -0.06
595    -0.06
/km    -0.06
770    -0.06
wdx    -0.06
sélection    -0.06
Positive Logits
uphol    0.07
agility    0.07
фер    0.07
versatility    0.07
oping    0.06
πολυ    0.06
equip    0.06
نگی    0.06
pled    0.06
)animated    0.06
Activations Density 0.014%
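Activation density is usually read as the share of sampled token positions on which the neuron fires at all. The page does not state its exact definition, so the sketch below assumes "fires" means an activation strictly above a threshold of zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: density = (# positions where the neuron fires) / (# positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 50,000 token positions, 7 of which activate the neuron.
sample = [0.0] * 49_993 + [0.8, 1.2, 0.3, 2.1, 0.5, 0.9, 1.7]

density = activation_density(sample)
print(f"Activations Density {density:.3%}")
```

A density this low means the neuron is silent on the vast majority of tokens, which is consistent with a narrow explanation such as words about speed and nimbleness.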