INDEX
Explanations
The neuron fires on phrases referring to elderly people (e.g., “old ladies,” “hearing aids”).
New Auto-Interp
Negative Logits
.feed
-0.07
along
-0.06
`:
-0.06
.xz
-0.06
parser
-0.06
[P
-0.06
>*/↵
-0.06
scopy
-0.06
附
-0.06
electrónico
-0.06
POSITIVE LOGITS
Tak
0.07
blah
0.06
prejud
0.06
vysvět
0.06
ζα
0.06
stdafx
0.06
0.06
サー
0.06
Promo
0.06
_games
0.06
Activations Density 0.010%