INDEX
Explanations
encouragement
This neuron activates on imperative action verbs offering advice or instructions.
New Auto-Interp
Negative Logits
Supply
-0.07
apist
-0.07
CNC
-0.06
基地
-0.06
Muss
-0.06
stationary
-0.06
Poster
-0.06
poj
-0.06
exploration
-0.06
声明
-0.06
POSITIVE LOGITS
defs
0.06
değiş
0.06
類
0.06
^[
0.06
keit
0.06
regnum
0.06
브
0.06
*p
0.06
$__
0.06
Surrey
0.06
Activations Density 0.080%