INDEX
Explanations
This neuron detects phrases expressing a personal proposal or intention—particularly first-person (“I would like to…,” “I believe we could…”) statements used when making a request or offering a collaboration.
New Auto-Interp
Negative Logits
튼
-0.08
[max
-0.06
Remark
-0.06
RCS
-0.06
Custom
-0.06
Stage
-0.06
WK
-0.06
minden
-0.06
ohen
-0.05
_log
-0.05
POSITIVE LOGITS
">';↵
0.07
istické
0.07
*>(&
0.07
#↵↵
0.07
());↵
0.06
crud
0.06
uers
0.06
strugg
0.06
);↵↵↵
0.06
:↵↵↵
0.06
Activations Density 0.039%