INDEX
Explanations
diseases and medical concerns
This neuron is sensitive to medical-treatment terms and risk/benefit language (e.g. procedures like C-section or major surgery, medication names, and discussion of safety/harm).
New Auto-Interp
Negative Logits
simmer
-0.07
exposure
-0.07
Patrick
-0.06
purpos
-0.06
conscious
-0.06
Roger
-0.06
refurbished
-0.06
-blind
-0.06
Пар
-0.06
comeback
-0.06
POSITIVE LOGITS
));↵↵↵
0.07
_VALIDATE
0.07
ooky
0.07
bitk
0.07
=");↵
0.07
Abilities
0.07
елефон
0.06
Yok
0.06
pwd
0.06
);↵↵↵↵↵
0.06
Activations Density 0.137%