INDEX
Explanations
mentions of "physician"
references to physicians and their assessments of health
New Auto-Interp
Negative Logits
urat
-0.87
yip
-0.85
yrinth
-0.77
skirts
-0.75
footed
-0.74
eworld
-0.74
runners
-0.72
nings
-0.72
eat
-0.72
ptions
-0.70
POSITIVE LOGITS
practitioner
1.00
physician
1.00
physicians
0.84
examiner
0.84
prescribing
0.83
prescriptions
0.82
psychiatrist
0.81
doctor
0.81
prescription
0.80
specializing
0.79
Activations Density 0.015%