INDEX
Explanations
words related to medical institutions or healthcare settings
references to hospitals and hospital-related contexts
New Auto-Interp
Negative Logits
lass
-0.79
yip
-0.75
pas
-0.71
ozy
-0.71
artisan
-0.70
Helpful
-0.69
mire
-0.68
xual
-0.66
BOOK
-0.66
lyr
-0.65
POSITIVE LOGITS
ization
0.94
itals
0.94
patients
0.94
outpatient
0.89
NHS
0.88
Patients
0.88
ised
0.87
ity
0.86
patient
0.84
ilitating
0.84
Activations Density 0.048%