Explanations
mentions of medical conditions or treatments related to patients
references to patients
Negative Logits
FLAG        -0.72
ģ«          -0.69
Politics    -0.67
GGGG        -0.63
POL         -0.62
shell       -0.62
Sharp       -0.61
BN          -0.61
ball        -0.61
sg          -0.60
Positive Logits
patients        1.12
Patients        1.01
ients           1.00
diagnosed       0.86
patient         0.83
ysis            0.79
iatrics         0.78
hospitalized    0.77
afflicted       0.76
smugglers       0.74
Activations Density 0.014%