INDEX
Explanations
phrases related to a person named Morris
mentions of a specific individual named Morris
New Auto-Interp
Negative Logits
Hispanic
-0.72
choes
-0.71
mate
-0.70
sheet
-0.70
fare
-0.70
REAM
-0.67
ccording
-0.67
rium
-0.65
Instance
-0.64
lder
-0.64
POSITIVE LOGITS
sey
1.59
ison
0.97
Morris
0.97
usalem
0.85
rison
0.82
ons
0.79
olini
0.79
dale
0.77
aceutical
0.77
seys
0.77
Activations Density 0.017%