INDEX
Explanations
the name "Morris" with varying levels of specificity
references to the name "Morris."
Negative Logits
sheet      -0.76
fare       -0.67
mate       -0.67
sheets     -0.67
choes      -0.66
book       -0.65
REAM       -0.65
ccording   -0.64
politic    -0.61
ware       -0.60
Positive Logits
sey        1.77
ison       0.99
sein       0.92
olini      0.92
ons        0.91
seys       0.91
esi        0.91
hip        0.90
dale       0.87
beck       0.87
Activations Density 0.067%