INDEX
Explanations
names, especially the name "Porter"
references to a specific individual named Porter
Negative Logits
ership     -0.93
iliation   -0.76
merce      -0.75
orses      -0.72
ipeg       -0.71
ptoms      -0.70
ussions    -0.69
iencies    -0.67
ieving     -0.64
inosaur    -0.64
Positive Logits
Wilkinson  0.83
Porter     0.83
field      0.79
obar       0.76
hurst      0.76
head       0.75
lake       0.72
ley        0.71
weather    0.71
ando       0.70
Activations Density: 0.053%