INDEX
Explanations
proper names, particularly the name "Andrew"
New Auto-Interp
Negative Logits
pmwiki
-0.94
naire
-0.83
ioned
-0.82
kies
-0.80
leaders
-0.80
hips
-0.80
stice
-0.78
PLA
-0.76
vana
-0.74
doors
-0.73
POSITIVE LOGITS
Mellon
0.86
Sachs
0.85
Ng
0.84
Dice
0.83
Gerr
0.82
McMahon
0.82
Wiggins
0.82
Morton
0.80
Jackson
0.79
McCabe
0.79
Activations Density 0.014%