INDEX
Explanations
proper nouns, specifically names of individuals
mentions of the name "Stewart."
New Auto-Interp
Negative Logits
cles
-0.86
BLE
-0.80
orescent
-0.80
ntil
-0.79
lesiastical
-0.76
cle
-0.74
ilar
-0.74
duction
-0.72
fortune
-0.71
ptive
-0.70
POSITIVE LOGITS
Stewart
1.04
Olsen
0.87
yard
0.78
sey
0.75
Orn
0.72
ieri
0.71
ie
0.71
Downing
0.69
Reed
0.69
Rhodes
0.69
Activations Density 0.018%