INDEX
Explanations
proper nouns, specifically names like "Spencer" and "Sterling."
the name "Spencer" and references to the name "Sterling"
New Auto-Interp
Negative Logits
arist
-0.75
juven
-0.70
LOAD
-0.62
trust
-0.62
catast
-0.61
advers
-0.60
usterity
-0.59
CFR
-0.58
avez
-0.58
burdens
-0.58
POSITIVE LOGITS
Tracy
0.90
Spencer
0.86
cil
0.83
itta
0.79
chell
0.77
ENCE
0.76
ville
0.75
onite
0.75
olson
0.73
otle
0.73
Activations Density 0.024%