INDEX
Explanations
names likely to refer to a specific person ('Susan')
occurrences of the name "Susan" in various contexts
Negative Logits
ORD            -0.76
iculty         -0.76
cffffcc        -0.72
unct           -0.68
compr          -0.66
unpredictable  -0.64
paced          -0.63
Constructed    -0.61
riter          -0.61
olkien         -0.61
Positive Logits
ne             1.01
anne           0.95
icide          0.92
Rice           0.90
otte           0.89
gha            0.85
Boyle          0.85
mary           0.84
Anne           0.83
jit            0.82
Activations Density: 0.034%