INDEX
Explanations
proper nouns or names, particularly the name "Liz"
New Auto-Interp
Negative Logits
sleeper
-0.85
sidx
-0.82
xual
-0.76
ierrez
-0.71
CLASSIFIED
-0.70
tesque
-0.68
icted
-0.68
convict
-0.68
PDATE
-0.67
icts
-0.67
POSITIVE LOGITS
Anne
0.90
Cheney
0.89
Anne
0.88
Lerner
0.86
Louise
0.84
anie
0.82
Bei
0.80
abel
0.80
otte
0.80
Nicole
0.79
Activations Density 0.040%