INDEX
Explanations
mentions of specific names
repeated mentions of certain names or proper nouns
New Auto-Interp
Negative Logits
nown
-0.94
undo
-0.86
orius
-0.86
imov
-0.85
alach
-0.84
arak
-0.84
pmwiki
-0.84
umatic
-0.83
perm
-0.81
llular
-0.81
POSITIVE LOGITS
Jenner
1.26
Kardashian
1.05
Reynolds
1.03
Dunn
1.01
Lynn
1.00
Michaels
1.00
McKenna
0.99
Duff
0.99
Collins
0.97
McCann
0.97
Activations Density 0.101%