INDEX
Explanations
actor and celebrity names, especially those named Kevin
Negative Logits
schild    -1.09
yrinth    -1.01
recomm    -1.01
pmwiki    -0.93
¥µ        -0.93
atari     -0.91
ledged    -0.91
Ö¼        -0.90
MENTS     -0.87
bler      -0.87
Positive Logits
Rudd      1.20
arios     1.13
Durant    1.08
Harris    1.06
Dunn      1.06
Bacon     1.04
istic     1.04
essen     1.03
ists      1.02
aceous    1.02
Activations Density: 1.725%