INDEX
Explanations
references to a specific person's name
mentions of the name "Wayne."
Negative Logits
rador      -1.20
yrinth     -1.05
gently     -0.96
alion      -0.79
inals      -0.77
gerald     -0.75
iffe       -0.75
inally     -0.74
avorite    -0.73
ateur      -0.71
Positive Logits
Gret         1.00
Rooney       0.98
Enterprises  0.92
Manor        0.84
Til          0.73
Swan         0.70
Neville      0.69
Lug          0.69
Brand        0.68
Sheldon      0.68
Activations Density: 0.026%