INDEX
Explanations
the names "Donald Trump" and "John Trump."
names of prominent political figures and candidates
Negative Logits
Token      Logit
--------   -0.58
sov        -0.58
eric       -0.57
Polo       -0.54
ngth       -0.53
onwards    -0.52
vous       -0.51
Clubs      -0.50
cens       -0.50
chy        -0.50
Positive Logits
Token      Logit
Diane      0.53
Shift      0.53
Jes        0.52
Dean       0.51
alcohol    0.51
EPA        0.51
Shield     0.50
apolis     0.50
ator       0.50
avanaugh   0.48
Activations Density 0.035%