Explanations
proper names of politicians
mentions of specific individuals, particularly in a political context
Negative Logits
orate       -0.72
istically   -0.72
Viking      -0.67
orous       -0.66
Jericho     -0.65
Brave       -0.64
istic       -0.64
toggle      -0.64
olithic     -0.63
othy        -0.63
Positive Logits
Wasserman   1.15
Schultz     0.97
Tanz        0.92
ignt        0.81
wcs         0.74
Leaks       0.73
nodd        0.73
borough     0.72
sheets      0.72
engers      0.72
Activations Density 0.006%