INDEX
Explanations
mentions of a specific person named Bernie Sanders
references to Bernie Sanders
New Auto-Interp
Negative Logits
obar
-0.80
otropic
-0.80
ãĥ¼ãĥĨãĤ£
-0.79
tyr
-0.73
ebin
-0.71
perm
-0.71
odon
-0.70
obe
-0.68
conservancy
-0.68
omaly
-0.67
POSITIVE LOGITS
Sanders
1.37
Sanders
0.97
Supporters
0.95
Bernard
0.89
supporters
0.86
Bernie
0.83
Bernie
0.82
INTON
0.81
supporter
0.77
Bros
0.77
Activations Density 0.010%