INDEX
Explanations
references to a specific person, specifically "Sanders."
mentions of the name "Sanders."
New Auto-Interp
Negative Logits
ocaust
-0.87
ãĥ¼ãĥĨãĤ£
-0.78
obar
-0.77
tyr
-0.74
obe
-0.74
othy
-0.72
ãĥ¼ãĥĨ
-0.71
ogly
-0.71
————
-0.70
onut
-0.69
POSITIVE LOGITS
Sanders
1.12
Supporters
1.01
supporters
0.95
Sanders
0.92
Caucus
0.84
supporter
0.82
delegates
0.81
'
0.79
Bros
0.79
rade
0.77
Activations Density 0.020%