INDEX
Explanations
mentions of political debates
references to political debates
Negative Logits
Token      Logit
reated     -0.78
ridges     -0.68
uras       -0.68
icidal     -0.67
relative   -0.66
reads      -0.66
ocument    -0.63
ruce       -0.63
inness     -0.63
YE         -0.62
Positive Logits
Token         Logit
halla         0.91
moderators    0.85
transcripts   0.82
venue         0.79
transcript    0.78
nas           0.77
prep          0.75
contestant    0.74
airs          0.72
Fallon        0.71
Activations Density: 0.031%