INDEX
Explanations
political and news-related terms, especially those related to Democratic presidential candidates and political events
Negative Logits
rawdownloadcloneembedreportprint    -0.37
fres    -0.35
inver    -0.34
rys    -0.33
izo    -0.33
ayne    -0.32
izons    -0.32
ĸļ    -0.32
ilion    -0.32
avorite    -0.32
Positive Logits
Clintons    0.32
Gets    0.32
'    0.32
'[    0.32
Countdown    0.31
?'"    0.31
Beware    0.31
Doesn    0.31
poses    0.30
Bucket    0.30
Activations Density 8.737%