INDEX
Explanations
references to political parties, with a focus on the term "Liberal"
references to the Liberal political party
New Auto-Interp
Negative Logits
bler
-0.85
Atari
-0.70
enance
-0.68
Redditor
-0.68
olkien
-0.68
schild
-0.67
IDER
-0.66
ansom
-0.64
outer
-0.63
oxide
-0.63
POSITIVE LOGITS
Party
0.92
ism
0.89
Democrat
0.82
caucus
0.80
stronghold
0.78
Nadu
0.78
MPs
0.76
senator
0.76
Gazette
0.74
MP
0.74
Activations Density 0.020%