INDEX
Explanations
phrases that involve political or social controversy related to elections
Head Attr Weights
0:0.02
1:0.03
2:0.06
3:0.06
4:0.07
5:0.04
6:0.35
7:0.06
8:0.05
9:0.06
10:0.08
11:0.06
Negative Logits
phia        -1.40
Ro          -1.38
Doodle      -1.36
GD          -1.35
whatsoever  -1.32
FB          -1.31
Day         -1.29
plane       -1.28
};          -1.28
PRE         -1.27
Positive Logits
lished                   1.54
BuyableInstoreAndOnline  1.36
corrid                   1.36
romising                 1.28
ikuman                   1.23
separatist               1.22
tiss                     1.21
zman                     1.19
urst                     1.18
prosec                   1.17
Activations Density 0.002%