INDEX
Explanations
mentions of political figures, particularly lawmakers
references to lawmakers and legislative actions
New Auto-Interp
Negative Logits
ãĤ¨ãĥ«
-0.92
phal
-0.72
âĶĢâĶĢâĶĢâĶĢ
-0.72
omorph
-0.71
Tur
-0.68
istic
-0.66
ļé
-0.64
Customer
-0.63
Royale
-0.63
Palest
-0.62
POSITIVE LOGITS
hips
1.16
hip
0.99
arians
0.94
ervatives
0.89
woman
0.84
unanimously
0.80
voted
0.78
committees
0.77
rieve
0.76
doms
0.76
Activations Density 0.040%