INDEX
Explanations
specific phrases related to significant events or endorsements in a political context
New Auto-Interp
Head Attr Weights
0:0.08
1:0.02
2:0.04
3:0.03
4:0.04
5:0.03
6:0.28
7:0.05
8:0.07
9:0.27
10:0.01
11:0.03
Negative Logits
Alc
-3.69
LEG
-3.61
wing
-3.54
sac
-3.54
cad
-3.48
Emb
-3.46
hangar
-3.40
Spac
-3.38
Wing
-3.38
helic
-3.36
POSITIVE LOGITS
Murray
11.50
Murray
11.16
mine
4.37
Mine
4.26
Maria
4.25
Mining
4.05
Rooney
4.05
Ramirez
3.99
Ramsay
3.97
miners
3.91
Activations Density 0.008%