INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.09
3:0.07
4:0.09
5:0.08
6:0.06
7:0.07
8:0.08
9:0.08
10:0.08
11:0.08
Negative Logits
reth
-2.80
rates
-2.51
comply
-2.50
�士
-2.45
explosives
-2.39
ingred
-2.38
scen
-2.37
gypt
-2.36
cyl
-2.35
Weapons
-2.31
POSITIVE LOGITS
Santorum
3.19
edin
2.79
AMA
2.72
Libertarian
2.70
Maher
2.63
Rubin
2.61
Kaepernick
2.60
Krugman
2.60
Akin
2.55
["
2.54
Activations Density 0.000%
No Known Activations
This feature has no known activations.