Explanations
references to political figures, particularly the president
Head Attr Weights
0:0.07
1:0.16
2:0.03
3:0.04
4:0.04
5:0.33
6:0.07
7:0.03
8:0.03
9:0.03
10:0.09
11:0.03
Negative Logits
Manit        -1.97
Shal         -1.91
swast        -1.84
[&           -1.83
stret        -1.80
,,,,         -1.78
Staten       -1.77
dotted       -1.75
Marketable   -1.75
ratios       -1.73
Positive Logits
older        2.56
runner       2.31
ee           2.24
ender        2.05
Party        2.04
mine         2.01
�            1.99
runners      1.99
aqu          1.98
aster        1.98
Activations Density 0.000%