Explanations
references to government policies and their impacts
Head Attr Weights
0:0.02
1:0.01
2:0.14
3:0.35
4:0.09
5:0.04
6:0.02
7:0.04
8:0.04
9:0.08
10:0.07
11:0.04
Negative Logits
caveats        -1.62
ADRA           -1.53
advis          -1.53
antid          -1.48
hostilities    -1.44
DEBUG          -1.38
waivers        -1.35
Rockies        -1.34
exemptions     -1.33
redundancy     -1.33
Positive Logits
↵              2.08
·              1.95
'?             1.86
Posted         1.84
david          1.80
avg            1.80
               1.74
"?             1.72
ui             1.71
               1.71
Activations Density 0.420%