INDEX
Explanations
key phrases and sentences that indicate important assumptions, decisions, or actions within governance and political contexts
New Auto-Interp
Head Attr Weights
0:0.02
1:0.12
2:0.08
3:0.04
4:0.03
5:0.09
6:0.08
7:0.08
8:0.22
9:0.04
10:0.07
11:0.06
Negative Logits
ecycle
-1.46
assic
-1.42
utterstock
-1.41
quart
-1.38
bowl
-1.30
ciation
-1.28
plet
-1.24
}.
-1.24
.�
-1.22
etimes
-1.21
POSITIVE LOGITS
"#
1.58
"…
1.44
Dear
1.33
Dear
1.30
olicited
1.26
"...
1.26
"'
1.21
Warning
1.17
RM
1.16
Congratulations
1.16
Activations Density 0.153%