INDEX
Explanations
reporting verbs that indicate statements or confirmations
New Auto-Interp
Head Attr Weights
0:0.04
1:0.02
2:0.19
3:0.05
4:0.05
5:0.05
6:0.04
7:0.05
8:0.06
9:0.05
10:0.27
11:0.08
Negative Logits
repeal
-1.69
Debate
-1.68
debates
-1.66
treaties
-1.60
legisl
-1.59
repealing
-1.59
Politics
-1.57
controversies
-1.55
politics
-1.52
requ
-1.52
POSITIVE LOGITS
arten
1.84
Bagg
1.84
alerted
1.68
earcher
1.63
llah
1.56
atis
1.51
edi
1.49
evacuated
1.49
scan
1.49
scans
1.48
Activations Density 0.001%