INDEX
Explanations
references to government actions and political commitments
New Auto-Interp
Head Attr Weights
0:0.05
1:0.02
2:0.10
3:0.33
4:0.02
5:0.13
6:0.02
7:0.05
8:0.04
9:0.02
10:0.13
11:0.02
Negative Logits
Illust
-2.08
Recorded
-1.96
(@
-1.95
Subtle
-1.94
Illustrated
-1.91
gif
-1.88
Reporter
-1.82
pmwiki
-1.77
quot
-1.77
Typ
-1.76
POSITIVE LOGITS
intention
2.88
intentions
2.75
consent
2.61
intends
2.61
plans
2.56
intend
2.42
preferences
2.27
reservations
2.24
regrets
2.15
unwillingness
2.13
Activations Density 0.266%