INDEX
Explanations
requests and actions related to making a change or taking initiative
Head Attr Weights
0:0.04
1:0.02
2:0.08
3:0.26
4:0.02
5:0.08
6:0.02
7:0.04
8:0.03
9:0.02
10:0.33
11:0.02
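The weights listed above can be read as one "head:weight" pair per line. The snippet below is a minimal sketch, not part of the original dashboard, that parses those pairs and picks out the heads with the largest attribution. The helper names `parse_head_weights` and `top_heads` are hypothetical, and the values are copied verbatim from the listing.

```python
# Head attribution weights copied from the listing above ("head:weight" per line).
RAW = """0:0.04
1:0.02
2:0.08
3:0.26
4:0.02
5:0.08
6:0.02
7:0.04
8:0.03
9:0.02
10:0.33
11:0.02"""


def parse_head_weights(text):
    """Turn 'head:weight' lines into a {head_index: weight} dict."""
    weights = {}
    for line in text.splitlines():
        head, value = line.split(":")
        weights[int(head)] = float(value)
    return weights


def top_heads(weights, k=2):
    """Return the k (head, weight) pairs with the largest weight."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)[:k]


weights = parse_head_weights(RAW)
dominant = top_heads(weights)
# Fraction of the listed total carried by the dominant heads.
share = sum(w for _, w in dominant) / sum(weights.values())
print(dominant, round(share, 2))
```

On these numbers, heads 10 and 3 together account for 0.59 of the listed total of 0.96. The total falls short of 1, presumably because the displayed values are rounded.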
Negative Logits
undet:-2.63
until:-2.14
underage:-2.06
uninterrupted:-2.02
lasts:-2.00
during:-1.97
unnoticed:-1.93
trapped:-1.92
untreated:-1.92
continuously:-1.89
Positive Logits
recons:2.94
reconsider:2.69
rethink:2.66
anew:2.56
rejo:2.47
rejoice:2.36
politic:2.25
rebirth:2.18
ocracy:2.11
ndum:2.09
Activations Density 0.039%