INDEX
Explanations
negative statements and expressions of doubt or refusal
New Auto-Interp
Head Attr Weights
0:0.12
1:0.21
2:0.04
3:0.05
4:0.02
5:0.13
6:0.04
7:0.03
8:0.12
9:0.07
10:0.04
11:0.07
Negative Logits
Monthly
-1.39
¶
-1.31
yip
-1.30
Ra
-1.28
>[
-1.27
resents
-1.26
Hig
-1.25
nesday
-1.25
↑
-1.23
circled
-1.22
POSITIVE LOGITS
pload
1.78
�
1.56
��
1.56
bish
1.54
autos
1.53
���
1.51
ufact
1.50
artment
1.49
okia
1.47
ulp
1.45
Activations Density 0.061%