INDEX
Explanations
negative sentiment or criticism
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.03
3:0.04
4:0.04
5:0.04
6:0.23
7:0.25
8:0.04
9:0.04
10:0.05
11:0.14
Negative Logits
thia
-1.82
ttes
-1.75
anchester
-1.66
ault
-1.59
omal
-1.56
etry
-1.53
Xin
-1.44
Camer
-1.41
quer
-1.40
forts
-1.40
POSITIVE LOGITS
ACTIONS
1.85
STRUCT
1.76
LESS
1.48
Decre
1.47
BRE
1.45
CAP
1.45
SAN
1.44
externalActionCode
1.44
Giving
1.41
judgement
1.40
Activations Density 0.000%