INDEX
Explanations
references to responsibility and accountability in various contexts
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.07
3:0.05
4:0.08
5:0.03
6:0.05
7:0.45
8:0.03
9:0.04
10:0.06
11:0.07
Negative Logits
imity
-1.73
sidx
-1.63
upt
-1.62
ype
-1.55
mania
-1.53
Boo
-1.48
upuncture
-1.47
gran
-1.45
Suite
-1.44
ixie
-1.42
POSITIVE LOGITS
safegu
1.86
stewards
1.74
overseeing
1.72
mishand
1.71
sacrific
1.70
unpaid
1.66
responsibilities
1.60
directing
1.59
managing
1.58
risk
1.57
Activations Density 0.018%