INDEX
Explanations
references to decision-making or situational considerations
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.13
3:0.11
4:0.21
5:0.05
6:0.06
7:0.04
8:0.06
9:0.10
10:0.08
11:0.05
Negative Logits
Andrews
-1.22
Principal
-1.19
Hier
-1.19
Guinness
-1.18
Grant
-1.10
Haas
-1.10
Shea
-1.09
Colombian
-1.08
Circ
-1.08
Toro
-1.07
POSITIVE LOGITS
fml
1.77
ategories
1.55
rontal
1.51
claimer
1.50
etheless
1.49
CRIPTION
1.45
oldown
1.43
plugin
1.41
�
1.40
dylib
1.39
Activations Density 0.009%