INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.07
2:0.08
3:0.08
4:0.09
5:0.08
6:0.07
7:0.08
8:0.10
9:0.07
10:0.07
11:0.08
Negative Logits
Unloaded
-1.72
constitu
-1.68
unemploy
-1.64
Marketable
-1.62
minorities
-1.57
($)
-1.55
$$$$
-1.52
CoC
-1.51
profits
-1.51
moderate
-1.49
POSITIVE LOGITS
iott
1.83
Quest
1.67
Pats
1.64
Truth
1.63
aun
1.59
uron
1.57
Fair
1.55
Knot
1.55
iol
1.54
Keys
1.51
Activations Density 0.000%
No Known Activations
This feature has no known activations.