INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.07
2:0.08
3:0.08
4:0.08
5:0.07
6:0.07
7:0.08
8:0.08
9:0.09
10:0.07
11:0.08
Negative Logits
ngth
-2.30
packing
-2.07
maps
-1.81
ppings
-1.77
urgy
-1.66
required
-1.62
selage
-1.60
istries
-1.57
thood
-1.56
func
-1.55
POSITIVE LOGITS
unfair
1.68
false
1.60
discriminating
1.57
swing
1.53
sway
1.52
azo
1.50
spurious
1.48
prejud
1.48
fraudulent
1.47
favourable
1.47
Activations Density 0.000%
No Known Activations
This feature has no known activations.