INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.06
2:0.09
3:0.08
4:0.09
5:0.08
6:0.07
7:0.07
8:0.07
9:0.07
10:0.09
11:0.09
Negative Logits
decriminal
-1.88
sanitation
-1.74
antibiotics
-1.66
privat
-1.66
rubbish
-1.62
staples
-1.61
detainees
-1.60
shel
-1.54
compromises
-1.54
netflix
-1.51
POSITIVE LOGITS
Beir
2.16
PAN
1.85
Manz
1.79
Pere
1.79
McA
1.78
Particip
1.75
Nath
1.74
Archangel
1.70
Gle
1.70
Ong
1.68
Activations Density 0.000%
No Known Activations
This feature has no known activations.