INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.09
    1:0.06
    2:0.09
    3:0.08
    4:0.09
    5:0.08
    6:0.07
    7:0.07
    8:0.07
    9:0.07
    10:0.09
    11:0.09
    Negative Logits
     decriminal    -1.88
     sanitation    -1.74
     antibiotics   -1.66
     privat        -1.66
     rubbish       -1.62
     staples       -1.61
     detainees     -1.60
     shel          -1.54
     compromises   -1.54
    netflix        -1.51
    Positive Logits
     Beir          2.16
     PAN           1.85
     Manz          1.79
     Pere          1.79
     McA           1.78
     Particip      1.75
     Nath          1.74
     Archangel     1.70
     Gle           1.70
     Ong           1.68
    Act Density 0.000%

    This feature has no known activations.