INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.07
    1:0.07
    2:0.09
    3:0.07
    4:0.07
    5:0.08
    6:0.09
    7:0.08
    8:0.09
    9:0.08
    10:0.07
    11:0.08
    Negative Logits
    Token          Logit
    "adelphia"     -2.77
    " gren"        -2.74
    " destro"      -2.60
    " felon"       -2.50
    "rimp"         -2.46
    " greed"       -2.43
    " scorp"       -2.39
    "usterity"     -2.39
    "uncle"        -2.37
    " blade"       -2.30
    Positive Logits
    Token          Logit
    " Communities"  2.85
    " Cy"           2.84
    " Ev"           2.75
    " Cong"         2.74
    " Mit"          2.71
    " Diet"         2.63
    " Ples"         2.57
    " Met"          2.57
    " Latvia"       2.56
    " Athe"         2.56
    Activation Density: 0.000%

    No Known Activations

    This feature has no known activations.