INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.07
    2:0.08
    3:0.09
    4:0.08
    5:0.08
    6:0.08
    7:0.08
    8:0.09
    9:0.07
    10:0.08
    11:0.08
    Negative Logits
     Badge
    -2.93
     Attend
    -2.89
     Songs
    -2.86
     Showdown
    -2.77
     Ticket
    -2.73
    azeera
    -2.71
     Redux
    -2.66
    ploma
    -2.65
    yet
    -2.56
     Admission
    -2.53
    POSITIVE LOGITS
    vati
    3.90
    2.86
     van
    2.58
     vans
    2.58
     clut
    2.53
    kil
    2.51
     ­
    2.46
    ertility
    2.45
    ][/
    2.42
     graz
    2.42
    Act Density 0.000%

    No Known Activations