    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.06
    1:0.10
    2:0.07
    3:0.08
    4:0.08
    5:0.08
    6:0.09
    7:0.08
    8:0.07
    9:0.08
    10:0.08
    11:0.07
    Negative Logits
     bench           -1.73
     CPC             -1.69
    GBT              -1.68
    aples            -1.67
    Vert             -1.65
     plurality       -1.60
    realDonaldTrump  -1.59
     Huntington      -1.58
     Oval            -1.57
    obl              -1.54
    Positive Logits
    actionDate       2.09
    raq              2.05
     rul             1.96
    ̶                1.85
    agara            1.81
     Sai             1.81
    ategory          1.74
    rir              1.71
    roo              1.68
    )</              1.66
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.