INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.07
    2:0.09
    3:0.08
    4:0.08
    5:0.08
    6:0.09
    7:0.07
    8:0.09
    9:0.07
    10:0.07
    11:0.07
    Negative Logits
     Grassley
    -1.81
    ibly
    -1.75
     compromise
    -1.73
    enegger
    -1.71
     lobb
    -1.62
     denomin
    -1.60
     pursu
    -1.58
     Burke
    -1.58
     consumers
    -1.58
     Lauder
    -1.54
    POSITIVE LOGITS
    ipedia
    2.09
    duration
    2.08
    prison
    1.93
    rys
    1.88
    info
    1.81
    ilk
    1.78
    intern
    1.71
    num
    1.68
    1001
    1.68
    tro
    1.68
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.