INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.08
    1:0.08
    2:0.09
    3:0.08
    4:0.07
    5:0.07
    6:0.08
    7:0.10
    8:0.07
    9:0.06
    10:0.08
    11:0.08
    Negative Logits
     Cosponsors    -1.89
     Jinn          -1.77
    ||             -1.73
     Skywalker     -1.72
     QC            -1.67
     Nielsen       -1.64
     Immunity      -1.64
    $$$$           -1.63
     Schumer       -1.63
     Superman      -1.63
    Positive Logits
    aughters       2.03
    ongyang        1.90
    orthy          1.80
    sterdam        1.78
    iffe           1.74
    erva           1.71
    iaries         1.69
    lict           1.68
    itiz           1.68
     Franch        1.66
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.