INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.08
    1:0.06
    2:0.10
    3:0.07
    4:0.07
    5:0.07
    6:0.08
    7:0.08
    8:0.08
    9:0.06
    10:0.09
    11:0.09
    Negative Logits
     porous      -1.81
     backing     -1.70
     lucky       -1.65
     sterling    -1.65
     ounce       -1.65
     buck        -1.64
     alarmed     -1.61
     lax         -1.59
     brute       -1.58
     bullish     -1.58
    Positive Logits
    thood        2.26
    translation  2.21
    world        1.96
    onomous      1.91
    [[           1.84
    sbm          1.84
    Race         1.81
    aut          1.80
    issions      1.79
    cone         1.78
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.