INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.07
    1:0.06
    2:0.09
    3:0.09
    4:0.08
    5:0.08
    6:0.07
    7:0.09
    8:0.08
    9:0.08
    10:0.08
    11:0.08
    Negative Logits
     spec
    -1.92
     resear
    -1.90
     rods
    -1.83
     paintings
    -1.77
     brushes
    -1.76
     paints
    -1.73
     bart
    -1.72
     torches
    -1.71
     distribut
    -1.71
     discounts
    -1.69
    POSITIVE LOGITS
    BALL
    1.90
    facing
    1.87
    uberty
    1.79
    emies
    1.77
    cknow
    1.68
    achus
    1.68
    Warning
    1.67
    deen
    1.65
    onomous
    1.65
    asis
    1.64
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.