    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.08
    1:0.08
    2:0.09
    3:0.06
    4:0.08
    5:0.08
    6:0.07
    7:0.08
    8:0.08
    9:0.08
    10:0.08
    11:0.08
    Negative Logits
    " sponge"       -2.98
    " Sponge"       -2.72
    " Spoon"        -2.66
    " astronaut"    -2.57
    " Sai"          -2.56
    " Dow"          -2.52
    " Brune"        -2.45
    " Columbia"     -2.40
    " Stones"       -2.39
    (blank)         -2.39
    Positive Logits
    "reon"            3.12
    "Els"             3.05
    "soDeliveryDate"  2.88
    "habi"            2.86
    "yip"             2.85
    "vez"             2.75
    "rez"             2.70
    "Revolution"      2.57
    "enfranch"        2.57
    " fuzz"           2.54
    Act Density 0.000%

    This feature has no known activations.