    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.07
    1:0.06
    2:0.08
    3:0.09
    4:0.10
    5:0.07
    6:0.08
    7:0.08
    8:0.10
    9:0.07
    10:0.07
    11:0.08
    Negative Logits
    " busted"      -1.59
    " Berks"       -1.58
    " nas"         -1.55
    " Neighbor"    -1.52
    " Squirrel"    -1.49
    " scalp"       -1.47
    " snapped"     -1.45
    " Kev"         -1.45
    " BS"          -1.43
    " nasal"       -1.42
    Positive Logits
    "Reviewer"     2.50
    ":]"           1.99
    "etus"         1.90
    "VIEW"         1.89
    "view"         1.79
    "llan"         1.77
    "イト"         1.69
    "omo"          1.67
    "edit"         1.59
    "isen"         1.59
    Act Density 0.000%

    This feature has no known activations.