    Explanations
    No Explanations Found
    Head Attribution Weights
    Head 0:  0.09
    Head 1:  0.07
    Head 2:  0.08
    Head 3:  0.08
    Head 4:  0.08
    Head 5:  0.07
    Head 6:  0.09
    Head 7:  0.08
    Head 8:  0.08
    Head 9:  0.08
    Head 10: 0.07
    Head 11: 0.07
    Negative Logits
    " aval"       -3.42
    " tigers"     -2.92
    " gymn"       -2.86
    " skelet"     -2.85
    " buffalo"    -2.72
    "inosaur"     -2.71
    " welf"       -2.66
    " athlet"     -2.62
    "riad"        -2.61
    " roy"        -2.59
    Positive Logits
    "ONSORED"     3.13
    " Bridges"    2.88
    " Vessel"     2.62
    " Tone"       2.60
    " Lane"       2.48
    " Trigger"    2.48
    " Trap"       2.46
    " Hanson"     2.45
    " LINK"       2.45
    "forth"       2.44
    Activation Density: 0.000%

    No Known Activations

    This feature has no known activations.