    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.06
    1:0.11
    2:0.10
    3:0.08
    4:0.08
    5:0.07
    6:0.08
    7:0.08
    8:0.07
    9:0.08
    10:0.07
    11:0.07
    Negative Logits
    " Irwin"         -1.66
    "alls"           -1.60
    " Nug"           -1.54
    " Accessed"      -1.54
    " Interior"      -1.53
    " Chim"          -1.53
    " Rober"         -1.48
    "lishing"        -1.46
    " Kemp"          -1.46
    " inherit"       -1.45
    Positive Logits
    " learners"      1.74
    "isite"          1.68
    "heid"           1.65
    " veter"         1.65
    " competence"    1.61
    "halla"          1.59
    " hemisphere"    1.58
    "geries"         1.57
    " lessons"       1.56
    " behavi"        1.55
    Act Density 0.000%

    No Known Activations