INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.08
    2:0.07
    3:0.08
    4:0.07
    5:0.09
    6:0.08
    7:0.08
    8:0.07
    9:0.09
    10:0.08
    11:0.09
    Negative Logits
    ////////
    -3.04
     vend
    -2.69
     violates
    -2.67
    owe
    -2.65
    psons
    -2.54
    ////////////////
    -2.52
    iques
    -2.51
     paw
    -2.46
     sab
    -2.45
     sweats
    -2.44
    POSITIVE LOGITS
     Howell
    2.77
     Maid
    2.74
     Atkinson
    2.74
    ��
    2.71
    adiq
    2.63
     Giul
    2.58
     Hatt
    2.56
     Thames
    2.55
     Shepherd
    2.55
     Marian
    2.50
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.