INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.11
    2:0.09
    3:0.06
    4:0.08
    5:0.09
    6:0.07
    7:0.07
    8:0.09
    9:0.07
    10:0.07
    11:0.07
    Negative Logits
    ��
    -1.87
    duction
    -1.65
     visitation
    -1.63
    ה
    -1.60
    onto
    -1.59
    whatever
    -1.56
    uv
    -1.52
    ulation
    -1.51
    breakers
    -1.51
    ���
    -1.48
    POSITIVE LOGITS
    ially
    1.94
    lopp
    1.85
    icent
    1.78
    minist
    1.75
    osponsors
    1.72
    atform
    1.68
    liest
    1.67
    htaking
    1.64
    igm
    1.63
     PACK
    1.61
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.