INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.07
    1:0.08
    2:0.08
    3:0.08
    4:0.09
    5:0.09
    6:0.08
    7:0.06
    8:0.08
    9:0.08
    10:0.08
    11:0.08
    Negative Logits
    isSpecialOrderable
    -3.20
    -3.15
    quished
    -3.10
    caps
    -3.00
     Neph
    -2.77
    -2.74
    bath
    -2.74
    ubis
    -2.68
     Seraph
    -2.68
    Sword
    -2.68
    POSITIVE LOGITS
     collaborator
    2.93
     formulation
    2.82
     tune
    2.61
     patiently
    2.60
     Rove
    2.53
    eton
    2.51
     radio
    2.51
     tailor
    2.51
     unanim
    2.50
     collaborators
    2.47
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.