INDEX
    Explanations

    possessive pronouns

    New Auto-Interp
    Negative Logits
    dif
    -0.07
    _df
    -0.07
     pontos
    -0.06
     hop
    -0.06
     Perspective
    -0.06
     Pvt
    -0.06
    ائية
    -0.06
    .SDK
    -0.06
     없음
    -0.06
    77
    -0.06
    POSITIVE LOGITS
     dirig
    0.07
    's
    0.07
    0.06
     víde
    0.06
     ROUT
    0.06
    commons
    0.06
    ’s
    0.06
     their
    0.06
    0.06
    _IMP
    0.06
    Act Density 0.121%

    No Known Activations