INDEX
    Explanations

    possessive pronouns

    New Auto-Interp
    Negative Logits
     userType
    -0.06
    _DET
    -0.06
    _ERROR
    -0.06
     protector
    -0.06
     ورز
    -0.06
    .Phone
    -0.06
    <context
    -0.06
    ricular
    -0.06
     Tours
    -0.06
     бла
    -0.06
    POSITIVE LOGITS
    0.07
     Service
    0.06
     Υ
    0.06
     her
    0.06
    _matches
    0.06
     Tobias
    0.06
    ifth
    0.06
    -May
    0.06
    :s
    0.06
     service
    0.06
    Act Density 0.008%

    No Known Activations