INDEX
    Explanations

    the presence of possessive and related terms that indicate ownership or control

    New Auto-Interp
    Negative Logits
    ëĭ
    -0.15
    ÑĢаÑĤно
    -0.15
    ojis
    -0.14
    isi
    -0.14
    ARGIN
    -0.14
    acman
    -0.14
    ëıĮ
    -0.14
    atsapp
    -0.14
    ensibly
    -0.14
    ools
    -0.14
    POSITIVE LOGITS
    acher
    0.15
    sher
    0.14
     Kit
    0.14
    orsk
    0.14
     directions
    0.14
     Brief
    0.14
    ike
    0.14
    Brief
    0.13
    WithOptions
    0.13
    ãĤ¤ãĤº
    0.13
    Act Density 0.003%

    No Known Activations