INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     edilmiştir
    -0.07
    하면서
    -0.06
    ContentType
    -0.06
    ْ
    -0.06
    [prop
    -0.06
    oses
    -0.06
     filles
    -0.06
     Phelps
    -0.06
     oz
    -0.06
    ey
    -0.06
    POSITIVE LOGITS
    .cms
    0.06
     oppressive
    0.06
     awaits
    0.06
     noted
    0.06
     scarc
    0.06
     approved
    0.06
     justice
    0.06
    _ud
    0.06
    .exceptions
    0.06
    Southern
    0.06
    Act Density 0.095%

    No Known Activations