    Negative Logits
    Token           Logit
    " Hardy"        -0.08
    " بعد"          -0.07
    "$ar"           -0.07
    " президент"    -0.07
    "ţi"            -0.06
    "імі"           -0.06
    "ětš"           -0.06
    "ρά"            -0.06
    " menší"        -0.06
    "amedi"         -0.06
    Positive Logits
    Token           Logit
    " own"          0.16
    " Own"          0.14
    "Own"           0.10
    " OWN"          0.09
    " owns"         0.08
    "_own"          0.08
    "alcon"         0.08
    "pon"           0.07
    " opener"       0.07
    "ชน"            0.07
    Activation Density: 0.034%

    No Known Activations