    Negative Logits
    Token         Logit
    "AFP"         -0.07
                  -0.06
    " определ"    -0.06
    " tedavi"     -0.06
    " Azerbai"    -0.06
    " olun"       -0.06
    " Take"       -0.06
    " refl"       -0.06
    "YO"          -0.06
    "でも"        -0.06
    Positive Logits
    Token         Logit
    "isinin"      0.10
    " esteem"     0.07
    "term"        0.07
    "SERVICE"     0.07
    " aud"        0.06
    "over"        0.06
    "-names"      0.06
    " pine"       0.06
    "ниц"         0.06
    " bottles"    0.06
    Activation Density: 0.000%

    No Known Activations